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Series preface 


This international series covers all aspects of 
theoretical and applied optics and optoelectronics. 
Active since 1986, eminent authors have long been 
choosing to publish with this series, and it is now 
established as a premier forum for high-impact 
monographs and textbooks. The editors are proud 
of the breadth and depth showcased by published 
works, with levels ranging from advanced under- 
graduate and graduate student texts to professional 
references. Topics addressed are both cutting edge 
and fundamental, basic science and applications- 
oriented, on subject matter that includes lasers, 


photonic devices, nonlinear optics, interferom- 
etry, waves, crystals, optical materials, biomedical 
optics, optical tweezers, optical metrology, solid- 
state lighting, nanophotonics, and silicon photon- 
ics. Readers of the series are students, scientists, 
and engineers working in optics, optoelectronics, 
and related fields in the industry. 

Proposals for new volumes in the series may be 
directed to Lu Han, executive editor at CRC Press, 
Taylor & Francis Group (lu.han@taylorandfrancis. 
com). 
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Introduction to the Second Edition 


There have been many detailed technological changes 
since the first edition of the Handbook in 2006, with 
the most dramatic changes can be seen from the far 
more widespread applications of the technology. To 
reflect this, our new revision has a completely new 
Volume II focused on applications and covering 
many case studies from an ever-increasing range of 
possible topics. Even as recently as 2006, the high 
cost or poorer performance of many optoelectronics 
components was still holding back many develop- 
ments, but now the cost of many high-spec com- 
ponents, particularly ones such as light-emitting 
diodes (LEDs), lasers, solar cells, and other optical 
detectors, optoelectronic displays, optical fibers, 
and components, including optical amplifiers, has 
reduced to such an extent that they are now finding 
a place in all aspects of our lives. Solid-state opto- 
electronics now dominates lighting technology and 
is starting to dominate many other key areas such as 
power generation. It is revolutionizing our transport 
by helping to guide fully autonomous vehicles, and 
CCTV cameras and optoelectronic displays are seen 
everywhere we go. 

In addition to the widespread applications 
now routinely using optoelectronic components, 


since 2006 we have witnessed growth of various 
fundamentally new directions of optoelectronics 
research and likely new component technologies 
for the near future. One of the most significant new 
areas of activity has been in nano-optoelectronics; 
the use of nanotechnology science, procedures, and 
processes to create ultraminiature devices across 
all of the optoelectronics domain: laser and LED 
sources, optical modulators, photon detectors, and 
solar cell technology. Two new chapters on silicon 
photonics and nanophotonics and graphene opto- 
electronics attempt to cover the wide range of nan- 
otechnology developments in optoelectronics this 
past decade. It will, however, be a few years before 
the scale-up to volume-manufacturing of nano- 
based devices becomes an economically feasible 
reality, but there is much promise for new genera- 
tions of optoelectronic technologies to come soon. 
Original chapters of the first edition have been 
revised and brought up-to-date for the second edi- 
tion, mostly by the original authors, but in some cases 
by new authors, to whom we are especially grateful. 
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Introduction to the First Edition 


Optoelectronics is a remarkably broad scientific 
and technological field that supports a multibillion 
US dollar per annum global industry, employing 
tens of thousands of scientists and engineers. The 
optoelectronics industry is one of the great global 
businesses of our time. 

In this Handbook, we have aimed to produce 
a book that is not just a text containing theo- 
retically sound physics and electronics coverage, 
nor just a practical engineering handbook, but 
a text designed to be strong in both these areas. 
We believe that, with the combined assistance of 
many world experts, we have succeeded in achiev- 
ing this very difficult aim. The structure and con- 
tents of this Handbook have proved fascinating to 
assemble, using this input from so many leading 
practitioners of the science, technology, and art of 
optoelectronics. 

Today’s optical telecommunications, display, 
and illumination technologies rely heavily on 
optoelectronic components: laser diodes, LEDs, 
liquid crystal, and plasma screen displays, etc. In 
today’s world, it is virtually impossible to find a 
piece of electrical equipment that does not employ 
optoelectronic devices as a basic necessity—from 
CD and DVD players to televisions, from automo- 
biles and aircraft to medical diagnostic facilities 
in hospitals and telephones, from satellites and 
space-borne missions to underwater exploration 
systems—the list is almost endless. Optoelectronics 
is in virtually every home and business office in 
the developed modern world, in telephones, fax 
machines, photocopiers, computers, and lighting. 

“Optoelectronics” is not precisely defined in 
the literature. In this Handbook , we have covered 
not only optoelectronics as a subject concerning 
devices and systems that are essentially electronic 
in nature, yet involve light (such as the laser diode), 


but we have also covered closely related areas of 
electro-optics, involving devices that are essen- 
tially optical in nature but involve electronics (such 
as crystal light-modulators). 

To provide firm foundations, this Handbook 
opens with a section covering “Basic Concepts.” 
The “Introduction” is followed immediately by a 
chapter concerning “Materials,” for it is through 
the development and application of new materials 
and their special properties that the whole busi- 
ness of optoelectronic science and technology now 
advances. Many optoelectronic systems still rely on 
conventional light sources rather than semiconduc- 
tor sources, so we cover these in the third chapter, 
leaving semiconductor matters to a later section. 
The detection of light is fundamental to many 
optoelectronic systems, as are optical waveguides, 
amplifiers, and lasers, so we cover these in the 
remaining chapters of the Basic Concepts section. 

The “Advanced Concepts” section focuses 
on three areas that will be useful to some of our 
intended audience, both now, in advanced optics 
and photometry, and now and increasingly in 
the future concerning nonlinear and short-pulse 
effects. 

“Optoelectronics Devices and Techniques” is 
a core foundation section for this Handbook, as 
today’s optoelectronics business relies heavily on 
such knowledge. We have attempted to cover all 
the main areas of semiconductor optoelectronics 
devices and materials in the eleven chapters in this 
section, from LEDs and lasers of great variety to 
fibers, modulators, and amplifiers. Ultrafast and 
integrated devices are increasingly important, 
as are organic electroluminescent devices and 
photonic bandgap and crystal fibers. Artificially 
engineered materials provide a rich source of pos- 
sibility for next-generation optoelectronic devices. 


xi 


xii_ Introduction to the First Edition 


At this point, the Handbook “changes gear”— 
and we move from the wealth of devices now 
available to us—to how they are used in some 
of the most important optoelectronic systems 
available today. We start with a section covering 
“Communication,” for this is how the developed 
world talks and communicates by Internet and 
email today—we are all now heavily dependent 
on optoelectronics. Central to such optoelec- 
tronic systems are transmission, network architec- 
ture, switching, and multiplex architectures—the 
focus of our chapters here. In communication, we 
already have a multi-tens-of-billions-of-dollars- 
per-annum industry today. 

“Imaging and displays” is the other indus- 
try measured in the tens of billions of dollars per 
annum range at the present time. We deal here 
with most if not all of the range of optoelectronic 
techniques used today from cameras, vacuum and 
plasma displays to liquid crystal displays and light 
modulators, from electroluminescent displays 
and exciting new three-dimensional display tech- 
nologies just entering the market place in mobile 
telephone and laptop computer displays to the 
very different application areas of scanning and 
printing. 

“Sensing and Data Processing” is a growing 
area of optoelectronics that is becoming increas- 
ingly important—from non-invasive _ patient 
measurements in hospitals to remote sensing in 
nuclear power stations and aircraft. At the heart of 
many of today’s sensing capabilities is the business 
of optical fiber sensing, so we begin this section of 
the Handbook there, before delving into remote 
optical sensing and military systems (at an un- 
classified level—for here-in lies a problem for this 
Handbook—that much of the current development 
and capability in military optoelectronics is classi- 
fied and unpublishable because of its strategic and 
operational importance). Optical information stor- 
age and recovery is already a huge global industry 
supporting the computer and media industries in 
particular; optical information processing shows 
promise but has yet to break into major global uti- 
lization. We cover all of these aspects in our chap- 
ters here. 

“Industrial Medical and Commercial 
Applications” of optoelectronics abound, and 
we cannot possibly do justice to all the myriad 


inventive schemes and capabilities that have been 
developed to date. However, we have tried hard to 
give a broad overview within major classification 
areas, to give you a flavor of the sheer potential of 
optoelectronics for application to almost every- 
thing that can be measured. We start with the 
foundation areas of spectroscopy—and increas- 
ingly important surveillance, safety, and security 
possibilities. Actuation and control—the link from 
optoelectronics to mechanical systems is now per- 
vading nearly all modern machines: cars, aircraft, 
ships, industrial production, etc.—a very long list 
is possible here. Solar power is and will continue 
to be of increasing importance—with potential 
for urgently needed breakthroughs in photon to 
electron conversion efficiency and cost of pan- 
els. Medical applications of optoelectronics are 
increasing all the time, with new learned journals 
and magazines regularly being started in this field. 

Finally, we come to the art of practical optoelec- 
tronic systems—how do you put optoelectronic 
devices together into reliable and useful systems, 
and what are the “black art” experiences learned 
through painful experience and failure? This is 
what other optoelectronic books never tell you, and 
we are fortunate to have a chapter that addresses 
many of the questions we should be thinking about 
as we design and build systems—but often forget or 
neglect at our peril. 

In years to come, optoelectronics will develop 
in many new directions. Some of the more likely 
directions to emerge by 2010 will include optical 
packet switching, quantum cryptographic com- 
munications, three-dimensional and large-area 
thin-film displays, high-efficiency solar-power 
generation, widespread biomedical and biopho- 
tonic disease analyses and treatments, and opto- 
electronic purification processes. Many new 
devices will be based on quantum dots, photonic 
crystals, and nano-optoelectronic components. A 
future edition of this Handbook is likely to report 
on these rapidly changing fields currently pursued 
in basic research laboratories. 

We are confident you will enjoy using this 
Handbook of Optoelectronics, derive fascination 
and pleasure in this richly rewarding scientific and 
technological field, and apply your knowledge in 
either your research or your business. 
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4 Optical transmission 


1.6.6.2 Increasing the number 
of channels with a 
closer channel spacing 27 
PROLOGUE 


Optics has become the unique transmission tech- 
nology in backbone networks, providing capacities 
absolutely unknown before, a very high transmis- 
sion quality and a reduction of operational costs 
per transmitted bit. This is due to the development 
of wavelength division multiplexing (WDM) intro- 
duced in 1995 and allowing to transmit around 800 
Gbit/s (80 and 10 Gbit/s channels) over one single 
mode fiber in 2005. This chapter describes this his- 
tory, puts in evidence the basics of optical transmis- 
sion and the development of WDM technology, and 
shows how the capacity of WDM systems can be 
increased by extending the used bandwidth, increas- 
ing the channel number or the bit rate per channel. 

In 2005, coherent reception which had been 
explored by a many academic and industrial labo- 
ratories in the 1980s came back in foreground, but 
receivers were implemented quite differently from 
what had been proposed 30years before. In fact, 
this very important technological step is closely 
related to the progress of electronics, allowing the 
implementation of complex and powerful digital 
signal processing (DSP) algorithms. The receiver 
consists now in a very “simple” optical front end 
(local oscillator and photodiodes), analogue to 
digital converters and a DSP unit compensating 
for all the transmission impairments due to the 
propagation over the fiber. Due to the fact that 
the optical demodulator just translates the chan- 
nel transfer function into baseband, the baseband 
transfer function to be compensated for by the 
DSP unit is just the transfer function of the optical 
channel, which is not true with a quadratic detec- 
tion. Coherent reception opens the way to complex 
and more than binary modulation schemes, which 
results into an increase of the transmitted bit rate 
within a given bandwidth. 

Chromatic dispersion is no more in line 
compensated, which eliminates the dispersion- 
compensation fiber in each amplification site, 
polarization mode dispersion (PMD), which was 
a very serious problem can be compensated for 
very efficiently. Moreover, both polarizations 
can be used, which doubles the potential capac- 
ity of each channel by using polarization division 


1.6.6.3 Modulation schemes 28 
1.6.6.4 Selected recent results 28 
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multiplexing in conjunction with WDM. And DSP 
can cancel the interference between polarizations 
and then separate them without any problem. 

Commercial coherent systems transmitting 80 
100 Gbit/s channels in C band (8 Tbit/s over one 
single mode fiber) have been available since 2011 
and are installed in backbone networks to face the 
always increasing traffic demand, while 200 and 
beyond 400 Gbit/s per channel are actively inves- 
tigated in industrial laboratories. 


1.1 INTRODUCTION 


The enormous potential of optical waves for high- 
rate transmission of information was recognized 
as early as the 1960s. Because of their very high 
frequency, it was predicted that light waves could 
be ultimately modulated at extremely large bit 
rates, well in excess of 100 Gbit-s"! and orders of 
magnitudes faster than possible with standard 
microwave-based communication systems. The 
promise of optical waves for high-speed commu- 
nication became a reality starting in the late 1980s 
and culminated with the telecommunication 
boom of the late 1990s, during which time a world- 
wide communication network involving many 
tens of millions of miles of fiber was deployed in 
many countries and across many oceans. In fact, 
much of the material covered in this handbook 
was generated to a large extent as a result of the 
extensive optoelectronics research that was carried 
out in support of this burgeoning industry. The 
purpose of this chapter is to provide a brief over- 
view of the basic architectures and properties of 
the most widely used type of optical transmission 
line, which exploit the enormous bandwidth of 
optical fiber by a general technique called WDM. 
After a brief history of optical network develop- 
ment, this chapter examines the various physical 
mechanisms that limit the performance of WDM 
systems, in particular, their output power [which 
affects the output signal-to-noise ratio (SNR)], 
capacity (bit rate times number of channels), opti- 
cal reach (maximum distance between electronic 
regeneration), and cost. The emphasis is placed on 
the main performance-limiting effects, namely 
fiber optical nonlinearities, fiber chromatic and 
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group velocity dispersions (GVDs), optical ampli- 
fier noise and noise accumulation, and receiver 
noise. Means of reducing these effects, including 
fiber design, dispersion management, modula- 
tion schemes, and error-correcting codes, are also 
reviewed briefly. The text is abundantly illustrated 
with examples of both laboratory and commercial 
optical communication systems to give the reader 
a flavor of the kinds of system performance that are 
available. This chapter is not meant to be exhaus- 
tive, but to serve as a broad introduction and to 
supply background material for the following two 
chapters (optical network architecture and optical 
switching and multiplexed architectures), which 
dwell more deeply into details of system archi- 
tectures. We also refer the reader to the abundant 
literature for a more in-depth description of these 
and many other aspects of optical communication 
systems (see, for example, [1,17,32,34]). 


1.2 HISTORY OF THE 
INTRODUCTION OF OPTICS IN 
BACKBONE NETWORKS 


Enabling the implementation of the optical com- 
munication concept required the development of a 
large number of key technologies. From the 1960s 
through the 1980s, many academic and industrial 
laboratories around the world carried out exten- 
sive research towards this goal. The three most dif- 
ficult R&D tasks were the development of reliable 
laser sources and photodetectors to generate and 
detect the optical signals, of suitable optical fibers 
to carry the signals, and of the components needed 
to perform such basic functions as splitting, filter- 
ing, combining, polarizing, and amplifying light 
signals along the fiber network. Early silica-based 
fibers had a large core and consequently carried a 
large number of transverse modes, all of which 
travel at a different velocity, leading to unavoid- 
able spreading of the short optical bits that carry 
the information and thus to unacceptably low bit 
rates over long distances. Perhaps, the most crucial 
technological breakthrough was the development 
of single-mode fibers, which first appeared in the 
mid-1970s and completely eliminated this problem. 
Over the following decade, progress in both mate- 
rial quality and manufacturing processes led to a 
dramatic reduction in the propagation loss of these 
fibers, from tens of decibels per kilometer in early 


prototypes to the amazingly low typical current loss 
of 0.18 dB-km™ around 1.5m used in submarine 
systems (or an attenuation of only 50% through a 
slab of glass about 17 km thick!). The typical attenu- 
ation of fibers used in long-distance terrestrial net- 
works today is around 0.22 dB-km™ at 1550 nm. 

Fiber components were developed in the 1980s, 
including such fundamental devices as fiber cou- 
plers, fiber polarizers and polarization control- 
lers, fiber wavelength division multiplexers [5,48], 
and rare-earth-doped fiber sources and amplifi- 
ers [17,18,41]. The descendants of these and several 
other components now form the building blocks of 
modern optical networks. Interestingly, the original 
basic research on almost all of these components was 
actually done not with communication systems in 
mind, but for fiber sensor applications, often under 
military sponsorship, in particular for development 
of the fiber optic gyroscope [5]. Parallel work on 
optoelectronic devices produced other cornerstone 
active devices, including high quantum efficiency, 
low-noise photodetectors, efficient and low-noise 
semiconductor laser diodes in the near infrared, in 
particular distributed-feedback (DFB) lasers, as well 
as semiconductor amplifiers, although these were 
eclipsed in the late 1980s by rare-earth-doped fiber 
amplifiers. The development of high-power laser 
diodes, began in the 1980s to pump high-power solid- 
state lasers, in particular for military applications, 
sped up substantially in the late 1980s in response 
for the growing demand for compact pump sources 
around 980 and 1480 nm for then-emerging erbium- 
doped fiber amplifiers. Another key element in the 
development of optical communication networks 
was the advent of a new information management 
concept called Synchronous Optical NETworks [31], 
especially matched to optical signals but also usable 
for other transmission technologies. 

Up until the mid-1980s, long-distance com- 
munication network systems were based mostly 
on coaxial cable and radio frequency technologies. 
Although the maximum capacity of a single coaxial 
cable could be as high as 560 Mbit-s“, most installed 
systems operated at a bit rate of 140 Mbit-s”, while 
radio links could support typically eight 140 
Mbit-s™ radio channels. Intercontinental traffic was 
shared between satellite links and analogue coaxial 
undersea systems; digital undersea coaxial systems 
never existed. The switch to optical networks was 
motivated in part by the need for a much greater 
capacity, in part by the need for improved security 
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and reliability of radio-based and cable-based sys- 
tems. These systems were commonly affected by 
two different types of failures, namely signal fad- 
ing and cable breaks due to civil engineering work, 
respectively. The first optical transmission systems 
were introduced in communication networks in the 
mid-1980s. Early prototypes were classical digital 
systems with a capacity that started at 34 Mbit-s7! 
and rapidly grew to 140 Mbit-s", i.e., comparable to 
established technologies. Optical communication 
immediately outperformed the coaxial technology 
in terms of regeneration span, which was tens of kilo- 
meters compared to less than 2km for high-capacity 
coaxial-cable systems. However, there was no signif- 
icant advantage compared with radio links, in terms 
of either capacity or regeneration span length, the 
latter being typically around 50 km. One could thus 
envision future long-distance networks based on a 
combination of secure radio links and optical fibers. 
Soon after optical devices became reliable enough 
for operation in a submerged environment, optical 
fiber links rapidly replaced coaxial-cable systems. 
The very first optical systems used multimode fibers 
and operated around 800nm. This spectral window 
was changed to 1300nm for the second generation 
of systems, when lasers around this wavelength first 
became available. In Japan, where optical commu- 
nication links were installed early on, prior to the 
development of the 1550-nm systems, many systems 
operate in this window. Most of the current systems 
for backbone networks, especially in Europe and the 
United States, operate in the spectral region known 
as the C-band (1530-1565 nm). This has become the 
preferred window of operation because the attenu- 
ation of silica-based single-mode fiber is minimum 
around 1550nm. The first transatlantic optical 
cable, TAT-8, was deployed in 1988. Containing two 
fiber pairs and a large number of repeaters, it spans a 
distance of about 6600km under the Atlantic Ocean 
between Europe and the United States and car- 
ries 280 Mbits of information per second. In 1993, 
optical transmission systems carrying 2.5 Gbit-s7 
(16x 155 Mbit-s“!) over a single fiber with a typical 
regeneration span of 100km began to be added to 
the growing worldwide optic—optic network. In 
terms of both capacity and transmission quality, 
radio-based systems could no longer compete, and 
optics became the unique and dominating technol- 
ogy in backbone networks. 

The single most important component that 
made high-speed communication possible over 


great distances (>100km) without electronic 
regeneration is the optical amplifier. Although 
the loss of a communication optic around 1.5m 
is extremely small, after a few tens of kilometers, 
typically 50-100km, the signal power has been 
so strongly attenuated that further propagation 
would cause the SNR of the signal at the receiver 
to degrade significantly, and thus the transmission 
quality, represented by the bit error rate (BER), 
to be seriously compromised. The SNR can be 
improved by increasing the input signal power, 
but the latter can only be increased so much before 
the onset of devastating nonlinear effects in the 
optic, in particular stimulated Raman scattering 
(SRS), stimulated Brillouin scattering (SBS), and 
four-wave mixing (FWM). Moreover, the gain in 
distance would be limited: a transmission over 
200km instead of 100km of current optic would 
require the input power to be increased by roughly 
20 dB! 

This distance limitation was initially solved by 
placing optoelectronics repeaters along the opti- 
cal line. Each repeater detects the incoming data 
stream, amplifies it electronically, and modulates 
the current of a new laser diode with the detected 
modulation. The modulated diode’s output signal 
is then launched into the next segment in the optic 
link. This approach works well, but its cost is high 
and its bit rate is limited, on both counts by the 
repeaters’ high-speed electronics. A much cheaper 
alternative, which requires high-speed electron- 
ics only at the two ends of the transmission line, 
is optical amplification. Each electronic repeater is 
now replaced by an in-line optical amplifier, which 
amplifies the low-power signals that have traveled 
through a long optic span before their SNR gets 
too low and then reinjects them into the next seg- 
ment in the optic link. The advantage of this all- 
optical solution is clearly that the optical signal is 
never detected and turned into an electronic sig- 
nal, until it reaches the end of the long-haul optical 
line, which can be thousands of kilometers long. 
Because the noise figure (NF) of optical amplifiers 
is low, typically 3-5 dB, the SNR can still be quite 
good even after the signals have traveled through 
dozens of amplifiers. 

Starting as early as the 1960s, much research was 
devoted to several types of in-line optical amplifi- 
ers, first with semiconductor waveguide amplifiers 
[51], then with rare-earth-doped optic amplifiers [18], 
and more recently Raman optic amplifiers [28]. 
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Semiconductor amplifiers turned out to have 
the highest wall-plug efficiency. However, at bit 
rates under about 1 Gbit-s7?, in WDM systems 
they induce cross-talk between signal channels. 
Although solutions have been recently proposed, 
semiconductor amplifiers have not yet entered the 
market in any significant way, partly because of 
the resounding success of the erbium-doped optic 
amplifier (EDFA). First reported in 1987 [42], this 
device provides a high small-signal gain around 
1.5m (up to ~50dB) with a high saturation 
power and with an extremely high efficiency—the 
record is 11dB of small-signal single-pass gain 
per milliwatt of pump power [52]. EDFAs used 
in telecommunication systems operate in satura- 
tion and have a lower gain, but it is still typically 
as high as 20-30 dB. The EDFAs can be pumped 
with a laser diode, at either 980 or 1480 nm, and 
they are thus very compact. Another key prop- 
erty is their wide gain spectrum, which stretches 
from ~1475 to ~1610nm, or a total bandwidth 
of 135nm (~16.4 THz!). For technical reasons, a 
single EDFA does not generally supply gain over 
this entire range, but rather over one of three 
smaller bands, called the S-band (for “short,” 
~1480-1520 nm), the C-band (for “conventional,” 
~1530-1565nm), and the L-band (for “long,” 
~1565-1610nm). Amplification in the S-band can 
also be accomplished with a thulium-doped fiber 
amplifier (TDFA) [49]. Gain has been obtained 
over the S- and C-band by combining an EDFA 
and a TDFA [50]. 

Perhaps more importantly, EDFAs induce negli- 
gible channel cross-talk at modulation frequencies 
above about 1 MHz. These unique features make 
it nearly ideally suited for optical communication 
systems around 1.5 pm. Since the mid-1990s, it has 
been the amplifier of choice in the overwhelming 
majority of deployed systems, thus eliminating the 
electrical regeneration bottleneck. 

The very large gain bandwidth of EDFAs and 
other optical amplifiers also provided the oppor- 
tunity of amplifying a large number of modulated 
optical carriers at different wavelengths distrib- 
uted over the amplifier bandwidth. This concept of 
WDM had of course already been applied in radio 
links. One significant advantage of WDM optical 
systems is that the same amplifier amplifies many 
optical channels, in contrast with classical regener- 
ated systems, which require one repeater per chan- 
nel. Optical amplifiers thus reduce the installation 


cost of networks in two major ways. First, the 
WDM technique results in an increase in capac- 
ity without laying new fibers, which reduces optic 
cost. Second, the cost of amplification is shared 
by a large number of channels, and because the 
use of a single optical amplifier is cheaper than 
implementing one regenerator per channel, the 
transmission cost is reduced proportionally. This 
critical economic advantage provided the final 
impetus needed to displace regenerated systems 
and launch the deployment of the worldwide opti- 
cal WDM backbone networks that took place at the 
end of the 1990s. 


1.3 GENERAL STRUCTURE OF 
OPTICAL TRANSMISSION 
SYSTEMS 


1.3.1 Modulation and detection: 
RZ and NRZ codes 


While radio systems use a wide variety of modu- 
lation formats in order to improve spectrum uti- 
lization, in optical systems data have been so far 
transmitted using binary intensity modulation. A 
logic 1 (resp. 0) is associated to the presence (resp. 
absence) ofan optical pulse. Two types of line codes 
are mainly encountered: nonreturn to zero (NRZ), 
where the impulse duration is equal to the symbol 
duration (defined as the inverse of the data rate) 
and return to zero (RZ), where the impulse dura- 
tion is significantly smaller than the symbol dura- 
tion. This property explains why the name “return 
to zero” is used; if the impulse duration equals 
roughly one half of the symbol time, the modula- 
tion format is designed as RZ 50%. So at a bit rate 
of 10 Gbit-s!, NRZ uses impulses with a width of 
approximately 100 ps, while RZ 50% or RZ 25% 
will use 50 and 25 ps wide pulses, respectively. RZ 
has, for a given mean signal power, a higher signal 
peak power. This property can be used to exploit 
nonlinear propagation effects, which under cer- 
tain conditions can improve system performance. 
Details will be provided further on. 

Research is actively being conducted to investi- 
gate new modulation schemes for future high-bit- 
rate systems. For instance, duobinary encoding, a 
well-known modulation scheme in radio systems, 
has been proposed because of its higher resis- 
tance to chromatic dispersion. Carrier-suppressed 
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RZ, an RZ modulation format with an additional 
binary phase modulation, is also extensively stud- 
ied, as well as recently differential phase shift key- 
ing (DPSK); both provide a higher resistance to 
nonlinear effects. However, only NRZ and RZ are 
used in installed systems today. 

Detection of the optical signals at the end of a 
transmission line is performed with a photode- 
tector, which is typically a PIN or an avalanche 
diode. Photons are converted in the semiconduc- 
tor in electron-holes pairs and collected in an 
electrical circuit. The generated current is then 
amplified and sent to a decision circuit, where the 
data stream is detected by comparing the signal 
to a decision threshold, as in any digital system. 
Detection errors can occur in particular because 
of the presence of noise on the signal and in the 
detector. The error probability is a measurement 
of the transmission quality. In practice, the error 
probability is estimated by the BER, defined as the 
ratio of error bits over the total number of trans- 
mitted bits. 

Several sources of noise are typically present in 
the detection process of an optical wave. Shot noise, 
the most fundamental one, arises from the discon- 
tinuous nature of light. Thermal noise is generated 
in the electrical amplifiers that follow the photode- 
tector. In PIN receivers, thermal noise is typically 
15-20 dB larger than the quantum limit, and if the 
optical signal is low, thermal noise dominates shot 
noise. In the case of amplified systems under nor- 
mal operating conditions, the amplified spontane- 
ous emission (ASE) noise of the in-line amplifiers 
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is largely dominant compared to the receiver noise, 
which can thus be neglected. 


1.3.2 Basic architecture of amplified 
WDM communication links 


A typical amplified WDM optical link is illus- 
trated in Figure 1.1. The emitter consists of N lasers 
of different wavelengths, each one representing a 
communication channel. The lasers are typically 
DFB semiconductor lasers with a frequency sta- 
bilized by a number of means, including tempera- 
ture control and often Bragg gratings. Each laser is 
amplitude modulated by the data to be transmit- 
ted. This modulation is performed with an exter- 
nal modulator, such as an amplitude modulator 
based on lithium niobate waveguide technology. 
Direct modulation of the laser current would be 
simpler and less costly, but it introduces chirping 
of the laser frequency, which is unacceptable at 
high modulation frequencies over long distances 
[1,29]. The fiber-pigtailed laser outputs are com- 
bined onto the optical fiber bus using a wavelength 
division multiplexer, then generally amplified by a 
booster fiber amplifier. 

The multiplexer can be based on concatenated 
WDM couplers (for low number of channels) or 
arrayed waveguide grating multiplexers. This is the 
technology of choice for high channel counts, in 
particular in the so-called dense wavelength divi- 
sion multiplexed systems: although this term has 
no precise definition, it applies usually to systems 
with a channel spacing less than 200 GHz. In-line 
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Figure 1.1 Structure of an amplified WDM optical system. 


1.3 General structure of optical transmission systems 9 


optical amplifiers are distributed along the fiber 
bus to periodically amplify the power in the sig- 
nals, depleted by lossy propagation along the fiber. 
Ideally, each amplifier provides just enough gain 
in each channel to compensate for the loss in that 
channel, i.e., such that each channel experiences 
a net gain of unity. Because the gain of an optical 
amplifier and to a smaller extent the loss along the 
fiber are wavelength dependent, the net gain is dif- 
ferent for different channels. If the difference in net 
gain between extreme channels is too large, after 
a few amplifier/bus spans the power in the stron- 
gest channel will grow excessively, thus robbing 
the gain for other channels and making their SNR 
at the receiver input unacceptably low. This major 
problem is typically resolved by flattening or other- 
wise shaping the amplifier spectral gain profile, or 
equivalently equalizing the power in the channels, 
using one of several possible techniques, either pas- 
sive (for example, long-period fiber gratings) [58] 
or dynamic (e.g., with variable optical attenuators). 
While the first WDM systems that appeared in the 
mid-1980s used basic amplifiers without control 
system, the new very long reach systems include 
complex gain flattening devices that compensate 
for the accumulation of gain tilt. The gain of an 
in-line amplifier ranges approximately from 15 to 
30dB per channel. The distance between amplifi- 
ers is typically 30-100 km, depending on fiber loss, 
number of channels, and other system parameters. 
In deployed undersea systems, the amplifiers are 
equally spaced, whereas in terrestrial networks 
the amplifier location depends on geographical 
constraints, for instance, building location and 
amplifiers tend to be unevenly spaced. At the out- 
put of the transmission line, a wavelength division 
demultiplexer separates the N optical channels, 
which are then sent individually to a receiver, then 
electronically processed. 


1.3.3 Basic architectures of 
repeaterless systems 


A repeaterless communication system aims to 
accomplish a very long optical reach without in- 
line amplifiers. A common application is con- 
necting two terrestrial points on each side of a 
straight or narrow arm of sea, in which case it is 
generally not worth incurring the cost of undersea 
amplifiers. Some deployed repeaterless systems are 
extremely long, as much as hundreds of kilometers, 


and consequently they exhibit a high span attenua- 
tion, up to 50 or even 60 dB. The problem is then to 
ensure that the output power at the receiver is high 
enough, in spite of the high span loss, to achieve 
the required SNR at the end of the line. 

This goal has been achieved with a number of 
architectures. A common solution involves using 
a preamplifier, ie, an amplifier placed before the 
receiver to increase the detected power and reduce 
the receiver NF, as is also often done in classi- 
cal WDM systems. Another one is to use a high- 
power amplifier, i.e. an amplifier placed between 
the emitter and the transmission fiber to boost the 
signal power launched into the fiber. Although 
such a booster amplifier is also present in some 
of the amplified WDM systems described before, 
the typical feature in repeaterless systems is the 
high power level, which can reach up to 30 dBm. 
In both cases, the amplifier can be either an EDFA 
or a Raman amplifier, or a combination of both. 
This solution, as we will see further on, is limited 
by nonlinear effects in the fiber, although they can 
be somewhat mitigated with proper dispersion 
management. 

A third solution specific to repeaterless systems 
is to place an amplifier fiber in the transmission 
fiber itself and to pump it remotely with pump 
power launched into the transmission fiber from 
either end. The amplifier fiber can then be a length 
of Er-doped fiber; the entire transmission fiber can 
be lightly doped with erbium (the so-called dis- 
tributed fiber amplifier); or the transmission fiber 
can be used as a Raman amplifier. The drawback 
of this general approach is that it requires a sub- 
stantially higher pump power than a traditional 
EDFA, and it is therefore more costly. The reason 
is that the pump must propagate through a long 
length of transmission fiber before reaching the 
amplifier fiber, and because the transmission fiber 
is much more lossy at typical pump wavelengths 
than in the signal band, some of the pump power is 
lost. A fourth general solution, which is not specific 
to repeaterless systems, is to use powerful error- 
correcting schemes [12]. 


1.3.4 Optical reach and 
amplification span 
Two important features of a WDM communica- 


tion system are its total capacity, usually expressed 
as N x D, where N is the number of optical channels 
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and D the bit rate per channel, and the optical 
reach, which is the maximum distance over which 
the signal can be transmitted without regeneration. 
Even in amplified systems with a nominally unity 
net gain transmission, due to the accumulation of 
noise from the optical amplifiers and signal distor- 
tions, after a long enough transmission distance 
the bit error becomes unacceptably high, and the 
optical signals need to be regenerated. In practical 
deployed WDM systems in 2001, this limitation 
typically occurs after about seven amplifier spans 
with a loss of roughly 25 dB per span. 

Another important parameter is the amplifica- 
tion span, i.e., the distance between adjacent ampli- 
fiers. The performance of an optical WDM system 
cannot be expressed only in terms of optical reach; 
the number of spans must also be introduced. As 
an example, the optical reach of commercially 
available terrestrial systems in 2002 was around 
800km, compared to 6500km in transatlantic 
systems. A key difference between them is the 
amplification span, as will be explained in the next 
section. In the following, WDM systems with a bit 
rate per channel of 2.5, 10, and 40 Gbit-s“! will be 
designated as WDM 2.5G, WDM 10G, and WDM 
40G, respectively. 


1.4 LIMITATIONS OF OPTICAL 
TRANSMISSION SYSTEMS 


1.4.1 Noise sources and bit error rate 


1.4.1.1 AMPLIFIER NOISE 


Amplification cannot be performed without add- 
ing noise to the amplified signals. In optical ampli- 
fiers, this noise originates from ASE [17], which 
is made of spontaneous emission photons emit- 
ted by the active ions (Er** in the case of EDFAs) 
via radiative relaxation subsequently amplified as 
they travel through the gain medium. The spectral 
power density of the ASE signal per polarization 
mode is given by 


(1.1) 


where G is the amplifier gain, h Planck’s constant, 
and v the signal optical frequency. n,, is a dimen- 
sionless parameter larger or equal to unity called 
the spontaneous noise factor. It depends on the 
amplifier’s degree of inversion, and it approaches 
unity (lowest possible noise) for full inversion of 


Vase = np hv (G = 1) 


the active ion population. The ASE is a broadband 
noise generated at all frequencies where the ampli- 
fier supplies gain, and its bandwidth is nominally 
the same as that of the amplifier gain. The ASE 
power coming out of the amplifier, concomitantly 
with the amplified signals, is obtained by the inte- 
gration of y,., over the frequency bandwidth of the 
gain. As an example, in a particular C-band EDFA 
amplifying ten signals equally spaced between 
1531 and 1558 nm and with a power of 1 pW each, 
and with a peak gain of 33 dB at 1531 nm, the total 
power in the amplified signals is 5.5 mW, whereas 
the total ASE output power is 0.75mW, i.e., more 
than 10% of the signals’ power. 

The photodetectors used in receivers are the so- 
called quadratic detectors, i.e., they respond to the 
square of the optical field. Detection of an optical 
signal S corrupted by additive noise N (ASE noise 
in the case of amplified systems) in a photode- 
tector thus gives a signal proportional to |S+N)?. 
Expansion of this signal gives rise to the signal |S|* 
(the useful signal) plus two noise terms. The first 
term (2SN) is the beat noise between the signal and 
the ASE frequency component at the signal fre- 
quency; it is called the signal-ASE beat noise. The 
second term (N?) is the beat noise between each 
frequency component of the ASE with itself (the 
ASE-ASE beat noise). The signal-ASE beat noise 
varies from channel to channel, but the ASE-ASE 
beat noise is the same for all channels. A third and 
fourth noise terms are of course the shot noise of 
the amplified signal and the shot noise of the ASE, 
and to this must be added a fifth term, namely 
the receiver noise discussed earlier. In high-gain 
amplifiers with low input signals, which are appli- 
cable to most in-line amplifiers in communication 
links, the dominant amplifier noise term is the 
signal-ASE beat noise. In the amplifier example 
given at the end of the previous paragraph, the 
SNR degradation (also known as the NF) is ~3.4dB 
for all 10 signals, and it is due almost entirely to 
signal-ASE beat noise. This noise term is typically 
large compared to the receiver noise, which can 
usually be neglected. Note that the NF is defined 
as the SNR degradation of a shot-noise-limited 
input signal. The SNR degradation at the output 
of an amplifier is therefore equal to the NF only 
when the input signal is shot-noise limited. In a 
chain of amplifier, this is true for the first amplifier 
that the signal traverses. However, after traveling 
through several amplifiers, the signal is no longer 
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shot-noise limited but dominated by signal-ASE 
beat noise, and the SNR degradation is smaller 
than the NF. Refer to “Accumulation of Noise” sec- 
tion for further detail on noise accumulation in 
amplifier chains. 


1.4.1.2 PHOTORECEIVER THERMAL NOISE 


As mentioned earlier, the photoreceiver thermal 
noise is generally fairly large compared to shot 
noise. However, it can become negligible when the 
signals are amplified with a preamplifier placed 
before the detector. To justify this statement, con- 
sider a receiver consisting of an optical preamplifier 
of gain G followed by a photodetector. The optical 
signal and ASE noise powers at the receiver input 
are proportional to G and G—1, respectively (in 
practice, G is very large and G—1~G). Because the 
thermal noise does not depend on G, and because 
it is typically 15-20dB worse than the quantum 
limit, it is clear that if the preamplifier gain is large 
enough, say 20dB, the thermal noise is negligible 
compared to the signal-ASE beat noise. This is 
exactly the same phenomenon as in electronics, 
where the high-gain first stage of a receiver masks 
the noise of the following stages. This property 
illustrates another advantage brought by optical 
amplifiers: optical preamplifiers allow to get away 
from the relatively poor NF of electronic circuits 
and thus to achieve much better performance. 


1.4.1.3 RELATIONSHIP BETWEEN BIT ERROR 
RATE AND NOISE 


How does the error rate at the receiver depend 
on the noise level, or more exactly on the opti- 
cal signal-to-noise ratio (OSNR), of the detected 
signal? To answer this question, we must make 
some assumptions regarding both the signal and 
the noise. First, because the signal-ASE beat noise 
depends on the signal power, it also depends on the 
state of the signal, i.e., on the transmitted data. If 
we assume an ideal on-off keying (OOK) modula- 
tion, signal-ASE beat noise is present only when 
the signal is on, whereas the ASE-ASE beat noise 
is present even in the absence of signal. Because 
the data can be assumed to be equally often on 
and off, the mean signal power is equal to half the 
peak power. 

Second, to obtain an analytical expression for 
the bit error probability requires another assump- 
tion, common in communication theory, which is 
that the noise has a Gaussian statistics. This is true 


for signal-ASE beat noise, as a result of the linear 
processing of Gaussian processes, but it is not true 
for ASE-ASE beat noise. However, under normal 
operating conditions of amplified systems (i-e., 
with a sufficiently high OSNR), the influence of 
ASE-ASE beat noise remains relatively small. After 
a large number of optical amplifiers, however, the 
ASE-ASE beat noise component, which depends 
on the total ASE noise, can become significant. 
An effective way to reduce this noise component 
is then to place before the receiver an optical filter 
that cuts down the ASE power between the opti- 
cal signals. This can be accomplished with a comb 
filter or with the demultiplexer that separates the 
channels. Such a filter reduces the ASE-ASE beat 
noise, but of course it does not attenuate either the 
signals or the ASE at the signals’ frequencies, so it 
does not affect the signal-ASE beat noise. In the 
following, we assume that such a filter, with a rect- 
angular transmission spectrum of optical band- 
width B,, is placed before the receiver. 

Third, because the noise variance is not the 
same conditionally to the transmitted data, the 
best decision threshold is not just at equal distance 
between the two signal levels associated with the 
two possible data values at the sampling time, but 
rather some other optimum threshold value that 
depends on signal power. Assuming that this opti- 
mum threshold value is used, the bit error prob- 
ability (or BER) can be expressed as [30]: 


1 V2R 
Poicaes => erfc 1.2 
. 2 (soe | M2) 


where erfc is the complementary error func- 
tion and the SNR, R is the ratio of the mean sig- 
nal power to the ASE power within the electrical 
bandwidth B, i.e., y4s,B. The electrical bandwidth 
B is the bandwidth of the electronic post-detection 
circuits. The parameter m is the normalized opti- 
cal filter bandwidth, m=B,/B. Equation 1.2 can be 
easily derived by computing the variances of the 
signal-ASE beat noise and ASE-ASE beat noise 
contributions. For the computation of the first 
term, the average power of the signal is used. An 
ideal rectangular optical filter is assumed, as well 
as a rectangular electrical filter 

The SNR is usually measured not within the sig- 
nal bandwidth, but over a much larger bandwidth 
B,; corresponding generally to 0.1mm in wave- 
length (or 12.5GHz near 1550 nm). Calling this 
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parameter the optical SNR R,, the bit error prob- 
ability can be rewritten as 


1 Q 
Pract = —erfc}] 1.3 
tet{S) ay 
where Q is the quality factor: 
oi (1.4) 


5 Ju t+ Jut 4BR, 


B is the ratio B/B, of the electrical to measure- 
ment bandwidths, Ry=BR and p=mf?. An error 
probability of 10° (resp. 107) requires, for exam- 
ple, Q=6 (resp. 8). Neglecting the ASE-ASE beat 
noise contribution (1 > 0), the BER can be simply 


expressed as 
Pace =rerfe| |*2 
2 2B 


From Equation 1.4, the value of R, needed to 
achieve a given quality factor Q, is given by 


eteets) aaa 
n=pai[1 ] 


(1.5) 


(1.6) 


0 


As an example, consider a 10 Gbit-s! system with 
an optical bandwidth B, of 50 GHz, an electrical 
bandwidth B of ~7 GHz (as a rule of thumb, the 
electrical bandwidth is taken to be 70% of the bit 
rate), and a filter bandwidth B, of ~12.4GHz (0.1 
nm). Then B=0.56 and m=7, and a BER of 107 
(Q)=8) requires an OSNR R, of ~17 dB. 

Based on the various degradation mechanisms 
that induce power penalty along the transmis- 
sion system, equipment vendors specify a mini- 
mum SNR required by a given system. Typically, 
for WDM 10G systems, the minimum OSNR is in 
the range of 21-22dB. It must be noted that the 
required OSNR increases with increasing electri- 
cal bandwidth, which is proportional to the bit 
rate. For instance, by going from 2.5 to 10 Gbit-s“! 
the OSNR needs to be increased by about 6 dB (see 
Equation 1.6). This is a very important constraint, 
for system designers as well as operators. 


1.4.1.4 ACCUMULATION OF NOISE 


The accumulation of noise generated by succes- 
sive amplifiers along an optical line degrades the 
OSNR. This accumulated noise limits the number 


of successive amplifiers that can be used, and thus 
the optical reach. Assuming a link with N equally 
spaced amplifiers of mean output power per chan- 
nel P,, an inversion parameter n,,, and a gain G 
compensating exactly for the attenuation of the 
fiber span between them, the total noise power P, 
(including both polarization modes) within a band- 
width B, at the last amplifier output is given by [29]: 


P, = 2Nity(G—)hVBy = Mp 5 (expla, )-hvB, 
a 
a 2ngotl — hvB, 
1nG 


(1.7) 


where L is the total length of the link, Z, the length 
of the amplification span, and « the attenuation 
factor of the fiber. The SNR R is 


= & 
ae 2Nn,(G—lhvBy ey 
or in dB: 
OSNR (dB) =R,(dBm) —P,(dBm) (1.9) 


These relationships show that for a given launched 
power and a given amplification span, the maxi- 
mum transmission distance, represented by the 
maximum number of spans, is limited by the 
minimum required OSNR at the receiver input. 
For a given distance L, the OSNR increases when 
G decreases, ie., when the amplification span 
becomes shorter. As an example, Equation 1.7 
shows that for alink witha fixed length L anda fiber 
attenuation of 0.2 dB-km"!, an OSNR improvement 
of 7 dB is achieved when the span length is reduced 
from Z,=100km (aspan loss of 20 dB) to Z,=50km 
(a span loss of 10dB). The link with the 50-km 
span length does require twice as many amplifiers, 
but the gain they each need to supply is reduced 
by 10dB. So is their output noise power, and as a 
result the OSNR is improved. This illustrates how 
important a parameter the fiber attenuation is. For 
a given amplification span and a given number of 
spans, any reduction in this attenuation will result 
in a better OSNR simply because the amplifier gain 
G will be smaller. Equivalently, a lower fiber loss 
allows increasing the optical reach. For example, a 
fiber loss reduction as small as 0.02 dB-km™ from 
0.23 to 0.21 dB-km"™, in 100-km spans, will allow 
to reduce the gain by 2 dB and thus to improve the 
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Signal power per channel (dBm) 


Number of spans 


Figure 1.2 Output signal power per channel required to achieve an OSNR of 20dB as a function of 


the number of spans for different span losses. 


OSNR by as much as 2 GB. If with the 0.23 dB-km™ 
fiber, the required OSNR was reached after eight 
spans; with the 0.21 dB-km" fiber, it will be possi- 
ble to have 12 spans (10 log (12/8) = 1.7 dB), the same 
noise power being then produced by a larger num- 
ber of less noisy amplifiers. This dependence of the 
OSNR on the span loss explains why the amplifica- 
tion span is significantly shorter in undersea light 
wave systems compared to terrestrial ones, because 
transmission distances are much longer. 

Figure 1.2 shows the output signal power per chan- 
nel (calculated from Equation 1.8) needed to reach an 
OSNR of 20dB as a function of the number of spans 
for different span losses. The inversion parameter of 
the optical amplifiers is taken to be n,,=1.6, and the 
output ASE is assumed to be filtered with a 0.1-nm 
narrowband filter (B)= 12.5 GHz). These curves also 
allow comparing the power required for achieving 
a given optical reach. For instance, for a link length 
L=1000km and a fiber attenuation «0=0.25dB km", 
if there are N=20 spans, each one of them will have 
a loss aL/N=12.5dB, and Figure 1.2 shows that the 
required power per channel will be—7.5 dBm. When 
the number of spans is divided by two (N= 10), the 
span loss increases to 25dB and the required signal 
power jumps nearly 10-fold, to 2 dBm. 


1.4.2 Signal distortions induced by 
propagation 
In addition to SNR degradation due to optical 


amplifiers, transmitted optical signals also suffer 
distortions induced by propagation along the fiber. 


These effects become more and more important as 
the bit rate increases 


1.4.2.1 CHROMATIC DISPERSION 


Within a narrow bandwidth around the car- 
rier angular frequency @,, a fiber of length L 
can be viewed as an all-pass linear filter, with an 
attenuation nearly independent of wavelength 
over the small signal bandwidth prevailing at 
the bit rates under consideration. Expanding the 
phase up to the second order in frequency @ about 
@, allows to write the transfer function H(@) of 
this filter as 


H(@)=A vx { (0.)+9,100- @ )+ 5 Be(o- Wo) ] 


(1.10) 


Higher order terms in the expansion must be con- 
sidered when, for instance, B, equals zero. The first 
and second terms of the exponent represent a con- 
stant phase shift and the delay of the impulse (also 
called group delay), respectively. The third term, 
proportional to the second derivative of the signal 
mode index with respect to the wavelength, origi- 
nates physically from the dependence of the mode 
group velocity on wavelength. It is often referred 
to as GVD or chromatic dispersion. In an optical 
waveguide such as a fiber, GVD is approximately 
the sum of the material dispersion and the fiber 
dispersion. It is mathematically represented by B, 
usually expressed in ps*km™ or by the so-called 
chromatic dispersion D, which is a more familiar 
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parameter to system designers, expressed in ps 
nm km-!. It is the group delay variation over a 
1-nm bandwidth after propagation along a 1-km 
length of fiber. In a standard communication 
fiber, D is around 15 ps-nm7!-km“. D (expressed in 
psnmkm1) is related to B, (in pskm~!) and A (in 
nm) by 


(1.11) 


Chromatic dispersion results in broadening of the 
signal as it propagates through the fiber. When the 
signal pulse amplitude is Gaussian, the pulse width 
evolution along the fiber can be computed analyti- 
cally and pulse broadening can be expressed with 
simple expressions [1]. If a Gaussian pulse with 
a complex impulse envelope u(t, 0) of temporal 


width 0,, 
t’ 
u(t,0)=U, exp| -—— 
(t,0) 0 xp 262 


(1.12) 
is launched into the fiber at z=0, the impulse at 
distance L is given by [1]: 


u(t, L)= a (1.13) 


where U is the amplitude taking into account the 
fiber attenuation, T=it—,L the time in local coor- 
dinates associated to the signal and (x, T) a phase 
term. The parameter O(x) is the temporal pulse 
width at distance L, given by 


O(x)=O9-V1+x7 


where x=L/Ly is the propagation distance nor- 
malized to the characteristic dispersion length 
Lp =84 / |B). 

As expected from physical arguments, in the 
presence of chromatic dispersion the pulse width 
expands along the fiber in much the same way as 
a spatial beam expands in space due to diffraction 
(see Equation 1.14). Here the dispersion length 
Ly plays the same role as the Rayleigh range does 
in the diffraction of Gaussian beams. For a fiber 
of length L and dispersion coefficient B,, there is 
an optimum value of the incident pulse width 0, 
that minimizes the pulse width at the fiber output. 
This optimum pulse width, obtained by setting the 


(1.14) 


derivative of Equation 1.14 with respect to 8, equal 
to zero, is given by 


Bo,opt = \Ba|L (1.15) 
and the output pulse width is 
O(L) = V28o,0p = /2|Bo|L (1.16) 


Stated differently, Equation 1.15 shows that for 
a given input pulse width 6, the optimum fiber 
length that minimizes the output temporal pulse 
width is L=0) /|B2|=Lp , Le, one dispersion 
length. 

This analysis shows that the larger the chro- 
matic dispersion is, the narrower the initial pulse 
needs to be, and the larger the pulse width will 
be at the output of the fiber. It implies that if the 
input pulse width is not properly selected, i.e., if it 
is either too narrow or too wide, chromatic disper- 
sion will cause successive pulses to overlap, which 
creates what is known as intersymbol interference 
(ISI). This deleterious effect alters the decision pro- 
cess and thus increases the BER. It must be noted 
that a Gaussian pulse extends indefinitely in time 
and there is theoretically always a finite amount of 
ISI; the maximum distance is thus set by a “toler- 
able” level of ISI. This distance is exactly Ly if we 
define this acceptable ISI is reached when the ini- 


tial width of the pulse has been multiplied by V2 
(which is somewhat arbitrary). As an example, 
assuming Gaussian pulses with 0, equal to half 
the symbol duration, for a standard communica- 
tion fiber (D=17ps-nm™ km“, i.e., B,=20 ps*-km") 
this distance is equal to 2000 km at 2.5 Gbit-s™, but 
only 125km at 10 Gbit-s7. 

This result demonstrates that propagation 
at bit rates of 10 Gbit-s"' or greater in a standard 
fiber is not possible over distances longer than a 
few tens of kilometer without significant ISI. This 
problem is circumvented in practice by introduc- 
ing along the transmission line components with 
a negative dispersion coefficient to compensate for 
chromatic dispersion, in much the same way as an 
optical lens is used to refocus a free-space beam 
after it has expanded as a result of diffraction. This 
method has been demonstrated in the laboratory 
with a number of optical filters, especially fiber 
Bragg gratings for dynamic compensation [19], 
or more simply and commonly with a length of 
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dispersion-compensating fiber (DCF) designed to 
exhibit a strong negative dispersion coefficient D 
[17], in the range of —90 ps-nm7-km7! for standard 
fiber. This last solution is the only one used in com- 
mercial systems today, and fiber suppliers try to 
develop the best compensation fiber matched to the 
fiber they sell. DCFs are typically more lossy than 
standard communication fibers, with attenuation 
coefficients around 0.5 dB-km™. To make up for 
this additional loss, the DCF is typically inserted 
near an amplifier. In order to reduce the impact of 
the DCF loss on the amplifier NF, in WDM sys- 
tems the DCF is usually placed in the middle of a 
two-stage amplifier. 

Chromatic dispersion depends on wavelength. 
This dependence is characterized by the disper- 
sion slope (expressed in ps:nm~-km~'). In order to 
completely compensate for the dispersion at any 
wavelength, the fiber and the associated DCF must 
exhibit the same D/S ratio, where S is the slope of 
the dispersion D. The existence ofa perfectly slope- 
compensating DCF depends strongly on the type 
of fiber. For example, a DCF very well matched to 
standard single-mode fiber (SSMF) is available, 
but this is not true for all fibers. If the slope is not 
matched, some channels will exhibit a finite resid- 
ual chromatic dispersion outside the “acceptance 
window,” i.e., the interval within which the disper- 
sion must lie to ensure a correct transmission qual- 
ity. This window is typically 1000 ps-nm“! wide for 
a WDM 10G system. In general, unless carefully 
designed a dispersion-compensation filter does not 
cancel dispersion perfectly for all channels. So even 
after correcting dispersion to first order, in long- 
haul WDM systems the residual dispersion can 
still limit the transmission length and/or the num- 
ber of channels. To illustrate the magnitude of this 
effect, consider a link of length L carrying N chan- 
nels spaced by AA (i-e., NAA is the total multiplexed 
width), with a dispersion slope after compensation 
S. The cumulated dispersion at the output of the 
link is then SLNAX. The receiver can be designed 
to tolerate a certain amount of residual cumu- 
lated dispersion within some spectral window, for 
instance, typically 1000 ps-nm™ for a 10 Gbit-s“! 
system, as stated earlier. For a typical dispersion 
slope S=0.08 ps-nm~*-km7"!, N=64, and AA=0.8 nm 
(or a multiplexed width of 51.2 nm), the maximum 
possible fiber length for which the cumulated dis- 
persion reaches 1000 ps-nm! is 244 km. This effect 
can of course be avoided by reducing the length 


or the number of channels, which impacts system 
performance. Better solutions include designing 
broadband dispersion-compensation filters with a 
dispersion curve matched to that of the fiber link. 
This is a key issue for WDM systems, which has 
received a lot of attention from system designers. 


1.4.2.2 NONLINEAR EFFECTS 


The maximum power that can be transmitted 
through an optical fiber, and thus the SNR of the 
signal at the fiber output are ultimately limited by 
a number of optical nonlinearities present in the 
optical fiber. These nonlinear effects are the Kerr 
effect (dependence of the fiber refractive index 
on the signal intensity), SBS (conversion of signal 
power into a frequency-shifted backward wave), 
SRS (conversion of signal power into a forward 
frequency-shifted wave), and FWM (optical mix- 
ing of a signal with itself or other signals and con- 
comitant generation of spurious frequencies). The 
magnitude of these effects generally increases with 
increasing signal intensity, which can be relatively 
high in a single-mode fiber, even at low power, 
because of the fiber’s large optical confinement. For 
example, in a typical single-mode fiber at 1.55 ym 
with an effective mode area of 80,1m?, a 20-dBm 
signal has an intensity of ~1.2 kW-mm~! Because 
this intensity is sustained over very long lengths, 
and because the conversion efficiency of nonlin- 
ear effects generally increases with length, even 
the comparatively weak nonlinear effects present 
in silica-based fibers can have a substantial impact 
on system performance, even at low power. This 
section provides background on the magnitude of 
these nonlinear effects, describes their impact on 
system performance, and mentions typical means 
of reducing them. 


1.4.2.3 SELF-PHASE MODULATION 


When a signal propagates through an optical fiber, 
through the Kerr effect it causes a change An in the 
refractive index of the fiber material. In turn, this 
modification of the medium property reacts on the 
signal by changing its velocity and thus its phase. 
This nonlinear effect is known as self-phase mod- 
ulation (SPM). For a signal of power P, the index 
perturbation An is expressed as 


KipeGues (1.17) 
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where n, is the Kerr nonlinear constant of the fiber 
(n,=3.2X10-° m?-W"! for silica) [35] and A, the 
signal mode effective area. The resulting change in 
the mode propagation constant is 


_@An_ 2m 


I 1.18 
ote (1.18) 


AB 


where I=P/A,, is the signal intensity. In the case 
of a modulated signal, because the Kerr effect has 
an extremely fast response time («1ps) each por- 
tion of the signal pulse modulates its own phase 
independently of other portions of the pulse. If I,(¢) 
is the instantaneous intensity, or equivalently the 
intensity profile, of the signal launched into the 
fiber, and if « is the fiber loss at the signal wave- 
length, then the signal intensity at a point z along 
the fiber is I(t, z)=I,(t) exp(-az). The amount of 
SPM experienced by the signal pulse after a propa- 
gation length L is simply [1]: 


2TN 
r 
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L 
J I, exp(—az)dz = 


0 


L 
W(t, L)= J spar Leg 
0 


(1.19) 


where L.¢ =(1—e7°)/a is the effective fiber length. 
In principle, because photodetectors are qua- 
dratic and thus phase insensitive, SPM should not 
cause any detrimental effect. This is true in AM 
systems provided the fiber is free of dispersion. 
However, in practice, the presence of chromatic 
dispersion converts SPM into amplitude fluctua- 
tions [1]. When f, is positive, SPM combines to 
chromatic dispersion to produce pulse broaden- 
ing, just like chromatic dispersion alone does. 
When B, is negative, SPM combines to chromatic 
dispersion to produce pulse narrowing, i.e., they 
have opposite effects. In this case, SPM can be 
used to compensate for chromatic dispersion and 
thus improve the system performance. There is in 
fact a particular regime in which linear and non- 
linear effects compensate mutually exactly at any 
moment in time. This particular solution of the 
nonlinear Schrédinger equation, valid only in a 
lossless fiber, is called an optical soliton. Provided 
that it has the proper shape and intensity, a soliton 
propagates without any temporal deformation. 
This phenomenon was extensively studied in the 
1990s, and it continues to be an active research 


topic, because fiber-optic solitons are very prom- 
ising for ultralong distance transmission at high 
bit rates [26,43,44]. A soliton-based transmis- 
sion encrypts the information in extremely short 
pulses that neither spread nor compress as they 
propagate along the fiber because the soliton has 
just the right peak power for the Kerr nonlinear 
phase shift to exactly compensate for chromatic 
dispersion. Soliton-based communication links 
are, however, not compatible with WDM-based 
links because in order to have a relatively low 
peak power, a soliton needs a low-dispersion fiber, 
which is not well suited for WDM (see “Four- 
Wave Mixing (FWM)” section). The WDM solu- 
tion has obviously won so far, even for long-haul 
transmission. But the concept of soliton-based 
communication systems remains an interesting 
and promising approach that continues to stimu- 
late a lot of research and development. 

In parallel to these various schemes used to 
combat SPM, the most effective first-order solution 
to reduce SPM is to use a transmission fiber with 
a large mode effective area A,,. This is of course 
also applicable to other undesirable fiber nonlin- 
ear effects, in particular cross-phase modulation 
(XPM), FWM, and stimulated scattering pro- 
cesses. The reason is that the magnitude of all of 
these processes increases as the reciprocal of A, 
so a fiber with a higher A,, can tolerate a higher 
signal power. Large mode effective areas are typi- 
cally accomplished by designing fibers with a 
larger core and a concomitantly lower numerical 
aperture to ensure that the fiber carries a single 
mode. Communication-grade fibers have mode 
effective areas in the range of 50-100pm/?, for 
example, 801m? for the so-called standard fiber. 
Substantially higher values are typically precluded 
for transmission fibers because they require such 
low numerical apertures that the fiber becomes 
overly susceptible to bending loss. 


1.4.2.4 CROSS-PHASE MODULATION 


XPM has the same physical origin as SPM, namely 
the Kerr effect, except that the phase modulation is 
not induced by a signal on itself, but by one or more 
different signals propagation through the fiber. A 
different signal means any signal with a different 
wavelength, a different polarization, and/or a dif- 
ferent propagation direction. In a WDM system, 
the phase of a signal of wavelength A, is therefore 
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modulated by itself (wavelength A, SPM) and by all 
the other channels (wavelengths i,,,, XPM). 

The XPM affecting a particular channel i of a 
WDM system depends on the power (and therefore 
on the data) and wavelength of all other channels 
j#i. As in the case of SPM, XPM is converted into 
amplitude fluctuations through chromatic dis- 
persion. However, the main detrimental effect of 
XPM is time jitter, due to the fact that the other 
signals also change the group delay of channel 
i. The position of the impulses is thus changed 
randomly around an average position, and sam- 
pling before decision does not occur always at the 
same instant within the pulse, which cause a BER 
penalty. If we consider the case of one interfer- 
ing channel, interaction occurs when two pulses 
overlap. Because they propagate at different speeds 
(the group velocities at the wavelengths of the two 
channels), the interaction begins when the fastest 
impulse starts to overlap with the slowest pulse 
and ends when it has completely passed it. This 
phenomenon is called a collision. After one col- 
lision between symmetrical pulses, there is theo- 
retically no memory on the perturbed pulse. The 
problem occurs in the case of an incomplete col- 
lision, for instance, when it begins just before an 
amplifier and then the powers change during the 
collision. In this case, the affected pulse keeps the 
memory through a shifted temporal position. A 
key parameter is to characterize this effect is the 
difference of group velocity between the two chan- 
nels, which is equal to DAX, where D is the disper- 
sion and AA the channel spacing. If this parameter 
is high, the effect will be smaller, because collisions 
will be very rapid. Increasing the channel spacing 
will then reduce the interaction because the differ- 
ence between group velocities is larger. The influ- 
ence of chromatic dispersion is more complex. A 
higher dispersion reduces channel interaction and 
thus phase modulation, but as discussed earlier it 
also increases conversion into amplitude fluctua- 
tions. Further details can be found in Section 1.5. 


1.4.2.5 FOUR-WAVE MIXING 


FWM is another nonlinear process that results 
directly from the Kerr effect. Channels of a WDM 
system beat together in the receiver, giving rise to 
intermodulated sidebands at frequencies that are 
sums and differences of the channel’s frequen- 
cies. Each of these sidebands is modulated with the 


information encrypted on the channels that gave 
rise to it. When a sideband frequency happens to 
fall on or close to one of the channel’s frequencies, 
this channel becomes modulated with unwanted 
information from other channels. This intermodu- 
lation has the same undesirable side effects as simi- 
lar effects well known in radio systems. 

As an illustration, consider a communication 
system utilizing channels that are equally spaced in 
frequency, which is usual in deployed systems, i-e., 
the channel frequencies are f.+mAf, where m is an 
integer. The third-order beating between channels 
0, 1, and 2 at respective frequencies fo, f=fot+ Af 
and f,=f,+2Af produces sideband signals at fre- 
quencies pf,t+qf,t+rf, where |p|+|q|+|r|=3. In 
particular, an intermodulated sideband is gener- 
ated at frequency f)+Af by interaction of three 
channels together (p=1, q=—1, r=—1) but also by 
interaction of channels 0 and 1 only (p=1, q=2). 
This sideband has the same frequency as channel 1 
and thus adds to channel 1 data modulation from 
channels 0 and 2. The same argument applied to 
other channels clearly shows that if the interaction 
is strong enough, every channel becomes contami- 
nated with information from all other channels. 

The magnitude of FWM effects can be char- 
acterized by the power in the intermodulation 
sideband Ptermoa: Lhis power can be calculated 
analytically for pure unmodulated waves, in which 
case it is given by [15,56]: 


Patermod = Niwa y P’exp(-2aL) (1.20) 


where y=22n,/AA,_ represents the strength of the 
Kerr nonlinearity in the fiber, P is the power per 
channel, assumed the same for all channels, d is a 
constant equal to 6 if all channels are distinct and 
9 if there are not. The factor Npwy is the FWM efh- 
ciency, defined as 


Niewm = 


e 4exp(—o.L) sin*(ABrwuL / 2) 
(1—exp(—aL))* 


(1.21) 
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where AByw,y is the phase mismatch between inter- 
acting waves, which depends on chromatic disper- 
sion coefficient D, on its slope, and on the channel 
spacing Af according to 
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? dD 
ABrwm = ar( sar 2? | (1.22) 


In the usual case where the total attenuation of the 
span is high enough (exp(—aL) <1), the efficiency 
(Equation 1.21) is well approximated by 


oO 


TlewM =—3- 52 
ao + ABewa 


(1.23) 


FWM is a phase-matched process: for energy to 
flow effectively from one channel to another, the 
channels must remain in phase, i.e., the phase 
mismatch APyw)y must be small. It means that the 
closer the channel frequencies are (small Af), the 
more efficient FWM is, as indicated mathemati- 
cally by Equations 1.21 and 1.22. This explains why 
the intermodulation power increases with decreas- 
ing channel spacing. Chromatic dispersion plays a 
beneficial role by increasing the phase mismatch 
between channels and thus reducing the FWM 
efficiency, as shown by Equation 1.22. The inter- 
modulation power also increases with increasing 
channel power, and it does so rapidly (as the third 
power in P) because FWM is a nonlinear process. 
Figure 1.3 shows the effect of both dispersion 
and channel spacing on the interference-to-car- 
rier ratio, i.e., the difference between the channel 
power and the intermodulation product power. 
This quantity is plotted versus channel spacing 
for four values of the dispersion typical for chan- 
nels located near the zero-dispersion wavelength. 
This figure simulates a fiber link with a length 
L=100km, a fiber attenuation of 0.2 dB-km'!, a 
dispersion slope of 0.08 psnm™~-km1, a nonlinear 
coefficient y=3W |-km“|, and a launched power per 


channel of 4 dBm. It is clear that a higher disper- 
sion reduces FWM and thus allows a better utiliza- 
tion of the available bandwidth. For example, if a 
ratio of —60dB is required, Figure 1.3 shows that 
this can be accomplished with a 100-GHz channel 
spacing in a standard fiber (D=17 psnm km”), 
but only 210 GHz or higher in a typical nonzero- 
dispersion-shifted fiber (NZDSF, family G.655) 
with a chromatic dispersion of 3 ps/(nm-km). 

In single-channel transmission, a low disper- 
sion is beneficial because it reduces the amount of 
pulse spreading induced by (1) dispersion and (2) 
SPM combined with dispersion, and thus it reduces 
the amount of dispersion compensation needed to 
correct for these effects. In multichannel transmis- 
sion, the situation is not as simple because disper- 
sion now brings protection against interchannel 
effects, XPM and FWM. But the situation depends 
strongly on the channel spacing: for WDM 10G sys- 
tems with a typical channel spacing of 100 GHz or 
less, interchannel effects are dominant compared 
to intrachannel effects (SPM). This is the reason 
why a dispersion-shifted fiber (DSF) with zero dis- 
persion around 1550 nm is much worse for WDM 
transmission than a standard G.652 fiber, and also 
why this fiber provides the smallest channel spac- 
ing at this bit rate (25 GHz). When higher bit rates 
are considered, the channel spacing cannot be 
reduced so much due to the spectral width of the 
modulated signals, and then intrachannel cannot 
be neglected compared to interchannel effects. 


1.4.2.6 STIMULATED BRILLOUIN 
SCATTERING 


SBS belongs to the family of parametric amplifi- 
cation processes. Through interaction between the 
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Figure 1.3 Dependence of the interference-to-carrier ratio due to FWM on the channel spacing. 
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optical signal and acoustic phonons, it causes power 
conversion from the signal into a counterpropa- 
gating signal shifted in frequency by the acoustic 
phonon frequency [1]. The power in the SBS signal 
grows as exp(g,P,—a)z, where g, is the SBS gain, 
in Wknr!, which depends on the wavelength 
separation between the two signals, and « is the 
attenuation of the medium. SBS is a narrowband 
process. In a silica fiber, the Brillouin frequency 
shift at 1.55j1m is v, ~ 11GHz and the gain band- 
width is only Av, ~ 100MHz. It is customary to 
characterize SBS by its pov er threshold, ie., the 
power required to compensate for the medium 
attenuation and thus just begin to provide a posi- 
tive gain. For an unmodulated signal with a line 
width smaller or equal to the SBS gain bandwidth, 
the SBS threshold in a typical 1.5-jm fiber (mode 
effective area of 501m?) is around 21 mW-km, i.e., 
21 mW ina 1-km fiber and 2.1 mW ina 10-km fiber 
[1]. For powers larger than the threshold, a frac- 
tion or all of the signal is converted in the back- 
ward SBS signal. It is therefore essential to keep the 
power in each signal below the SBS threshold, and 
because this threshold is fairly low, SBS limits the 
power of a narrowband signal that can be trans- 
ported over a given distance. This is particularly 
critical in repeaterless systems [25]. 

Several solutions have been demonstrated and are 
routinely applied to increase the Brillouin threshold 
and thus increase the power and/or distance over 
which signal can be transported. When the sig- 
nal amplitude is modulated at bit rates higher than 
~100 MHz, as is the case in WDM systems, the signal 
bandwidth power exceeds the Brillouin line width 
and SBS is reduced. For this reason, SBS is gener- 
ally not a concern in WDM systems. One caveat is 
that the carrier component of the modulated signal 
retains the original line width of the unmodulated 
signal, and it is still backscattered by SBS. Because 
the carrier component carries only half the signal 
average power, the SBS threshold is increased (by 
3dB) compared to an unmodulated narrowband 
signal, but in OOK schemes using high powers SBS 
acting on the carrier has been observed to induce 
signal distortion [33]. 

Because the SBS gain decreases with increasing 
carrier line width Av as Av,/(Av,+Av), another 
solution to further increase the SBS threshold is to 
use a larger carrier line width. This can be done 
with a directly modulated laser (direct modulation 
tends to chirp the laser frequency), by applying to 


the laser either a phase modulation [25,36] or a 
small amount of frequency modulation (at a fre- 
quency much lower than the bit rate) [45]. Other 
techniques include using a duobinary modulation 
scheme to suppress the carrier component [37], 
concatenating fibers with different Brillouin shifts 
to reduce the interaction length [40], and placing 
isolators along the fiber to periodically suppress 
the backward SBS signal [54]. 


1.4.2.7 STIMULATED RAMAN SCATTERING 


Although caused by a different physical mecha- 
nism (interaction with vibrational modes of the 
medium structure instead of acoustic phonons), 
SRS can be modeled in a very similar manner, but 
its characteristics are quite different and so are its 
effects on transmission systems [1]. SRS is an opti- 
cal process that causes power transfer between an 
optical pump and a co- or counterpropagating 
signal. Most solid media exhibit SRS, including 
silica-based fibers. Spontaneous Raman scatter- 
ing occurs when a pump photon of frequency a, is 
scattered by a host phonon of frequency Q, which 
results in the annihilation of the pump photon and 
the spontaneous emission of a signal photon at a 
frequency ®,=@,—Q. This scattering process can 
also be stimulated when an incident signal photon 
of frequency @, interacts with a pump photon and 
a phonon, thus yielding the emission of a stimu- 
lated photon at frequency ,. This stimulated pro- 
cess thus provides what is known as Raman gain. 
The SRS gain spectrum is centered around a fre- 
quency downshifted from the pump frequency 
w,, by the mean phonon frequency Q of the mate- 
rial. The Raman gain spectrum and bandwidth 
are set by the finite-bandwidth phonon spectrum 
of the material. The Raman shift of silica is typi- 
cally 13 THz (or ~100 nm at 1550 nm), which is 
much higher than for SBS. Similarly, the gain full 
width at half maximum is larger, around 8 THz 
(70 nm for a pump around 1.55 1m). However, the 
Raman gain coefficient for a silica fiber is much 
weaker than the SBS gain, by a factor of about 500, 
so the Raman threshold is typically much higher, 
for example, around 1.2 W ina 10-km length of 
1.55-pym communication fiber with a 50-pm? effec- 
tive mode area [1]. Although much weaker, SRS 
can still be deleterious in WDM systems because 
optical channels located at the highest gain fre- 
quencies act as pumps and can be depleted, while 
other channels can be amplified. In conventional 
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systems using only the C-band (30 nm wide), SRS 
does not occur because the maximum separa- 
tion between channels is much smaller than the 
Raman shift. But in systems using both the C- and 
L-bands, power transfer between channels of C- 
and L-bands can be induced by SRS and must be 
taken into account in system design. 

It must be noted that SRS is also a useful mecha- 
nism: a pump signal injected in the fiber can trans- 
fer its power via SRS to one or more signals and 
thus provide amplification. This is the basic prin- 
ciple of fiber Raman amplifiers, which will be con- 
sidered in Section 1.6.4. 


1.4.2.8 POLARIZATION MODE DISPERSION 


A standard single-mode optical fiber does not 
actually carry a single mode but two modes with 
orthogonal, nearly linear polarizations. Because 
the index difference between the fiber core and the 
cladding is small, these two polarization modes 
have nearly degenerated propagation constants. 
However, these propagation constants are not 
exactly the same. In a communication link, the 
signal launched into the fiber is typically linearly 
polarized. The fiber exhibits random linear and 
circular birefringence, and as the signal propa- 
gates through it the signal polarization evolves 
through many states. Because the two orthogo- 
nal polarization modes travel at slightly different 
velocity, one lags behind the other, and because the 
signal is temporally modulated into short pulses, 
after long-enough propagation each pulse is split 
into two pulses. This produces two electrical pulses 
with amplitudes that depend on the polarization 
of the optical signal at the receiver, and separated 
by a random delay called differential group delay 
(DGD). For fibers with strong coupling between 
polarization modes, DGD follows a Maxwell dis- 
tribution. The mean value of DGD is called the 
PMD [3,16,20,22,23]. 

This multipath effect causes ISI and thus 
degrades the BER. Furthermore, random variations 
in the birefringence of the long fiber cause the DGD 
to be a random variable and thus the properties 
of the transmitted signals to be time dependent. 
Communication systems must then be character- 
ized by their outage probability, or outage time, i.e., 
the probability that the BER exceeds the maximum 
tolerable value, above which transmission is no 
longer possible with the required quality. PMD isa 
linear effect, which, just like chromatic dispersion, 


acts on each channel individually but does not 
cause coupling between them. 

WDM systems are usually designed to tolerate a 
PMD approximately equal to one-tenth of the sym- 
bol duration or around 10-12 ps for a 10 Gbit-s“! 
bit rate. When this value is exceeded by a small 
amount, transmission can still be sustained with 
fewer channels, which allows increasing the SNR 
of the remaining channels and provides a better 
resistance against PMD. When PMD is too high 
(for instance 20 ps or more for a WDM 10G sys- 
tem), distortion can cause closure of the eye dia- 
gram and increasing the power does not bring 
any improvement. Currently manufactured fibers 
allow 10 Gbit-s“! transmission over several thou- 
sands of kilometers, and PMD is not a problem at 
this bit rate. Recent advances in manufacturing 
processes have led to fibers with low enough PMD 
values for 40 Gbit-s! transmission over more than 
2000km. 

PMD compensation has been investigated in 
several laboratories, for example, using feedback 
equalizers in either the optical or the electrical 
domain [46,59]. The main application was the 
implementation of WDM 10G systems in existing 
fiber links that could originally not support this 
high bit rate because the fiber exhibited a high 
PMD. Although this method was successful, its 
economic viability has been questioned because 
it requires one equalizer per channel and its high 
cost cannot be shared. PMD compensation will 
certainly need to be implemented in the future 
in communication systems with higher bit rates 
over long distances, which have a reduced toler- 
ance to PMD. For example, a WDM 40G system 
typically requires no more than 2 or 2.5 ps of 
PMD. 


1.5 DESIGN OF AN OPTICAL WDM 
SYSTEM 


1.5.1 Global performance of a 
system: BER and OSNR 


As mentioned earlier, the performance of a WDM 
system is expressed in terms of its BER, which is 
obtained by measuring the number of error bits 
occurring over a given time interval. A minimum 
OSNR value is required in order to achieve the 
required transmission quality, i.e., the BER needs 
to be lower than a given threshold. Commercial 
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equipment is typically specified in terms of OSNR: 
the maximum number of spans is specified for dif- 
ferent losses per amplification span. For example, 
an OSNR of 22dB will be guaranteed for seven 
spans of 25-dB loss (7x25dB) or for ten spans of 
23-dB loss (10x23 dB). 

Another important feature is the system sen- 
sitivity to chromatic dispersion and dispersion- 
compensation strategy. The residual dispersion 
at the receiver input must remain within some 
interval. As an example, a 10Gbit-s! receiver 
will only accept a cumulated dispersion between 
—600 and+800ps-nm". For a particular chan- 
nel, it is always possible to bring the cumulated 
dispersion within this range with proper in-line 
compensation. However, due to the finite disper- 
sion slope, the other channels will experience a 
different cumulated dispersion, and if the link is 
too long and/or the dispersion slope is too high, 
it will not be possible to meet this specification 
for all channels. This limitation could be lifted 
by adjusting the cumulative dispersion channel 
by channel, but this is not practical for economic 
reasons. As a result, chromatic dispersion gener- 
ally imposes an upper limit on the bit rate and 
the optical reach. 


1.5.2 Critical parameters and trade- 
offs for terrestrial, undersea, 
and repeaterless systems 


As discussed earlier, the optical reach and ampli- 
fication span are critical parameters in com- 
munication systems. For a given optical reach 
and a fixed launched power, a shorter amplifica- 
tion span improves the OSNR. Conversely, for 
a given required OSNR it increases the optical 
reach. However, a shorter amplification span also 
results in a more expensive system, more com- 
plex monitoring, and a higher operating cost. 
Moreover, in a terrestrial network the location of 
the amplification sites and the network topology 
in general are parameters that the operator does 
not want to change. The network infrastructure 
and fibers are long-term investments, and they 
are required to be compatible with several gen- 
erations of systems. In particular, the attenuation 
per span is a constrained parameter. Its value is 
imposed by the characteristics of networks where 
systems have to be installed, and it is typically 


in the range of 20-25dB. Technical improve- 
ment goals in terrestrial WDM systems therefore 
consist in increasing the capacity and the optical 
reach within the framework of this attenuation 
per span. 

Undersea systems benefit from an additional 
degree of freedom. Unlike in terrestrial networks, 
the fiber and the system are laid together and can- 
not be separated, and therefore the cable does not 
need to be designed for successive generations 
of systems. The amplification span can then be 
selected without any location constraint. The pre- 
ferred solution is to space the amplifiers equally 
because it simplifies manufacturing, dispersion 
compensation, and cable maintenance. Because 
of the very long optical reach of undersea systems 
(for example, 6500 km between Europe and North 
America and 9000km across the Pacific Ocean), 
the amplification span must be substantially 
reduced down to around 40km. 

In principle, the OSNR can always be improved 
by increasing the power launched into the fiber. 
However, the maximum available power is lim- 
ited by the high cost of high-power amplifiers and 
also by safety rules. Quite independently, as dis- 
cussed earlier, the maximum usable power is lim- 
ited by nonlinear effects in the fiber (SPM, XPM, 
and FWM) toa level that insures that these effects 
are maintained at or below some acceptable level. 
Two strategies are possible to take into account 
this power limitation. The first strategy is to reduce 
the power level when the number of spans is 
increased, because impairments caused by nonlin- 
ear effects are cumulative along the transmission 
line. Systems using this type of approach are called 
linear, or also NRZ (because linear systems use 
NRZ pulses, i.e., pulses with a duration of roughly 
one symbol time). Another strategy is to exploit 
the beneficial effect of “soliton-like” propagation 
regimes, where linear and nonlinear effects cancel 
each other, which requires a precise compensation 
map. These systems are usually called nonlinear, or 
RZ (because they used pulses of the RZ type, i.e., 
significantly shorter than the symbol time and a 
higher peak power than NRZ pulses). 

In summary, the amplifier gain is determined 
by the span loss that needs to be compensated, the 
output power is limited by the cost and technology 
of amplifiers and by fiber nonlinear effects, and the 
optical reach is then given by the minimum SNR 
that can be achieved. 
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1.6 STATE OF THE ART AND 
FUTURE OF THE WDM 
TECHNOLOGY 


1.6.1 State-of-the-art WDM system 
capacity and distance 


The first WDM systems appeared in 1995 and could 
transmit four 2.5 Gbit-s channels. The number of 
channels was rapidly increased to 16, then 32, by 
reducing the frequency spacing between channels 
down to 100GHz (0.8nm). WDM systems were 
predicted to evolve toward even higher channel 
counts by reducing this spacing to 0.4 nm, but this 
change has taken place slowly because of the diffi- 
culty and higher cost of developing components, in 
particular multiplexers and demultiplexers, capa- 
ble of handling signals so closely spaced in wave- 
length. Instead, in the next generation of systems 
the capacity was increased by increasing the bit rate 
to 10 Gbit-s’. WDM 10G systems provide a higher 
capacity, but they are also subject to more severe 
propagation impairments. Chromatic dispersion, 
which can be neglected at 2.5 Gbit-s up to around 
800 km, becomes a critical issue at higher bit rates, 
and dispersion-compensation units must be incor- 
porated in all amplification sites. In 2001, the state 
of the art for typical engineering data provided to 
system operators by suppliers was 80x10 Gbit-s7 
channels with a 50-GHz spacing, or a total capac- 
ity of 800 Gbit-s over a single fiber with an opti- 
cal reach of around 700km. Subsequent progress 


in filtering and laser technology has allowed the 
achievement of 25-GHz channel spacing, offer- 
ing a capacity of 1.6 Tbit-s-! over the 30-nm-wide 
C-band of EDFAs. The capacity of commercially 
available fiber links has therefore been multi- 
plied by a factor 160 in 7years, and the symbolic 
barrier of 1 Tbit-s! has already been exceeded. 
Furthermore, much higher capacities have been 
demonstrated in laboratories, for example, up to 5 
Tbit-s over 12x100km [7] and even 10.2 Tbit-s~! 
over 100km using polarization multiplexing [9]. 

Figure 1.4 depicts the state of the art of WDM 
technology at the time of this writing. Circles rep- 
resent laboratory prototypes, and squares repre- 
sent either commercially available or announced 
systems. The diagonal line represents the 1 Tbit-sé! 
capacity boundary; systems that fall in the 
hatched quadrant above it have a capacity greater 
than 1 Tbit-s-!. The highest commercial capac- 
ity is 3.2 Tbit-s! (8040 Gbit-s). Systems with a 
1.6 Tbit-s“! capacity have been proposed in two con- 
figurations, namely 160 channels at 10 Gbit-s~! with 
a 25-GHz spacing and 40 channels at 40 Gbit-s7 
with a 100-GHz spacing [38]. Demonstrations have 
also been performed at 20 Gbit-s“! per channel, but 
no commercial system has yet been developed at 
this bit rate. 

In spite of these great advances in experi- 
mental communication links, most of the links 
deployed in the world operate at much lower bit 
rates. For example, in France most links operate 
at 2.5 Gbit-s"!. The network is being upgraded to 
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Figure 1.4 State of the art of WDM technology (see text for details). 
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10 Gbit-s“!, but it will take several years before the 
conversion is complete [47]. Operational systems 
in the United States are further along; most of the 
deployed links run at 10 Gbit-s. Very few com- 
mercial systems operate at 40 Gbit-s"'. Again the 
main difficulty is that these systems require more 
precise correction for chromatic dispersion and 
dispersion slope. It means using accurately tailored 
and stable fiber Bragg gratings so that dispersion 
does not take over. The system is then less lenient 
on imperfections in dispersion-compensation cir- 
cuits, and more difficult to develop and manage. 

It is significant that all nodes in deployed ter- 
restrial networks are opaque, i.e., the optical signals 
are detected, turned into electrical signals, ampli- 
fied, switched (routed) electronically into the right 
direction, and turned back into a light signal with 
a local laser oscillator. Such optoelectronic nodes 
constitute a large fraction of the cost of the network. 
The reason why this function is not performed in a 
fully optical manner is the high cost, and to some 
extent loss, of optical components, in particular 
optical switches. Only a few companies have devel- 
oped and are planning on deploying commercial 
transparent networks using all-optical nodes. The 
only market at the moment is secure military lines 
in the United States. Future WDM systems with 
extended optical reach (ultralong haul [ULH] and 
very long haul [VLH], see Section 1.6.3) are an effi- 
cient way to reduce the number of optoelectronic 
regeneration sites and thus reduce the network cost. 


1.6.2 Forward error-correcting codes 


As explained earlier, the compatibility of net- 
work topology with future system generations is a 
very strong requirement for terrestrial networks. 
For example, upgrading an existing link from 
2.5 to 10 Gbit-s! requires an OSNR increase of 
6 dB, which in turn should require a correspond- 
ing increase in power at the receiver. Instead of 
increasing the power, which again is not always 
possible for reasons covered earlier, a solution is 
the use of a forward error-correcting code [13], an 
approach widely used in satellite and radio systems. 
Redundancy is introduced in the input data (prior 
to transmission) and used by an electronic decoder 
to detect and eventually correct some of the errors 
affecting the data sequence through the detection 
process. A coding-decoding scheme is charac- 
terized by the proportion of errors it can detect 


and correct, which is related to the redundancy. 
An error-correcting code therefore allows the 
receiver to accept a smaller OSNR, which results 
in a degraded BER that is corrected by the decoder. 
The price to pay is an increase in transmitted data 
resulting from the required redundancy. 

A very simple example is the repeat code: binary 
bits of information are repeated an odd number of 
times, and the decoder corrects for errors by com- 
paring the like bits of replica and retaining the bits 
that occur most often (a process known as majority 
decoding). This is not a very efficient code, because 
it requires a large amount of redundancy, but 
other, far more powerful codes exist, such as the 
well-known Reed-Solomon and BCH codes, that 
can reduce the BER substantially at the cost of sur- 
prisingly low redundancy [13]. Most of the WDM 
10G systems today utilize a Reed-Solomon code, 
while other more powerful schemes have been 
extensively studied for the next generations of sys- 
tems [2,27]. To illustrate the improvement brought 
by forward-error correction, a BER of 10-° at the 
input of a Reed-Solomon decoder results into a 
BER of roughly 107} at its output. Assuming this 
last value is required by the user, it means that the 
transmission systems is only required to achieve a 
BER of 10-> and can then operate at a lower OSNR. 
This approach allows in particular to gain the 6-dB 
difference in OSNR between WDM 2.5G and 10G 
systems and thus to operate a 10 Gbit-s"! system 
with the same OSNR as a 2.5 Gbit-s"! system, at the 
cost of a modest redundancy (or data rate increase) 
of ~7%. Forward error-correcting codes can also 
clearly be used in 2.5 Gbit-s“! systems to increase 
the optical reach or to operate with higher span 
losses. Forward error-correcting codes are likely to 
play a major role in the higher speed communica- 
tion systems of the future. 


1.6.3 Ultralong-haul technology: 
New problems arising 


One key advantage of the WDM technology is that 
the high cost of amplifiers can be shared between 
all transmitted channels. Any reduction in the 
number of optoelectronic regeneration sites, or in 
other words any increase in the optical reach, is 
then very attractive. This is one major reason for 
the interest in ULH and VLH systems, which can 
operate today at 10 Gbit-s! per channel over respec- 
tive distances of ~1500 and ~3000km. Raman 
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amplifiers are a key element in this technology: 
they allow to improve the OSNR for a given dis- 
tance or equivalently to increase the optical reach 
for a given OSNR. The compatibility of ULH and 
VLH systems with existing infrastructures is also 
a difficult challenge because the OSNR decreases 
as the number of spans increases, and thus the 
accumulation of propagation impairments is much 
more critical. This limitation is of course mitigated 
in practice with error-correcting codes, as well as 
Raman amplifiers. 


1.6.4 Raman amplification 


Raman amplification relies on SRS, a nonlinear 
optical process that causes power transfer between 
an optical pump and a signal (see “Stimulated 
Raman Scattering” section). One of the greatest 
strengths of Raman amplification is that it can 
supply gain at any wavelength provided a suitable 
pump source is available. Since the Raman shift 
of silica is typically 13 THz, to obtain a gain peak 
at 1530 nm the pump wavelength must be ~1430 
nm. With such a pump, gain will be available from 
about 1490-1546 nm. The Raman gain cross sec- 
tion of silica is unfortunately relatively small, so 
the power requirement is much higher than for an 
EDFA, but Raman fiber amplifiers present several 
important benefits that somewhat mitigate this 
disadvantages, including the flexibility of pump 
wavelength selection, the availability of gain any- 
where where pump is available, and the fact that 
the gain medium is the transmission fiber itself. 
Raman fiber amplifiers can be configured in a 
number of ways, each with its own benefits and 
applications. Forward-pumped Raman amplifiers, 
in which the pump and the signals to be ampli- 
fied travel in the same direction, induce cross-talk 
between channels, which is undesirable in WDM 
systems. This is the reason why the backward- 
pumped Raman fiber amplifier is often preferred. 


Signal 


Pump light == 


Line fiber 


Multiplexer 


This configuration is illustrated in Figure 1.5. 
An obvious advantage of the Raman amplifier 
is that it can be easily implemented in any exist- 
ing fiber link by simply adding a pump source at 
the proper wavelength. In particular, it does not 
require the insertion of a doped fiber, which keeps 
the cost down. Also, in the event of pump failure 
the fiber amplifier is still transparent, as opposed 
to an EDFA where an unpumped Er-doped fiber 
is essentially opaque at the signal wavelengths. We 
refer the reader to Chapter A1.6 for a more detailed 
description of Raman amplifiers. 

To illustrate the system benefits of in-line Raman 
amplifiers, Figure 1.6 shows the calculated evolu- 
tion of signal power with distance from the emit- 
ter with and without Raman amplification. This 
figure simulates a link with a fiber attenuation of 
0.22 dB-km“' a total length of 100km, and a Raman 
gain of 10dB around 1.53pm. In the absence of 
amplification, the signal power in dBm decreases 
linearly with distance. When pump is injected at 
the far end of the fiber, SRS adds gain at that end of 
the fiber. The gain decreases away from the pump, 
as a result of pump photons being either scattered 
by the fiber or converted into signal photons. There 
is practically no more incremental Raman gain 
after the pump has traveled about 60km, because 
by then most of it has been consumed. The system 
advantage of Raman amplification is to increase the 
signal power at the receiver and thus to improve the 
SNR of the detected signal. As with any amplifier, 
the greatest benefit to the SNR occurs when the sig- 
nal is amplified before its power becomes too low. 
To understand this basic principle, consider two 
configurations. Configuration (a) is a link of length 
L with an amplifier of gain G placed at the end, and 
configuration (b) is the same link with the same 
amplifier of gain G placed a distance d before the 
receiver. In both configurations, the signal power at 
the receiver is the same, but in (b) the noise power 
is reduced by a factor equal to the loss A of the fiber 
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Figure 1.5 Schematic of a backward-pumped Raman fiber amplifier. 
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Figure 1.6 Typical variation of signal power versus distance with and without a fiber Raman amplifier. 


length d. The in-line amplifier of configuration (b) 
is therefore equivalent to an amplifier located at the 
receiver with a NF reduced by A. 

A backward-pumped Raman amplifier can be 
viewed as an in-line amplifier located before the 
receiver. The gain that it supplies allows (1) reduc- 
ing the launched signal power and thus reducing 
nonlinear effects for a given topology (span loss 
and number of spans); (2) increasing the span loss 
for a given number of spans while maintaining 
the same OSNR; and (3) increasing the number of 
spans for a given span loss while maintaining again 
the same OSNR. Raman amplification is therefore 
a key technology to increase the optical reach and 
upgrade existing networks to higher bit rates. 


1.6.5 Diversification of fibers 
and international 
telecommunications union 
fiber standards 


We have seen how propagation phenomena become 
more and more critical as the bit rate increases. 
These phenomena depend on several system 
parameters, such as the channel power and chan- 
nel spacing, as well as fiber parameters, especially 
scattering loss, chromatic dispersion, and effective 
mode area. It was therefore important, early on, 
to develop standard fibers with set ranges for all 
of their critical parameters. This task was accom- 
plished by the International Telecommunications 
Union (ITU), which defined a number of other 
important standards and terminologies as well, 
such as the so-called ITU grid, i-e., the precise 
values of the discrete wavelengths used in opti- 
cal communication systems around the world. 


One of the most important single-mode standard 
fibers is the G.652 fiber (also known as SSMEF), 
designed for early single-channel transmission in 
the 1550-nm band before the emergence of WDM 
and EDFAs. 

Because the high dispersion of the G.652 fiber 
(17 psnm km!) could be viewed as a drawback 
for single-channel transmission, the standard DSF, 
known as G.653, was designed to achieve zero dis- 
persion at 1550 nm and thus drastically reduce 
signal distortion. As we saw earlier, a DSF is not 
well suited for WDM operation, because channels 
located near the zero-dispersion wavelength suffer 
severe nonlinear effects, especially FWM. G.653 
fiber can be used in WDM systems but at the price 
of significant constraints, such as increased and 
eventually irregular channel spacing. This results 
in a poorer spectral efficiency and thus a higher 
cost, which is the reason why it is generally not 
used in this application. 

A third family of fibers, the NZDSF, also known 
as G.655, became available somewhat later. Their 
main feature is a chromatic dispersion lower than 
that of the G.652 fiber. The initial idea was to choose 
a dispersion value high enough to keep deleteri- 
ous nonlinear effects at a low level while requir- 
ing less dispersion compensation. Identifying the 
right trade-off between dispersion and dispersion 
compensation was not an easy task. The chromatic 
dispersion of commercially available G.655 fibers 
has increased progressively from 3 psnm-km" in 
early fibers to 8 psnm™-km_ for recent fibers. The 
dispersion of the NZDSF family of fibers therefore 
covers a wide range. Although G.652 is certainly 
the most widely used fiber in backbone networks, 
a considerable amount of G.655 fibers has been 
deployed, especially by new operators, in the 
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second half of the 1990s. At a bit rate of 10 Gbit-s7 
per channel, the advantage over G.652 fibers is not 
obvious; excellent performance has been reached 
with G.652 fibers because their higher dispersion 
allows very efficient protection against interchan- 
nel effects, and thus a closer channel spacing. 
The question of fiber choice is more critical and 
certainly still an open issue at higher bit rates 
(40 Gbit-s! and above), although WDM 40G trans- 
mission has been successfully demonstrated over 
G.652 fibers in many laboratories. 


1.6.6 Toward the future: WDM 40G 
systems and beyond 


There is a definite economic advantage in operat- 
ing transmission systems at higher bit rates, in part 
because integration reduces cost, at least when a 
certain level of production is reached and technol- 
ogy is stabilized. For instance, a 10 Gbit-s! emitter 
is less expensive than four 2.5 Gbit-s-! emitters, yet 
both provide the same capacity. The same is true 
for receivers. A drawback of moving toward higher 
bit rates is that the narrower impulses required 
are more sensitive to propagation phenomena, 
and also that the implementation of electronic 
circuits is more difficult. Another consideration is 
the channel spacing. After its amplitude has been 
modulated, the optical signal has a much wider 
spectrum than the original signal produced by the 
emitter. The line width of the signal from a typi- 
cal emitter is of the order of a few MHz, whereas 
after modulation this line width becomes com- 
parable to the modulation frequency, i.e. many 
GHz or tens of GHz. Maintaining a tolerable level 
of cross-talk then requires a higher channel spac- 
ing: for instance, a bit rate of 40 Gbit-s“! is certainly 
not compatible with a 25-GHz channel spac- 
ing. However, with a 100-GHz channel spacing a 
40 Gbit-s"! bit rate becomes possible. A sample of 
recent research in 40 Gbit-s! systems is given in 
“Selected Recent Results” section. 

In response to the growing demand for higher 
speed communications, extensive research efforts 
have been expanded in industrial laboratories 
toward the development of WDM 40G systems, 
which are considered to be the next generation for 
optical data transmission. The main difficulties, as 
stressed earlier, are that this type of system has a 
much tighter tolerance to most design parameters, 


including chromatic dispersion, FWM, and PMD, 
and more accurate forms of compensation will be 
required. In addition, advanced research is already 
being conducted on even higher bit rates per chan- 
nel, 80 and even 160 Gbit-s"! [53,61]. The objective 
of this research is primarily knowledge acquisition. 
At this point it is difficult to predict whether these 
systems will ever be viable and in what time frame, 
and whether they will be cost-effective compared 
to existing generations. 


1.6.6.1 INCREASING THE NUMBER OF 
CHANNELS BY INCREASING THE 
AMPLIFICATION BANDWIDTH 


For a given channel spacing, or when the mini- 
mum channel spacing imposed by the bit rate is 
reached, it is still possible to increase the capacity 
by increasing the number of channels. The channels 
used in practice fall in one of three bands, which 
are defined by the amplification bands of EDFAs, 
as opposed to other requirements. The C-band 
(1530-1565 nm) is the region where the EDFA is 
most efficient, i-e., where it provides the most gain 
per unit pump power. This is the band that is used 
first, and in many installed networks it is the only 
band that is used. The L-band (1565-1610 nm) 
is used when the C-band is full and additional 
channels need to be added, even though L-band 
EDFAs are less efficient and provide a lower gain 
than C-band EDFAs. Generally, an EDFA cannot 
be optimized to provide gain efficiently in both 
bands, so an in-line amplification site for both 
C- and L-bands typically consists of two separate 
EDFAs, one operating in the C-band and the other 
one in the L-band. The two amplifiers are often 
placed in a parallel configuration, as illustrated in 
Figure 1.7. A demultiplexer separates the incom- 
ing WDM signals into C-band signals, which are 
sent to the C-band EDFA, and L-band signals, 
which are sent to the L-band EDFA. A multiplexer 
placed at the output of the two EDFAs recombines 
the amplified signals onto the same fiber. Serial 
configurations are also possible. These configura- 
tions are not specific to the L- and C-bands: they 
can be used to multiplex any bands, in principle 
in any number. It should be pointed out that the 
economical advantage of using both the C- and 
L-bands in the same system is not obvious: ampli- 
fication must be performed in two separate ampli- 
fiers, so that this technique allows only sharing 
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Figure 1.7 Diagram of a parallel arrangement of L-band and C-band EDFA. 


the communication fiber. The benefits of using the 
L-band are also mitigated by other considerations. 
First, the fiber loss is higher in the L-band, and 
the C-band/L-band multiplexers introduce losses 
as well, in both bands, which degrade the power 
budget and the OSNR. The trade-off between these 
extraneous losses and OSNR degradation on one 
end, and the cost of deploying additional fiber on 
the other, needs to be carefully examined. Second, 
L-band amplification is more costly than C-band 
amplification. The reason is that it requires a dif- 
ferent set of components than C-band amplifiers, 
and these components are more expensive because 
they do not benefit from the same economy of 
scale. As a result, very few systems in the world 
use the L-band, except in Japan where new tech- 
nologies tend to be deployed earlier than elsewhere 
and where existing fibers happen to have the right 
amount of GVD for WDM communication in the 
L-band. Use of the L-band is often a good solution 
for G.653 fibers: chromatic dispersion is higher in 
the L-band than in the C-band, and it provides 
some protection against deleterious nonlinear 
effects such as FWM. 

Very recently, EDFAs have also been designed 
to provide gain in the S-band (~1480-1530 nm) [4]. 
Although the gain and the gain efficiency in the 
S-band are even lower than in the L-band, they 
are both respectable, and the S-band is a pos- 
sible future spectral window to extend the usable 
bandwidth of silica fibers. The S-band is in fact 
already used in metropolitan communication sys- 
tems without amplification. However, amplifica- 
tion in the S-band is costly, even more than in the 
L-band. No deployed system uses amplification in 
the S-band, except perhaps isolated experimen- 
tal systems. Because of the higher costs of L- and 
S-bands, communication companies generally 


find it economically preferable to use up the capac- 
ity of the C-band in a given link before starting to 
utilize other bands. 


1.6.6.2 INCREASING THE NUMBER OF 
CHANNELS WITH A CLOSER 
CHANNEL SPACING 


The above considerations show that filling the 
C-band is one of the most efficient solutions for 
increasing the bandwidth. The channel spacing 
has been normalized by ITU to 100 GHz, but sub- 
grids with a spacing of 50GHz and even 25GHz 
can be used. NX 10 Gbit-s! systems available today 
have commonly 100- and 50-GHz channeling, and 
25 GHz has been proposed [38]. As is well known 
in the radio domain, a convenient parameter to 
describe the fraction of the available bandwidth 
that is actually utilized is the spectral efficiency. It 
is defined as the total transmitted bit rate (number 
of channels times channel bit rate) divided by the 
occupied bandwidth. For example, a 16 x 10 Gbit-s*! 
link with a channel spacing of 100 GHz has a total 
capacity of 160 Gbit-s"! and occupies a bandwidth 
of 16x 100 GHz, so the spectral efficiency is 0.1 bit 
s-Hz"1. The highest value obtained so far in com- 
mercially available optical systems based on OOK 
modulation is around 0.4 bit s--Hz7 for 10 Gbit-s7 
with a 25-GHz channel spacing in the C-band, and 
0.8 bit s-Hz™ has been demonstrated in the labo- 
ratory. These values remain modest compared to 
the spectral efficiencies of 4 or 5 bit s-Hz™ cur- 
rently achieved in radio systems. Such high val- 
ues are made possible by multilevel modulation 
schemes. Similar schemes also exist in optics [6], 
but they have not yet been implemented in com- 
mercial systems. 

Figure 1.8 depicts the evolution of the spectral 
efficiency of commercially available WDM systems 
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Figure 1.8 Historic evolution of the spectral efficiency (in bit s~'-Hz™) of optical systems. 


since 1994. Higher values have been achieved in the 
laboratory up to 1.28 bit-s--Hz™ [9]. Reducing the 
channel spacing has required several technological 
improvements over the years, including increas- 
ing the stability of demultiplexing filters and of the 
channel wavelengths (to this end, lasers are now 
equipped with wavelength lockers). The ultimate 
achievable channel spacing depends strongly on 
the fiber. As mentioned earlier, G.652 fiber is a very 
good candidate because its high chromatic disper- 
sion reduces the threshold of FWM. 


1.6.6.3 MODULATION SCHEMES 


Optical systems use currently an OOK intensity 
modulation format. The advantage of this modula- 
tion scheme is that it is rather simple, and that it 
does not require excessive signal power. However, 
its main drawback is that it does not allow to 
fully exploit the large available bandwidth of the 
fiber. As mentioned in the previous section, more 
sophisticated schemes can be used to improve the 
spectral efficiency, in particular multilevel modu- 
lation schemes. However, it is also well known that 
the modulation format has a strong influence on 
the penalties imposed by propagation phenom- 
ena. For example, to achieve the same BER, mul- 
tilevel modulation schemes require a higher power 
than a binary modulation scheme, which means a 
higher sensitivity to propagation effects. Alternate 
schemes have been extensively investigated for 
long-haul systems, especially for undersea light 
wave transmission. The duobinary code has been 


proposed by different laboratories as a good can- 
didate, providing a better resistance to chromatic 
dispersion [60]; phase-shaped binary transmission 
is another technique, which is a modified version 
of duobinary code [14]. Vestigial sideband (VSB), 
a very well-known technique in radio communi- 
cation systems, could also allow to reduce chan- 
nel spacing and thus improve spectral efficiency 
[8,9]. More recently, DPSK has also been proposed 
[24]. Although OOK remains today practically the 
only one used, at least for terrestrial systems, it 
is clear that improving the modulation format to 
better utilize the capacity of existing systems is a 
key issue in research. This effort is likely to lead to 
major improvements in the future optical trans- 
mission systems. 


1.6.6.4 SELECTED RECENT RESULTS 


Recently, publications show that the field of commu- 
nication systems is very prolific, and novel systems 
with very impressive performance continue to be 
demonstrated. Table 1.1 summarizes the characteris- 
tics of some of the most remarkable systems reported. 
in recent years. The objective is not to be exhaus- 
tive—dozens of examples could be compiled—but to 
illustrate some of the concepts mentioned earlier and 
to give the reader a fair notion of the state of the art in 
research and development at the time of this writing. 
It has to be kept in mind that comparison of capacity 
is not necessarily meaningful because transmission 
distance, amplification span, and spectral efficiency 
are not identical. 


Table 1.1 Selected examples of recent experimental high-capacity transmission lines 


Total Channel Spectral Optical 
Bit rate Number of capacity spacing Modulation efficiency reach Amplification 
(Gbit-s-") channels (Tbit s-*) (GHz) scheme (bit s-1-Hz-") (km) span (km) Bands References 
Terrestrial Systems 
42.7 125 5 VSB 1200 100 C+Lb [7] 
40 80 3.2 Duobinary 300, 100 C+Raman [10] 
G.655 
40 273 10.9 116 58 S+C+L [21] 
10.9 VSB+Pol. 1.28 100 NA Raman [8] 
div. mult 
Transoceanic Systems 
365 10 3.65 NRZ+error 6850 ~50 [57] 
correcting 
code 
Repeaterless Systems 
160 10 1.6 25 380 NA Raman+remote C-band [39] 
EDFAS+C+L+Raman 
104 40 4.16 125 0.32 135 NA [11] 
25 40 1 306 NA [55] 
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2.1 INTRODUCTION 


Telecommunication networks in all their vari- 
ous shapes are indispensable to bring information 
quickly anywhere and anytime, which is a vital 
need of our modern global society. Since the inven- 
tion of the electrical telegraph by Samuel Morse in 
1837, the variety of telecommunication services 
has grown at an increasing pace, as illustrated in 
Figure 2.1. In addition, the services are becom- 
ing ever more individualized, and along with the 
penetration of video-based services (“a picture 
says more than a thousand words”) the request 
for information transport capacity has exploded 
and is continuing to do so. Since the early 1990s, 
the introduction of the worldwide Internet has 
drastically promoted this information transport 
explosion. The number of Internet hosts is still 
increasing exponentially; from January 1992 to 
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January 1997 to January 2002, it grew from 727 
thousand to 19.5 million to 147 million worldwide. 
This is causing data traffic to take an ever-larger 
share of the telecommunication network capacity; 
since a few years, it has surpassed the volume of the 
traditional voice traffic (but not yet its revenues). 
Wireless mobile telecommunication is attracting 
ever more users, and enables a fast roll-out of ser- 
vices to the end users without the need to install 
extensive first-mile customer access networks. The 
telecommunication market liberalization has pro- 
vided ample opportunities to the entry of more 
operators and service providers, and the resulting 
national and international competition is pressing 
for very efficient high-capacity telecommunication 
networks. 

As a result, the volume of telecommunication 
traffic is ramping up at a compound annual growth 
rate of roughly 60%, which means an increase with 
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Figure 2.1 Evolution of telecommunication services. (From Consortium British Telconsult/Consultel/ 


Detecon/Sofrecom et al.) 


a factor of 10in. no more than 5years. This traf- 
fic is in vast majority carried along fixed-wired 
networks, due to their high reliability, security, 
and immunity for external disturbances. Wireless 
networks are coming up in customer access envi- 
ronments; but due to increasing microwave car- 
rier frequencies and user densities, the wireless 
cells are shrinking and thus extensive fixed access 
network lines are still indispensable as the vessels 
to feed the wireless antenna stations. Traditional 
coaxial and twisted-pair copper cables have been 
the transport media of choice since the introduc- 
tion of telecommunication networks. However, 
the advent of optical fiber with its extremely low 
losses and extremely large bandwidth as pioneered 
by Kao and Hockam in 1966, and the commercial 
introduction of optical fiber communication sys- 
tems in the early 1980s has caused that single-mode 
optical fiber has become the transport medium 
uniquely used in long-distance fixed-wired core 


transport networks, and that it is also conquering 
at increasing pace the area of metropolitan and 
access networks. 


2.2 TELECOMMUNICATION 
NETWORKS HIERARCHY 


Telecommunication networks are carrying traf- 
fic at various aggregation levels, as illustrated in 
Figure 2.2. 

At the highest level, long-haul core networks are 
transporting huge data capacities in the tens of 
terabits/second over large distances, such as over 
transnational and transoceanic links up to 9000km 
(transpacific). These global and wide area networks 
are transporting circuit-switched data following 
the SDH (or SONET) standards, where each fiber 
usually carries a number of wavelengths at bitrates 
of up to 10-40 Gbit s"! each. They have to meet 
extremely high levels of reliability and availability, 
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Figure 2.2 The hierarchy of telecommunication networks. 


considering the huge volume of customers depen- 
dent on them. Taking into account the increasing 
dynamics in the traffic matrix describing the data 
flows between the various network nodes, packet 
switching techniques are being introduced, which 
can offer a more efficient utilization of the net- 
work’s resources than circuit switching. 

Metropolitan area networks (MANs) are cover- 
ing large urban areas with a reach of up to 100km 
and capacities of tens of gigabit/second, serving in 
particular business parks and residential customer 
access regions. High availability for large-volume 
fast file transfer is a major need for the business 
customers. Also these networks should be easily 
scalable for adding more network nodes, and flex- 
ible to accommodate new business needs. Storage 
area networks (SANs) are specifically employed for 
regularly moving large volumes of data between 
geographically separated sites, in order to safe- 
guard vital business information. 

Access networks are providing a wide variety of 
services to the end customers, and consist of fiber 
feeder networks followed by various first-mile net- 
works. These networks are mostly optimized for a 


particular set of services, and exploited by different 
operators. Coaxial cable network operators offer 
television and radio broadcast services, and since 
recently also data modem services and telephony, 
multiplexed in different frequency bands. The 
public switched telephone network (PSTN) uses 
twisted pair copper cables and is carrying voice 
telephony and data services, time-multiplexed 
according to the SDH/SONET or ATM standard; 
it is exploited by the incumbent telecom operators 
as well as new entrants. Mobile network opera- 
tors are mainly providing wireless voice telephony, 
according to among others the GSM standard; also 
wireless data services using GPRS and UMTS are 
coming up. The statistics of traffic in access net- 
works shows much higher dynamics than in metro- 
politan and core networks, due to the significantly 
lower traffic aggregation levels. Therefore, applying 
packet switching instead of circuit switching can 
improve remarkably the network utilization effi- 
ciency. Access networks have to be laid out very 
cost-effectively, as the factor with which network 
equipment is shared among customers is much 
lower than in metropolitan and in core networks. 
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Figure 2.3 Network topologies. (a) Mesh, 
(b) Ring, (c) Bus, and (d) Tree -and- branch. 


2.2.1 Network topologies 


A number of general network topologies as shown 
in Figure 2.3 can be discerned for implementa- 
tion of the various hierarchical network layers. 
Each of them has a specific set of characteristics, 
which makes it suited to match the requirements 
of a certain layer. Mesh networks, as exemplified in 
Figure 2.3a, provide a number of options to route 
traffic between two network nodes. This routing 
redundancy yields a large availability of the net- 
work services, which is a highly valued merit in 
long-distance core and metro-core networks. The 
entailed extra costs are of less concern due to the 
large resource-sharing factor among the huge 
customer base served. Ring networks, in particu- 
lar when composed of both an inner and an outer 
ringlet as shown in Figure 2.3b, provide clockwise 
and counterclockwise traffic routing options; thus 
also network protection is established in order to 
yield a good network availability (albeit at a lower 
level than in mesh networks, but also at lower costs 
as less network resources are needed). The com- 
bination of good availability and moderate costs 
of resources makes ring topologies well suited to 
implement metropolitan networks. Bus networks, 
as shown in Figure 2.3c, use a common single lin- 
ear medium along which signal power is tapped 
off to the various nodes (with power losses accu- 
mulating at each subsequent tap). This topology 
is quite cost-efficient, as a minimum of network 
resources is needed. The nodes may exchange 
information as peers in the network, which makes 
this topology suited for linking data processing 
equipment. However, no routing redundancy is 
provided, and therefore there is no guarantee for 
good network availability. In tree-and-branch net- 
works, exemplified in Figure 2.3d, a relatively long 
single feeder line is running from a headend node 


to a power splitting point, from where the signal 
power is distributed via short lines to a number of 
end nodes. This topology is most suited for broad- 
casting information from a single headend node 
to many customer end nodes. No routing redun- 
dancy is provided, so again network availability is 
limited. Also the topology is very cost-efficient, as 
the costs of the feeder line and the headend equip- 
ment is shared by all the end nodes. In addition, 
the signal power distribution is more efficient than 
in the bus network, as the power splitting loss (in 
decibels) to an end node increases only with the 
logarithm of the number of end nodes whereas 
it increases linearly with this number in the bus 
network. Every end node receives the same power 
level, which relaxes the dynamic range over which 
the end node equipment has to operate. The tree- 
and-branch topology is therefore well suited and 
popular for access networks. With a fully passive 
optical power splitter in the branching point, the 
topology is also widely known as the passive opti- 
cal network (PON). 

In the next sections of this chapter, architec- 
tural aspects and key functionalities needed at 
the subsequent hierarchical network layers (core 
networks, metropolitan networks, and access net- 
works) will be discussed in more detail. 


2.3 CORE NETWORKS 


The main task of core networks is to transport huge 
amounts of telecommunication traffic over large 
distances in a highly reliable way. The network 
should not break down by failures in one or ina few 
links. Therefore, provisions have to be included for 
alternative routing of traffic, which are adequately 
offered by the mesh network topology with cross- 
connect functions in the nodes. The links between 
the nodes are usually very long (>100km), and 
intermediate signal amplification and compensa- 
tion of fiber dispersion effects is needed. 


2.3.1 Optical signal multiplexing 
techniques 


Using electrical time division multiplexing (ETDM), 
commercial systems support bitrates up to 10-40 
Gbit s“! and by direct laser diode modulation (or a 
laser diode followed by an external modulator) these 
bitrates can be carried through a fiber link via single 
wavelength channel. The transport capacity of the 
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Figure 2.4 Wavelength division multiplexing. 


fiber link can basically be increased further in two 
ways: by increasing the number of wavelength chan- 
nels, and by increasing the bitrate per channel. 
Multiple wavelength channels can be carried by 
a single fiber by combining them at the transmit- 
ting end by means of a wavelength multiplexing 
device, and separating them again at the receiv- 
ing end by a wavelength demultiplexer. This wave- 
length division multiplexing (WDM) approach is 
shown in Figure 2.4. Standard single-mode fiber 
offers low dispersion in the wavelength window of 
1285-1330 nm, which amounts to about 8 THz of 
bandwidth, and low attenuation in the 1500-1580 
nm window, corresponding to about 10 THz. The 
popular 1530-1560 nm window (the C-band, the 
operation range of erbium-doped fiber optical 
amplifiers) represents about 3.8 THz of bandwidth. 
Wavelength channel spacings of 100 GHz according 
to ITU-T G.692 (or even down to 25 GHz) are being 
deployed, which enables hundreds of wavelength 
channels to be accommodated in a single fiber. 
Commercial systems are available which carry 160 


ABCD 
4 x 40 Gbit s-! ETDM 


Receivers 


Data out 
1 


Data out 
2 


Data out 
3 


wavelength channels at 10 Gbit s“! each, amounting 
to a total 1.6 Tbit s"! capacity. The record obtained 
in research stands at 273 wavelengths at 40 Gbit s“! 
each, amounting to 10.92 Tbit s“! [1]. 

Increasing the bitrate per wavelength channel 
beyond the limits of ETDM can be achieved with 
the so-called optical time division multiplexing 
(OTDM) technique. As illustrated in Figure 2.5, at 
the transmitter side a single laser diode generates 
a sequence of equidistant narrow optical pulses. 
After splitting and distribution to four fast optical 
gates, the pulse train is on/off switched in each gate 
by an electrical time-multiplexed data stream. The 
modulated pulse trains are delayed with respect 
to each other, and subsequently interleaved (like a 
“zipper”). The individual pulse trains may all have 
the same polarization (SP, single polarization), or 
alternating polarization (AP). The optical pulses 
need to be sufficiently narrow in order to avoid 
crosstalk. With the OTDM-AP scheme, somewhat 
broader optical pulses are allowed than with the 
OTDM-SP scheme. The resulting output signal is 


OTDM-SP 
DCBA 


—t 
m=> 160 Gbit s-! RZ 


OTDM-AP 
DB 


Figure 2.5 Optical time division multiplexing transmitter. 
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Figure 2.6 Optical time division demultiplexing receiver. 


a modulated optical pulse data stream at a speed 
that is the sum of the speeds of the electrical input 
data streams. At the receiver side, first the clock 
signal needs to be recovered from the high-speed 
data stream. Using this clock signal and appropri- 
ate time delays, fast optical gates followed by opti- 
cal receivers with PIN photodiodes can demultiplex 
the original constituting pulse trains, as shown in 
Figure 2.6. Using these OTDM techniques, the next 
bitrate hierarchy of 160 Gbit s“! can be realized with 
40 GHz electronics. In research, the record has been 
set at 1.28 Tbit s“! by polarization-multiplexing two 
OTDM-SP 640 Gbit s“! streams [2]. 

The chart shown in Figure 2.7 indicates how 
by increasing the data rate per wavelength chan- 
nel by means of advances in electrical and optical 
TDM at one hand, and by increasing the number of 
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wavelength channels at the other hand, the trans- 
port capacity of a single fiber has been enormously 
increased since the introduction of optical fiber 
communication systems in the early 1980s. The 
main leap forward was made with the introduction 
of wavelength multiplexing in the early 1990s. And 
opportunities for further capacity growth are still 
being created by opening new wavelength bands 
such as the S-band from 1450 to 1530 nm, and the 
L-band from 1560 to 1620 nm, supported by alterna- 
tive optical amplifying processes due to fiber nonlin- 
earities such as Raman gain. Optical gain in the 1300 
nm window can be provided by fiber amplifiers using 
rare earth materials such as praseodymium and neo- 
dymium, and by semiconductor optical amplifiers. 
The present system capacity record stands at 10.92 
Tbit s! deploying 273 wavelength channels at 40 
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Figure 2.7 The evolution of transport capacity of a single fiber link. 
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Gbit s-! each, as mentioned earlier. Applications for 
multi-terabits/second systems are found not only in 
transcontinental and transoceanic systems, but also 
for massive data processing systems such as huge 
synthesized antenna array systems for astronomical 
observations (e.g., the RETINA system [3)). 

In addition to wavelength multiplexing, opti- 
cal amplification and dispersion compensation 
are key techniques to enable high volume data 
transport over long fiber links. A single optical 
amplifier can handle many wavelength channels 
simultaneously; otherwise, an equivalent number 
of opto-electronic regenerators plus a wavelength 
multiplexer and demultiplexer would be needed, 
which is clearly more costly, requires more mainte- 
nance and powering, and requires a sizeable adap- 
tation effort when the system needs to be upgraded 
with more wavelength channels. 


2.3.2 Traffic routing 


The deployment of multiple wavelength channels 
is not only beneficial for increasing the transport 
capacity of a single fiber link, but also provides more 
flexibility to route traffic streams in the network. 
Wavelength channels can constitute independent 
optical paths through the network, and each path 
may be laid out individually to establish an optimum 
connection between certain end nodes. As shown 
in Figure 2.8a, within each node the wavelength 
channels are routed by means of optical crosscon- 
nects which guide the incoming signals depend- 
ing on their wavelength and the entrance port to 
a specific output port. The routing table is usually 
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set by the network management system, and can be 
altered when needed by changing traffic conditions. 
When two optical paths do not touch each other, 
they may be established with the same wavelength. 
This wavelength re-use reduces the overall number 
of wavelengths needed in the network. In Figure 
2.8a, for instance, wavelength Ai is used twice, but 
in different paths. As illustrated in Figure 2.8b, fur- 
ther reduction of the number of wavelengths needed 
can be obtained by using wavelength converters in 
the nodes; thus an optical path may be constituted 
by a sequence of different wavelengths in a series of 
links. By controlling the crossconnects in the nodes, 
the network management system can optimize the 
traffic flow routings to obtain a good balancing 
of the network load among the links, and thus to 
reduce the probability of congestion and to increase 
the network’s efficiency. Also link failures can be 
circumvented by routing traffic through alternative 
links, which improves the reliability of the network. 

The basic layout of an optical crossconnecting 
node is shown in Figure 2.9. The N wavelength 
channels carried by each of the M input fibers are 
firstly separated by a wavelength demultiplexer, 
and subsequently a large optical matrix switch with 
(M x N) input ports and (M x N) output ports is 
needed which can route any wavelength channel 
from any input fiber to any of the M output fibers. 
For larger numbers M and N, the internal architec- 
ture of the matrix switch becomes quite comprehen- 
sive. Composed of individual 2 x 2 optical switches, 
a Benes switch architecture as shown in Figure 2.10 
would require (MN/2)(27log MN-1) switches; and 
2?7log MN-—1 switches have to be passed from any 
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Figure 2.8 Establishing wavelength paths. (a) Without wavelength conversion and (b) with wavelength 


conversion. 
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Figure 2.9 Optical crossconnect. 
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Figure 2.10 Re-arrangeable nonblocking Benes switching matrix. 


input to any output causing losses which increase 
with the switch size. Three-dimensional free-space 
optical switches using micro-mechanical mirrors 
for beam steering between the array of input fibers 
and the array of output fibers exhibit losses which 
do not increase with the number of switch ports, 
and thus can outperform the Benes planar architec- 
ture [20, 21]. 

The amount of packet-based data traffic is grow- 
ing fast in all telecommunication network layers, 
a.o. due to the steeply rising use of Internet ser- 
vices. IP packets are usually carried over SDH/ 
SONET and/or ATM, which in their turn are car- 
ried in WDM channels; efficiency can be gained, 
however, by transporting the IP packets directly in 
the WDM channels, with some simple framing for 
basic synchronization functions. By using packet 
switching in the optical crossconnect nodes instead 
of circuit switching, the network capacity can be 
exploited much more dynamically in response to 
instantaneous traffic demands, and thus the net- 
work operation efficiency is improved. Following 


the GMPLS protocol, wavelength-switched paths 
can be set up in the network in a similar way as 
label-switched paths in the MPLS protocol. The 
packets can be marked by assigning a wavelength 
to them, which acts as a label. Based on these labels, 
a path is established for each packet through the 
network; see Figure 2.11. The per packet label swap- 
ping needed for efficient forwarding and routing is 
achieved by wavelength converters in the nodes, 
which may be realized with fast tunable lasers 
and an all-optical wavelength converter (such as a 
Mach-Zehnder interferometer with semiconductor 
optical amplifiers in its branches). Another way to 
attach label information to a packet is to incorpo- 
rate it in the packet data frame, or to modulate it on 
a subcarrier frequency outside the spectrum of the 
packet data. Even more comprehensive label infor- 
mation may be attached by frequency shift keying 
(FSK) modulation of the optical carrier (or dif- 
ferential phase shift keying, DPSK), orthogonally 
to the payload data that is intensity modulated on 
the carrier [4, 5]. Using wavelength converters and 
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Figure 2.11 Optical packet routing. 
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Figure 2.12 Fast optical crossconnect using wavelength conversion. 


a wavelength-selective passive router (such as an 
arrayed waveguide grating router, AWGR), a fast 
optical crossconnect can be realized; e.g., in an 
architecture as shown in Figure 2.12. Variable delay 
lines are included to avoid collisions of packets that 
are heading for the same output fiber at the same 
wavelength. Congestion can also be avoided by 
temporary buffering in the recirculating loops; the 
loops plus power splitter enable optical multicast- 
ing as well. 


2.4 METROPOLITAN AREA 
NETWORKS 


MANs have to bring a variety of services to large 
urban areas, typically at bitrates up to 10 Gbit s~’ 
per wavelength over distances between nodes of 
less than 100km. Major customers to be served 
are business parks, where customers mainly ask 
for fast large-volume file transfer, e.g., to intercon- 
nect their offices and for storage at safe locations 
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Figure 2.13 Multiwavelength MAN with fixed wavelength routing via hub. 


of vital business data (storage area network). As 
high reliability is required while network costs per 
customer should be kept limited, the ring network 
topology is well suited for providing good network 
protection at moderate costs. 

An example layout of a MAN ring network is 
shown in Figure 2.13. Typically, the ring is com- 
posed of two fibers, on which the traffic flows 
clockwise and counterclockwise, respectively. The 
fiber links between neighbouring nodes are less 
than 20km, and the ring circumference is usu- 
ally less than 100km; therefore no in-line optical 
amplifiers are needed. Each node uses a specific set 
of wavelength channels, typically carrying bitrates 
up to 2.5 Gbit s-! per channel. Each node is com- 
municating with the hub node; in the hub, opto- 
electric-optical translation to another wavelength 
set is done in order to establish communication 
between ring nodes. So virtually in the network 
there are point-to-point node-to-hub connections. 
The MAN typically contains up to 16 nodes, and 
up to 40 wavelength channels. Per node, one or 
more out of the set of wavelength channels can be 
dropped and/or added by means of optical add/ 
drop multiplexers (OADMs). Through the hub, 
also communication to other networks can be 
established. 


2.4.1 Network protection 


The two-fiber topology enables self-healing of the 
ring network in case of a cable break. As shown in 
Figure 2.14, a link failure may be circumvented by 
looping back the traffic at the nodes neighbouring 
the broken fiber cable. This procedure also allows 
a ring segment to be taken temporarily out of ser- 
vice without disrupting the traffic on the remain- 
der of the ring network, e.g., for maintenance or for 
inserting a new node. In the SONET standard, the 
2-fiber uni-directional path-switched ring (UPSR) 
is composed of an outer primary path ring, car- 
rying clockwise the normal working traffic; see 
Figure 2.15. The inner ring provides the counter- 
clockwise protection path. In the SDH standard, 
this concept is called sub-network connection 
protection (SNCP). Both rings are fed from trans- 
mitters at the nodes, and at the node receiver, the 
signal from the ring with the best quality is cho- 
sen. Bi-directional traffic between two nodes will 
span around the entire ring, and thus consumes 
resources on every link of the ring. The 4-fiber 
bi-directional line-switched ring (BLSR) in the 
SONET standard (called multiplex section shared 
protection ring, MS-SPRing, in the SDH standard) 
consists of an outer bi-directional 2-fiber primary 
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Figure 2.14 Self-healing by looping back at the OADMs. 


Figure 2.15 Protection in the 2-fiber SONET 
UPSR (SDH SNCP) network. 
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loop carrying the normal working traffic, and an 
inner bi-directional 2-fiber secondary loop for pro- 
tection. Bi-directional traffic between two nodes is 
only sent along part of the ring, and does not involve 
resources on other parts of the ring. As shown in 
Figure 2.16, the system is well protected against 
cable breaks or node failures. In IEEE 802.17, the 
resilient packet ring (RPR) is being discussed for 
standardization [6]. It consists of a dual counter- 
propagating ring, with up to 256 node stations and 
spans up to 6000km. The RPR is able to operate 
with any packet protocol (such as Ethernet), and 
uses frame-based transmission based on the stan- 
dard gigabit Ethernet frame of variable length, 
but also supporting jumbo frames up to 9 kbytes. 
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Figure 2.16 Protection in the 4-fiber SONET BLSR (SDH MS-SPRing) network. (a) Bypassing a fiber 
break or malfunctioning card and (b) reconfiguration in case of a cable break or node failure. 
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Figure 2.17 Protection in the resilient packet ring network. (a) Traffic flows along the outer ring before 
cable out, (b) wrapping of the data path after the cable cut and (c) traffic steering, yielding a shortest 


data path after discovery of the new topology. 


Figure 2.18 Multiwavelength MAN with dynamic wavelength routing between nodes. 


It is optimized for providing multiple IP service 
classes: reserved traffic class AO (guaranteed data 
rate, small bounded delay and jitter), high prior- 
ity class Al (committed rate, bounded delay and 
jitter), medium priority class B (committed rate, 
and excessive rate subject to fair access), and low 
priority class C (best effort, subject to fair access). 
The RPR management system is able to provide 
ring survivability within 50 ms. It selects the inner 
or outer ring to establish the shortest path. When 
a ring failure occurs, the traffic is first wrapped at 
the failure point, subsequently the damaged ring 
topology is discovered, and the source traffic is 
steered to establish the shortest path again taking 
the failure into account (see Figure 2.17). 

The nodes in the MAN may also route the 
wavelength channels dynamically between them- 
selves without relaying via a hub, as exemplified 
in Figure 2.18. By means of an appropriate algo- 
rithm, and by deploying tunable laser diodes and 


tunable OADMs at the nodes, the most appropri- 
ate wavelengths can be assigned to the communi- 
cation paths needed between nodes. When paths 
do not overlap, the same wavelength may be used 
(wavelength re-use), which reduces the number of 
wavelengths needed. 


2.4.2 Optical add/drop multiplexing 


Essential network elements in the ring-shaped 
MANSs are OADMs, which are able to extract data 
stream on a particular wavelength channel (or sev- 
eral channels) from the ring, and to insert new data 
streams on one or more wavelength channels into 
the ring. Signalling information controls whether a 
data stream will be dropped at a node or will pass 
through it. There are various ways to transfer this 
signalling information to the nodes, without inter- 
fering with the data streams. It may be modulated 
on a subcarrier frequency that is positioned above 
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the data spectrum, and each wavelength channel 
may carry a unique subcarrier frequency. Thus, 
it is not needed to wavelength-demultiplex the 
channels first in order to detect and demodulate 
the signalling information on the various subcar- 
riers. Another method to transfer the signalling 
information is to modulate it on a dedicated sepa- 
rate common wavelength channel, which is opto- 
electric-optically (O/E/O) converted at each node 
for inspection. And a third method is to put the 
signalling information in-band in a digital frame 
together with the data (e.g., the digital wrapper 
concept); then at each node all wavelength chan- 
nels need to be demultiplexed and O/E/O con- 
verted for inspection. 

The OADMs need to be wavelength-selective. 
When the pass-through wavelength channels 
experience some filtering, the passband of many 
OADMs put in cascade along the ring may nar- 
row significantly. Thus the possibility to extend 
the number of nodes in the network scalability 
may be reduced. This scalability issue is avoided by 
applying notch-type OADMs, which do not exhibit 
pass-through filtering. 

Another issue in the design of OADMs is cross- 
talk. Two types of crosstalk may be discerned: inco- 
herent crosstalk and coherent crosstalk, as shown in 
Figure 2.19. Incoherent crosstalk may occur because 
part of the wavelength channel(s) A; #1, to be passed 
through are dropped, and interfere with the inten- 
tionally dropped wavelength channel 4,. Coherent 
crosstalk occurs because part of the dropped wave- 
length channel A, leaks through, and beats with the 
added wavelength channel A’, which has nominally 
the same wavelength. Thus the coherent crosstalk 
cannot be removed by subsequent optical bandpass 
filtering, and it may accumulate when cascading 
OADM nodes. The coherent crosstalk imposes the 
most stringent requirements on the device crosstalk 
characteristics. Mathematical analysis taking the 
specific statistical properties of the beat signal into 
account show that per node the crosstalk attenuation 
needs to be better than 32 dB to yield a bit error rate 
(BER) better than 10-! [7]. For incoherent crosstalk 
in a four-channel OADM, the crosstalk attenuation 
needed to yield a BER < 10-” needs to be better than 
13 dB only. 

An example of an OADM that can drop and add 
a fixed wavelength channel is shown in Figure 2.20. 
A fiber Bragg grating (FBG) reflects only wave- 
length i,; the other wavelength channels are passed 


unaffected. This notch-type OADM does not put a 
limit to extending the number of nodes in the ring. 
The FBG is made by writing a grating with UV light 
into the fiber core. By putting thermal or mechani- 
cal stress on the FBG, by means ofa local heater or a 
piezo-electric stretcher, respectively, the device may 
be slightly tuned at low speed to other wavelength 
channels. Thus circuit-switched connections may 
be set up. Another more widely tunable OADM is 
shown in Figure 2.21, which deploys a fiber Fabry- 
Perot (FFP) filter. The passband of the FFP can be 
tuned to any of the input wavelength channels, and 
thus the selected channel is dropped whereas the 
other channels are reflected without filtering and via 
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Figure 2.19 Crosstalk in an Optical Add/Drop 
Multiplexer (OADM) 
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Figure 2.20 Fixed-channel OADM using fiber 
Bragg grating (FBG). (Note: the right-hand circu- 
lator may also be replaced by a power combiner.) 
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Figure 2.21 Tunable OADM using a fiber 
Fabry-Perot filter. 


46 Optical network architectures 


the optical circulator passed to the output port. The 
locally added channel will pass the FFP and join the 
other channels via the circulator. The residual reflec- 
tion (a few percent) at the FFP of the added chan- 
nel causes near-end crosstalk at the local receiver; 
this may be counteracted by echo cancelling, as the 
locally added signal is known. An FFP is usually 
tuned with piezo-electric means; tuning speed is 
therefore limited, and this OADM is suited for set- 
ting up circuit-switched connections. Its notch-type 
characteristic implies that this OADM architecture 
does not limit the extension of the number of nodes 
in the ring. 

When the wavelength channels are arranged 
in groups of closely spaced channels, adding and 
dropping of a specific group per node may be 
accomplished by a cascaded architecture of a fine- 
grain demultiplexer/multiplexer and a coarse grain 
OADM. Figure 2.22 shows an example, where a 
silica-based AWGR demultiplexer with narrow 
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Figure 2.22 Two-stage OADM for handling 
wavelength groups. 


channel spacing and with a free spectral range of 
500 GHz separates the 40 input wavelength chan- 
nels spaced at 100 GHz into groups of eight wave- 
lengths spaced at 500 GHz. Next, a compact OADM 
integrated in InP with a coarse channel spacing of 
500 GHz separates the eight wavelength channels 
within a specific group. 

For packet-switching applications, the OADM 
characteristics need to be fast tunable. Next to a 
wavelength demultiplexer stage and a wavelength 
multiplexer stage, the OADM architecture shown 
in Figure 2.23 is equipped with fast (lithium-nio- 
bate, or semiconductor-based) optical switches, 
and can drop and/or add multiple wavelength 
channels. The wavelength passband channels of 
the demultiplexing and the multiplexing stages 
need to be and to stay carefully aligned. By put- 
ting a 2X2 optical switch in the cross state (as 
indicated for channel ,), the corresponding wave- 
length channel can be dropped and added. The 
pass-through wavelength channels are filtered, 
and thus this design limits the cascadability of 
nodes. An OADM architecture with similar func- 
tionality but requiring only a single wavelength- 
selective element is shown in Figure 2.24 [8,9]. 
The wavelength-selective AWGR performs both 
the demultiplexing and the multiplexing of the 
wavelength channels, and thus avoids wavelength 
misalignment issues. Some crosstalk may occur 
due to direct leak-through from the input port to 
the output port of the AWGR. By looping back 
the wavelength-demultiplexed paths via the add/ 
drop switch matrix not to the front side but to the 
back side of the AWGR, and positioning the output 
port also at the front side, this crosstalk is strongly 
reduced; this fold-back architecture requires, 
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Figure 2.23 Fast wavelength-switchable OADM. 
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Figure 2.24 Fast wavelength-switchable OADM 
based on an arrayed waveguide grating router 
(AWGR). 


however, a larger AWGR, with twice the number of 
ports at the back side. 


2.5 ACCESS NETWORKS 


Access networks carry a wide range of services 
to and from the residential end customers, rang- 
ing from voice-based services, audio-based ones, 
video-based ones, to Internet/data services. Also 
the capacities needed per service vary widely: 
from 64 kbit s“! for traditional voice telephony to 
beyond 100 Mbit s“! for high-speed Internet and 
data. The last link to connect to the end customer, 
the so-called first mile (or last mile, depending 
on the point of view), may be bridged with vari- 
ous types of transport media exploited by various 
network operators. Coaxial copper cable trans- 
ports broadcast television and radio services, and 
increasingly also data services via cable modems. 
Twisted copper pair cables carry voice telephony, 
and data services via voice modems or high-speed 
ADSL and VDSL modems. Wireless systems bring 
mobile voice telephony via the GSM standard, and 
also data services via GPRS and UMTS. Optical 
fiber to the home/building is entering the market, 
but still has to surpass some cost barriers. It can 
offer the full set of integrated broadband services, 
from broadcast high bandwidth video services to 
gigabit Ethernet data services. A list of first-mile 
media with the bearer services, bitrates and reach 
is given in Table 2.1. The need for more band- 
width in the access network is growing continu- 
ously, due to the increasing amount of bandwidth 
required by each customer, mainly fuelled by 


video-based services and high-speed Internet, the 
tailoring of services to individual customer needs, 
and the emergence of more competing operators 
due to liberalization. This spurs the introduction 
of optical fiber (mainly single mode fiber, being 
a future-proof solution with its virtually infi- 
nite bandwidth) into the access network. As the 
installation and equipment costs of fiber-to-the- 
home (FTTH) are still quite high in compari- 
son to the traditional copper wired access lines, 
hybrid fiber access networks are the first step to 
introduce fiber. Fiber is used in the upper feeder 
part of the access network, where it runs from 
a local exchange (headend station) to a cabinet 
along the street (fiber-to-the-cabinet, FTTCab) or 
to the basement of a building (such as an apart- 
ment building with many living units; fiber-to- 
the-building, FTTB), where the optical signals are 
converted back into electrical ones which are then 
brought via copper-based first mile links or wire- 
lessly to the end customers. In the following of this 
section, the attention will be focussed on the opti- 
cal fiber part of the access network. 

Basically, three architectures may be deployed 
for the fiber access network: 


1. Point-to-point topology, where individual 
fibers run from the local exchange to each cabi- 
net, home or building. Many fibers are needed, 
which entails high first installation costs, but 
also provides the ultimate capacity. 

2. Active star topology, where a single fiber car- 
ries all traffic to an active node close to the end 
users, from where individual fibers run to each 
cabinet/home/building. Only a single feeder 
fiber is needed, and a number of short branch- 
ing fibers to the end users, which reduces 
costs; but the active node needs powering and 
maintenance. 

3. Passive star topology, in which the active node 
of the active star topology is replaced by a pas- 
sive optical power splitter that feeds the indi- 
vidual short branching fibers to the end users. 
In addition to the reduced installation costs of 
a single fiber feeder link, the completely pas- 
sive outside plant avoids the costs of powering 
and maintaining active equipment in the field. 
This topology has therefore become the most 
popular one for introduction of optical fiber 
into access networks, and is widely known as 
the passive optical network, PON. 
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Table 2.1 First-mile network technologies 


Medium Bearer service Bitrate (down/up) Reach (km) 
Twisted pair Analogue line Rates up to 56 k/56 kbit s* 
Twisted pair ISDN 144 k/144k data incl. 64 k/64k bit s voice <6 
or data circuits 
Twisted pair SDSL 768 k/768k bit s“! <4 
Twisted pair ADSL 1.5M to 6 M/64k to 640k bit s* <4-6 
Twisted pair VDSL 26M to 52 M/13M to 26M bit s* <0.3-1 
Coaxial cable CDMA/ <14 M/14M (net 8.2 M) bit s*? in 6 MHz slot 
OFDM+QOAM/OPSK 
Fiber (single ATM 150M to 622 M/150M bit s*' shared up to < 20 
mode) 1:32 (FSAN ATM-PON); up 1.24 
G/620M bit s-! (FSAN/ITU B-PON) 
Fiber (single Gigabit Ethernet 1 Gbit s-? (1.25 Gbit s*? 8B/10B coded) <5 
mode) 
Fiber (multi mode) Gigabit Ethernet 1 Gbit s*? (1.25 Gbit s-’ 8B/10B coded) <0.55 
Wireless (mobile) GSM 13 kbit s“! (at carrier frequency 900 and <16 
1800 MHz, frequency duplex) 
Wireless (mobile) GPRS 115 kbit s“ 
Wireless (mobile) UMTS 144k to 2 Mbit s“! (at carrier frequency 
2110-2200/1885-2025 MHz, frequency 
duplex) 
Wireless (fixed) MMDS 6 Mbit s* (at carrier frequency > 17 GHz) 
Wireless (fixed) LMDS 45 Mbit s“ (at carrier frequency > 17 GHz) 


2.5.1 Multiple access PONs 


The common fiber feeder part of the PON is shared 
by all the optical network units (ONUs) terminat- 
ing the branching fibers. The traffic sent down- 
stream from the optical line terminal (OLT) at the 
local exchange is simply broadcasted by means of 
the optical power splitter to every ONU. Sending 
trafic from the ONUs upstream to the local 
exchange, however, requires accurate multiple 
access techniques in order to multiplex collision- 
free the traffic streams generated by the ONUs 
onto the common feeder fiber. Four major catego- 
ries of multiple access techniques for fiber access 
networks have been developed: 


Time division multiple access (TDMA) 
SubCarrier multiple access (SCMA) 
Wavelength division multiple access (WDMA) 
Optical code division multiple access 
(OCDMA) 


In a TDMA system, as shown in Figure 2.25, 
the upstream packets from the ONUs are 


time-interleaved at the power splitting point, 
which requires careful synchronization of the 
packet transmission instants at the ONUs. This 
synchronization is achieved by means of grants 
sent from the local exchange, which instruct the 
ONU when to send a packet. At the local exchange 
in the OLT, a burst mode receiver is needed which 
can synchronize quickly to packets coming from 
different ONUs, and which also can handle the dif- 
ferent amplitude levels of the packets due to differ- 
ences in the path loss experienced. 

In an SCMA system, illustrated in Figure 2.26, 
the various ONUs modulate their packet streams 
on different electrical carrier frequencies, which 
subsequently modulate the light intensity of the 
laser diode. The packet streams are thus put into 
different frequency bands, which are demulti- 
plexed again at the local exchange. Each frequency 
band constitutes an independent communica- 
tion channel from an ONU to the OLT in the 
local exchange and thus may carry a signal in a 
format different from that in an other channel 
(e.g., one channel may carry a high-speed digital 
data signal, and an other one an analogue video 
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Figure 2.25 TDMA passive optical network. 


Figure 2.26 SCMA passive optical network. 


signal). No time synchronization of the channels 
is needed. The laser diodes at the ONUs may have 
nominally the same wavelength. When the wave- 
lengths of the lasers are very close to each other, 
the frequency difference between them may result 
in beat noise products due to optical beating at the 
photodetector in the receiver. These noise products 
may interfere with the packet data spectrum. The 
wavelengths of the laser diodes have to be adjusted 
slightly different (e.g., by thermal tuning) in order 
to avoid this optical beat noise interference. 

In a WDMA system (see Figure 2.27), each 
ONU uses a different wavelength channel to send 
its packets to the OLT in the local exchange. These 
wavelength channels constitute independent com- 
munication channels and thus may carry differ- 
ent signal formats; also no time synchronization 
is needed. The same wavelength channel may be 
used for upstream communication as for down- 
stream. The isolation requirements of the wave- 
length demultiplexer may be high to sufficiently 
suppress crosstalk, e.g., when high-speed digital 


data and analogue video are carried on two differ- 
ent wavelength channels. The channel routing by 
the wavelength multiplexer at the network split- 
ting point prohibits broadcasting some channels to 
all ONUs, as needed for instance for CATV signal 
distribution. Every ONU needs a wavelength-spe- 
cific laser diode, which increases costs, and com- 
plicates maintenance and stock inventory issues. 
An alternative is to use a light source with a broad 
spectrum at the ONU (e.g., a superluminescent 
LED), of which the in-field multiplexer cuts out the 
appropriate part of the spectrum. This “spectral 
slicing” approach reduces the inventory problems, 
but also yields a reduced effective optical power 
available from the ONU and thus limits the reach 
of the system. Another alternative is to use a reflec- 
tive modulator at the ONU, which modulates the 
upstream data on a continuous light channel emit- 
ted at the appropriate wavelength by the OLT and 
returns it to the OLT [10]. Thus no light source is 
needed at the ONU, which eases maintenance; but 
again the power budget is limited. 
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Figure 2.27 WDMA passive optical network. 


In an OCDMA system, each ONU uses a differ- 
ent signature sequence of optical pulses, and this 
sequence is on-off modulated with the data to be 
transmitted. The duration of the sequence needs to 
be at least equal to that of a data bit, and thus a very 
high-speed signature sequence is needed to trans- 
mit moderate-speed data which limits the reach 
of the system due to the increased impact of dis- 
persion and the decreasing power budget at high 
line rates. In the OLT at the local exchange, the 
received signals are correlated with the known sig- 
nature sequences, in order to demultiplex the data 
coming from the different ONUs. As the signature 
codes may not be perfectly orthogonal, some cross- 
talk may occur. 

TDMA systems have received the most atten- 
tion for broadband access networks, as they are 
most suited for high-speed data transmission 
at relatively moderate complexity. Two types 
of TDMA passive optical networks have been 
addressed extensively in standardization bod- 
ies: the ATM PON (APON) carrying native ATM 
cells in the G.983 standard series of ITU-T SG15, 
and the Ethernet PON (EPON) carrying gigabit 
Ethernet packets in IEEE 802.3. 


2.5.2 ATM PON 


The full service access network (FSAN) group, a 
committee of presently 21 major telecommuni- 
cation operators around the world, since 1995 is 
promoting the ATM PON for broadband access 
networks. 

As laid down in the G.983.1 Recommendation of 
ITU-T [11], an ATM PON may have a downstream 
bitrate of 155 or 622 Mbit s! and an upstream one 


of 155 Mbit s"' The maximum optical splitting ratio 
is 32 (may grow to 64), and the maximum fiber 
length between the OLT in the local exchange and 
an ONU is 20km. The range in which this length is 
allowed to vary is from 0 to 20km. Standard single 
mode fiber (G.652) is foreseen. Coarse wavelength 
multiplexing is used for separating the bi-direc- 
tional traffic: the downstream traffic is positioned 
in the 1.5,m wavelength band, and the upstream 
traffic in the 1.3 pm band (using cheap Fabry-Perot 
laser diodes in the ONUs). 

In the downstream direction of a 155 Mbit s7! 
down/155 Mbit s“' up system, 54 ATM cells of 53 
bytes each are fitted together with two physical 
layer operation, administration, and maintenance 
(PLOAM) cells of 53 bytes in a frame [11]. The 
PLOAM cells contain each 53 upstream grants. 
A grant permits an ONU to send an ATM cell. 
By sending these grants, the OLT controls at each 
ONU the transmission of the upstream packets, 
and can therefore assign dynamically a portion of 
the upstream bandwidth to each ONU. In a 622 
Mbit s"! down/155 Mbit s~! up system, a frame con- 
tains four times as many cells (i-e., 216 ATM cells 
and eight PLOAM cells). The downstream frame is 
broadcasted to all ONUs. An ONU only extracts 
those cells that are addressed to it. 

In the upstream frame, both for the 155 Mbit s“! 
down/155 Mbit s' up system and for the 622 
Mbit s“! down/155 Mbit sup system, 53 ATM cells 
are fitted of 53 bytes each plus an overhead of 3 
bytes per cell. This overhead is used as guard time, 
as a delimiter and as preamble for supporting the 
burst mode receiver process in the local exchange. 

The power budgets needed to bridge the fiber 
losses and the splitter losses are denoted by three 
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Figure 2.28 Time ranging in a TDMA PON. 


classes of optical path losses: class A 5-20 dB, class 
B 10-25dB, and class C 15-30dB. At the ONU, a 
launched optical power of —4 to +2 dBm is speci- 
fied for class B, and —2 to +4 dBm for class C [12]. 
The ONU receiver sensitivity at 155 Mbit s' should 
be better than —30 dBm for class B, and —33 dBm 
for class C. 

The ONUs are usually positioned at different 
distances from the local exchange. Therefore, the 
upstream transmission of the packets from each 
ONU should be carefully timed, in such a way that 
the packets do not collide at the network splitter 
[11,14]. The OLT has to measure the distance to 
each ONU for this, and then instructs the ONU to 
insert an equalizing transmission delay such that 
all distances from the ONUs to the OLT are vir- 
tually equal to the longest allowable distance (i.e., 
20km); see Figure 2.28. To measure the distance to 
each ONU, the OLT emits a ranging grant to each 
ONU, and on receipt the ONU returns a ranging 
cell to the OLT. In this distance ranging process, 
the OLT can deduce the distance to each ONU 
from the round trip delay. 

Each ONU sends an upstream cell upon the 
receipt of a grant. Because the path losses from 
each ONU to the OLT may be different, the power 
of the cells received by the OLT may vary consid- 
erably from cell to cell. The burst mode receiver 
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at the OLT should therefore have a wide dynamic 
range, and should be able to set its decision thresh- 
old quickly to the appropriate level to discriminate 
the logical ones from the zeros. Also the power of 
the ONU transmitter can be varied over a certain 
range to limit the requirements on the receiver 
dynamic range. In this amplitude ranging process, 
the overhead to each ATM cell is used for support- 
ing the fast decision threshold setting at the OLT 
burst mode receiver and the power adaptation at 
the ONU burst mode transmitter. 

Four types of network protection have been 
described in Recommendation G.983.1 [12], as 
shown in Figure 2.29. Type A protection involves 
protection of the feeder fiber only by a spare fiber 
over which the traffic can be rerouted by means of 
optical switches. After detection of a failure in the 
primary fiber and switch-over to the spare fiber, 
also re-ranging has to be done by the PON trans- 
mission convergence (TC) layer. Thus only limited 
protection of the system is realized. Mechanical 
optical switches are used up to now; when optical 
switching becomes cheaper, this protection scheme 
may become more attractive. Type B protection 
features duplication of both the feeder fiber and 
the OLT. The secondary OLT is on cold standby, 
and is activated when the primary one fails. Due to 
the high sharing factor of the duplicated resources 
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Figure 2.29 PON protection schemes. 


by the ONUs, this approach offers an economical 
yet limited protection. Type C protection implies 
full duplication of the PON, and all equipment is 
normally working which allows fast switch-over 
(within 50 ms) from the primary equipment to 
the secondary one. The branch fibers as well as the 
ONUs are protected; also a mix of protected and 
unprotected ONUs can be handled. Type D protec- 
tion features independent duplication of the feeder 
fibers and the branch fibers. It cannot offer fast res- 
toration. It is less attractive than C, as it requires 
more components but not a better functionality. 
In summary, types B and C are the most attractive 
schemes for a new recommendation. 

To further increase the speeds laid down in 
Recommendation G.983.1, research is done into 
622, 1244 and 2488 Mbit s"! line rates, both for 
upstream and downstream. A key technical issue 
is the development of faster burst-mode circuitry 
to adequately retrieve the timing and set the deci- 
sion threshold level, which becomes increasingly 
more difficult at higher line rates. Operation of 622 
Mbit s’ burst-mode circuitry has been achieved 
recently [12]. In January 2003, ITU has set stan- 
dards for gigabit-capable PONs (G-PONs). These 
ITU-T Recommendations G.984.1 and G.984.2 
cover downstream speeds of 1.25 and 2.5 Gbit s*! 
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ONU 1 
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and upstream speeds of 155 and 622 Mbit s"! and 
of 1.25 and 2.5 Gbit s7. 

The G.983.1 ATM PON was initially mainly 
designed for high-speed data communication. 
However, in the residential access networks there 
is also a clear demand for economical delivery of 
CATV services, for which subcarrier multiplexing 
techniques are quite appropriate. In the enhanced 
Recommendation G.983.3 [12], room has been 
allocated in the optical spectrum to host video ser- 
vices or additional digital services next to the ATM 
PON services. As shown in Figure 2.30, the APON 
upstream services remain in the 1260-1360 nm 
band (asin G.983.1), but the band for downstream ser- 
vices is narrowed to 1480-1500 nm (1480-1580 nm 
in G.983.1). Next to those, an enhancement band 
for densely wavelength multiplexed bi-directional 
digital services (such as private wavelength ser- 
vices) is foreseen, or an enhancement band for an 
overlay of video delivery services. The latter is used 
in downstream direction only, and coincides with 
the C-band as thus economical erbium-doped fiber 
amplifiers (EDFAs) can be deployed for the power 
boosting required. When positioning an overlay of 
CATV distributive services in the C-band, strin- 
gent crosstalk requirements have to be put on the 
wavelength multiplexers and demultiplexers, to 
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Figure 2.30 WDM enhancement G.983.3. 


prevent noticeable interference of the CATV signals 
into the digital ATM signals, and vice versa [13]. 

In order to further improve the economics of 
ATM PON systems, an extended PON system with 
an increase of the network splitting factor to 128 
and even 256 has been developed, while still main- 
taining a passive outside plant and compatibility 
with G.983.1 compliant ONUs [14]. This extended 
split is achieved by a larger optical power budget. 
In the downstream direction, at the OLT a high 
power laser diode or an EDFA is used to boost the 
power. In the upstream direction, the sensitivity of 
the burst-mode receiver is improved by applying 
an avalanche photo diode (APD). Also eight sin- 
gle-mode feeder fibers (each feeding a 1:16 or 1:32 
power splitter in the field) are at the OLT coupled 
to a multimode fiber yielding a low-loss coupling 
to the receiver. 

Even further extensions of the split factor and 
of the reach of an ATM PON have been realized 
in the SuperPON system [15]. An extension to a 
splitting factor of 1:2048 has been achieved; this 
needs, however, active equipment in the field. In 
the downstream direction exploiting the 1530- 
1560 nm wavelength window, EDFAs are used for 
overcoming the large path losses. In the upstream 
direction, gated semiconductor optical amplifiers 
(SOAs) are deployed. Each SOA gate is opened 
when upstream packets arrive, and is shut other- 
wise in order to avoid funneling of the amplified 
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spontaneous emission noise towards the OLT. This 
SuperPON approach is not compliant with present 
standards, and may be economically feasible only 
in the long term [14]. 


2.5.3 Ethernet PON 


With the rapid penetration of Ethernet-based 
services, Ethernet PON (EPON) techniques are 
receiving increasing attention, and are promoted 
by the IEEE 802.3 Ethernet in the first mile (EFM) 
group. The major difference with ATM PONSs is 
that an EPON carries variable-length packets up 
to 1518 bytes in length, whereas an ATM PON car- 
ries fixed-length 53 bytes cells. This ability yields 
a higher efficiency for handling IP traffic. The 
packets are transported at the gigabit Ethernet 
1.25 Gbit s“! speed using the IEEE 802.3 Ethernet 
protocol. However, ATM offers built-in quality 
of service for all traffic classes, whereas Ethernet 
does not. EPON thus cannot support voice services 
with quality of service as provided in the tradi- 
tional public switched telephone network (PSTN), 
and also the support of real-time services still has 
issues due to latency and packet jitter. 

The EPON features full-duplex transmission 
similarly as the ATM PON, with downstream 
traffic at 1490 or 1510 nm, and upstream traffic at 
around 1310 nm. As shown in Figure 2.31, stan- 
dard IEEE 802.3 Ethernet packets are broadcasted 
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Figure 2.31 Downstream traffic in an EPON. 
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Figure 2.32 Upstream traffic in an EPON. 


downstream by the OLT to all the ONUs. Each 
ONU inspects the headers, and extracts the pack- 
ets that are addressed to it. Several variable-length 
packets are put into a fixed-length frame of 2 ms 
duration, and each frame begins with a 1-byte syn- 
chronization marker. In the upstream direction, 
also 2 ms frames are used. A frame contains time 
slots that each are assigned to one of the ONUs (see 
Figure 2.32). Each ONU puts one or more of its 
upstream variable-length IEEE 802.3 packet into a 
time slot; if it has no packets to send, the time slot 
may be filled with an idle signal. No packet frag- 
mentation takes place. The time slot overhead con- 
sists of a guard band, and indicators for timing and 
signal power. The OLT thus allows only one ONU 
to send at a time, and no collisions occur. The time 
slot size is 125 or 250 ps. 


2.5.4 Hybrid fiber coax networks 


CATV networks usually are laid out over large geo- 
graphical areas, and are mainly designed for down- 
stream broadcasting of analogue TV channels that 


are frequency-division multiplexed in a carrier fre- 
quency grid extending up to 1GHz. As shown in 
Figure 2.33, in a hybrid fiber coax (HFC) system a 
CATV headend station is collecting the CATV sig- 
nals, remodulating them into a specific frequency 
grid, and sending them via single-mode fibers to 
fiber nodes. Each fiber node converts the com- 
posite optical signal into an electrical one, which 
is carried via a coaxial cable network including 
several RF amplifiers to the residential homes. A 
single headend may thus serve hundred thousands 
of customers, and a fiber node some thousands 
of customers. In particular during transmission 
in the coaxial cable network, the signal quality 
deteriorates due to the addition of noise from the 
electrical amplifiers and intermodulation products 
from nonlinearities in the system. On the fiber part 
of the network, the signals are carried with subcar- 
rier multiplexing; see Figure 2.34. The TV chan- 
nels each are amplitude-modulated on a separate 
frequency, and after summing all these modulated 
signals, a highly linear high power laser diode 
(or laser diode followed by a linearized external 
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Figure 2.33 Hybrid fiber coax network. 
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Figure 2.34 Subcarrier multiplexing. 


modulator) generates an optical signal which is 
intensity-modulated with the composite CATV 
signal. At the receiver site, the optical signal is con- 
verted into the electrical CATV signal by means of 
a highly linear PIN photodiode, and subsequently 
the signal can be passed to the coaxial cable net- 
work or to a selective receiver. When using a laser 
diode with low relative intensity noise and high 
linearity (or a carefully linearized external modu- 
lator), the CATV signal can be transported with 
very little loss of quality. If a 1.5m wavelength 
laser diode is used, EDFAs may boost the power 
at the headend and compensate for the split- 
ting losses; thus very extensive networks feeding 
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thousands of ONUs can be realized. In this wave- 
length region, however, with direct laser modula- 
tion second order intermodulation products may 
arise due to laser chirp in combination with fiber 
chromatic dispersion; with an external modulator, 
however, the chirp is small enough to avoid these 
intermodulation products. 

The CATV signal quality that can be main- 
tained in HFC networks is very high due to the 
fiber’s low losses and high bandwidth in compari- 
son with coaxial cable. Therefore, in HFC networks 
fiber is gradually brought deeper into the network, 
and fiber nodes have to serve fewer customers 
through a coaxial cable network of limited size 


56 Optical network architectures 


(ie., mini fiber nodes, each serving in the order of 
40 customers). 

At present, HFC networks are not only car- 
rying CATV and FM radio broadcast services, 
but cable operators are also exploiting them for 
voice telephony and data transport using cable 
modems. For the upstream traffic involved with 
these interactive services, parts of the spectrum 
unused for CATV and FM radio broadcast can be 
used. In Europe, typically the 5-65 MHz band is 
used for this; in the US, the 5-42 MHz range. For 
downstream data, e.g., the 300-450MHz range 
is used, taking into account that Internet traffic 
is usually highly asymmetric (much more down- 
loading traffic than uploading). Downstream per 
8 MHz CATV channel, 30-50 Mbit s“! data can be 
accommodated deploying 64 or even 256 quadra- 
ture amplitude modulation (QAM). Upstream 
due to ingress noise less complicated modulation 
schemes are to be used; DQPSK offers about 3 
Mbit s*! per channel. 


2.5.5 Dense wavelength multiplexing 
in access networks 


In general, access networks have to meet a fast 
growth in capacity demand, due to several causes: 
customers are asking for second and more tele- 
phone lines; Internet data traffic is booming with 
higher data rates, more users and longer sessions 
on-line (even always on); an increasing amount 
of video-based services; fast growth in number of 
mobile phone users and session frequency; new 
operators entering asking to rent capacity on exist- 
ing access networks; etc. This hunger for more 
capacity and the strive for convergence of services 
on a single network can most adequately be met 
by bringing fiber ever closer to the end users, from 
where only a short copper cable based (or wireless) 
link has to be bridged to the customer. Ultimately, 
when installation and equipment costs have come 
down sufficiently, the most powerful network is 
achieved when fiber runs all the way to the cus- 
tomer’s home (fiber to the home, FTTH). 

The upgradation of installed fiber plant to 
higher capacities while protecting the investments 
made is efficiently done by introducing wavelength 
multiplexing techniques. Wavelength channels 
may be allocated to specific sets of services (for 
service unbundling), and/or to separate service 
operators (leasing of network capacity). 


2.5.6 Dynamic capacity allocation by 
flexible wavelength assignment 


To cope with variation in service demand by the 
users and the sometimes quickly changing opera- 
tor conditions, it is more efficient to flexibly allocate 
the augmented available network capacity across 
the access network. Dynamic wavelength rout- 
ing techniques can be used for this, thus making 
more efficient use of the network’s resources and 
generating more revenues. Figure 2.35 illustrates 
the principle: from the OLT in the headend station 
of the network, multiple wavelength channels are 
fed to the ONUs via a tree-and-branch PON. By 
wavelength-selective routing in the PON, or wave- 
length selection at the ONU, wavelength channels 
can be assigned to a number of specific ONUs. 
Thus capacity can be specifically shared between 
these ONUs. The ONUs subsequently transfer this 
capacity shares to their first-mile electrical network 
connecting the end users. The mapping of the net- 
work capacity resources to the first-mile networks 
can thus be changed by changing the wavelength 
channel assignment. Basically two approaches can 
be followed, as illustrated in Figure 2.36: a wave- 
length router in the field, or wavelength selection 
at the ONUs. As shown in Figure 2.36a, a tunable 
wavelength router directs the wavelength chan- 
nels to specific output ports, and this routing can 
be dynamically adjusted by external control signals 
from the headend. In order to support in addition 
the delivery of broadcast services to all ONUs, extra 
provisions have to be made for enabling broad- 
cast wavelength channel(s) to bypass the router. 
As the wavelength channels are routed to only those 
ONUs whose customers require the associated 
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Figure 2.35 Dynamic wavelength routing in 
hybrid access networks. 
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Figure 2.36 Dynamically allocating wavelength channels to ONUs. (a) Flexible wavelength routing and 


(b) broadcast-and-select. 


services, no optical power is wasted. As shown in 
Figure 2.36b, another approach is to broadcast all 
wavelength channels to every ONU, and subse- 
quently tune the ONU to the wavelength channel 
wanted. Clearly the power of the other wavelength 
channels is wasted by the ONU, and losses at the 
broadcasting power splitter are significant. An 
optical amplifier is usually needed to make up for 
these losses; the amplifier needs to operate bi-direc- 
tionally to handle downstream as well as upstream 
traffic. No specific provisions in the network are 
needed for supporting broadcast services. 

Figure 2.37 presents a multi-wavelength overlay 
of a number of ATM PON networks on a HFC net- 
work, following the wavelength channel selection 
approach [16]. Figure 2.37a shows a fiber-coax net- 
work for distribution of CATV services, operating 
at a wavelength A, in the 1550-1560nm window 
where EDFAs offer their best output power per- 
formance. Thus, using several EDFAs in cascade, 
an extensive optical network splitting factor can 
be realized and a large number of customers can 
be served. For example, with two optical ampli- 
fier stages and typical splitting factors of N=4 and 
P=16, and a mini-fiber node serving 40 users via 
its coaxial network, a total of 2560 users is served 
from a single headend fiber. For interactive ser- 
vices, the upstream frequency band in a standard 
HFC network (with a width of some 40-60MHz) 
has to be shared among these users, thus allowing 
only limited bitrates per user for narrowband ser- 
vices such as voice telephony. 

An upgrade of the system in order to provide 
broadband interactive services can be realized by 
overlaying the HFC network with a number of 
wavelength-multiplexed APON systems, as devel- 
oped in the ACTS TOBASCO project [16] and 
shown in Figure 2.37b. Four APON OLTs at the 
headend site are providing each bi-directional 622 
Mbit s ATM signals on a specific downstream 


and upstream wavelength. These eight wavelengths 
are positioned in the 1535-1541nm_ window, 
where the up-and downstream wavelength chan- 
nels are interleaved with 100GHz spacing. The 
APON wavelengths are combined by a high-den- 
sity wavelength division multiplexer (HDWDM), 
and subsequently multiplexed with the CATV 
signal by means of a simple coarse wavelength 
multiplexer (due to the wide spacing between the 
band of APON wavelengths and the CATV wave- 
length band). The system upgrade implies also 
replacement of the uni-directional optical EDFAs 
by bi-directional ones which feature low noise 
high-power operation for the downstream CATV 
signal, and for the bi-directional ATM signals a 
wavelength-flattened gain curve plus a nonsatu- 
rated behavior (to suppress crosstalk in burst- 
mode). At the ONU site, first the CATV signal is 
separated from the APON signals by means of a 
coarse wavelength multiplexer, and is subsequently 
converted to an electrical CATV signal by a highly 
linear receiver and distributed to the users via the 
coaxial network. The APON signals are fed to a 
wavelength-switched transceiver, of which the 
receiver can be switched to any of the four down- 
stream wavelength channels, and the transmitter 
to any of the four upstream ones. The wavelength- 
switched transceiver may be implemented by an 
array of wavelength-specific transmitters and 
receivers, which can be individually switched on 
and off; this configuration allows to set up a new 
wavelength channel before breaking down the old 
one (“make-before-break”). Alternatively, it may 
use wavelength-tunable transmitters and receiv- 
ers, which can in principle address any wavelength 
in a certain range; this eases further upgrading 
of the system by introducing more wavelength 
channels, but also implies a “break-before-make” 
channel switching. The network management and 
control system commands to which downstream 
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Figure 2.37 Flexible capacity assignment in a multi-wavelength fiber-coax network by wavelength 
selection at the ONUs. (a) fiber-coax network for distribution of CATV services and (b) upgrading of 
the fiber-coax network with multi-wavelength APON system for delivery of broadband interactive 


services. 


and to which upstream wavelength channel each 
ONU transceiver is switched. By issuing these 
commands from the headend station, the net- 
work operator actually controls the virtual topol- 
ogy of the network, and thus is able to allocate 
the network’s capacity resources in response to 
the traffic demands at the various ONU sites. The 
network management command signals are trans- 
ported via an out-of-band wavelength channel in 
the 1.3,1m wavelength window. The APON signal 
channel selected by the ONU is converted into a 
bi-directional electrical broadband data signal by 
the transceiver, which is by a cable modem con- 
troller put in an appropriate frequency band for 
multiplexing with the electrical CATV signal. The 
upstream data signal is usually put below the low- 
est frequency CATV signal (so below 40-50 MHz), 


and the downstream signal in empty frequency 
bands between the CATV broadcast channels. The 
signals are carried by the coaxial network (in which 
only the electrical amplifiers need to be adapted to 
handle the broadband data signals) to the customer 
homes, where the CATV signal is separated from 
the bi-directional data signals; the latter signals are 
processed by a cable modem, which interacts with 
the cable modem controller at the ONU site. 

By remotely changing the wavelength selection 
at the ONUs, the network operator can adjust the 
system’s capacity allocation in order to meet the 
local traffic demands at the ONU sites. As illus- 
trated in Figure 2.39, the ONUs are allocated to 
the four upstream (and downstream) wavelength 
channels, which each have a maximum capacity of 
622 Mbit s“! for ATM data. As soon as the traffic 
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Figure 2.38 Flexible capacity assignment in a multi-wavelength fiber-wireless network by wavelength 


routing in the field. 


to be sent upstream by an ONU grows and does 
not fit anymore within its wavelength channel, 
the network management system can command 
the ONU to be allocated to an other wavelength 
channel, in which still sufficient free capacity is 
available. Obviously, this dynamic wavelength 
re-allocation process reduces the system’s blocking 
probability, i.e., it allows the system to handle more 
traffic without blocking and thus can increase the 
revenues of the operator. 

Figure 2.38 presents the dynamic wavelength 
channel routing approach in a fiber-wireless net- 
work to allocate flexibly the capacity of a number 
of ATM PON systems among ONUs in a single 
fiber split network infrastructure [17]. The ONUs 
are each feeding a radio access point (RAP) of 
e.g., a wireless LAN, which wirelessly connects 
to a variable number of users with mobile termi- 
nals. These users move across the geographical 
area served by the network (e.g., a business park), 
and they may want to set up a broadband wire- 
less connection to their laptop at any time any- 
where in this area. When many users are within 
a wireless cell served by a certain RAP, this cell 
may have to handle much more traffic than the 
other cells; it has become a “hot spot” which has 
to be equipped with additional capacity. The cor- 
responding RAP may switch on more microwave 
carriers to provide this additional capacity over 
the air, and also has to claim more capacity from 
the ONU. This local extra capacity can be provided 


by reallocation of the wavelength channels over 
the ONUs, which is done by a flexible wavelength 
router positioned in the field. Similar to the archi- 
tecture of the wavelength-reconfigurable fiber- 
coax network in Figure 2.37b, the architecture 
in Figure 2.38 developed in the ACTS PRISMA 
project has four 622 Mbit s"! bi-directional APON 
OLTs with a specific downstream wavelength and 
an upstream one each. The four downstream wave- 
lengths are located in the 1538-1541 nm range, 
with 100 GHz spacing, and the four upstream ones 
in the 1547-1550 nm range with the same spacing. 
The flexible wavelength router directs the down- 
stream wavelength channels each to one or more 
of its output ports, and thus via a split network to 
a subset of ONUs. The RAPs could operate with 
up to five microwave carriers in the 5 GHz region, 
each carrying up to 20 Mbit s"! ATM wireless LAN 
data in OFDM format. At the flexible router (or at 
the local exchange) a number of continuous-wave 
emitting laser diodes are located, which provide 
unmodulated light power at the upstream wave- 
lengths. The flexible router can select one of these 
upstream wavelengths, and direct it to the ONUs 
that can modulate the signal with the upstream 
data and return it by means of a reflective modula- 
tor via the router to the local exchange. Thus no 
wavelength-specific source is needed at the ONU, 
the downstream light sources are shared by a num- 
ber of ONUs, and all ONUs are identical, which 
reduces the system costs and the inventory issues. 
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Figure 2.39 Re-allocating ONUs to wavelength 
channels. 


The flexible wavelength router can be implemented 
with a wavelength demultiplexer separating the 
wavelength channels, followed by power splitters, 
optical switches and power couplers in order to 
guide the channels to the selected output port(s). 
Depending on the granularity of the wavelength 
allocation process, the flexible router may be posi- 
tioned at desired different splitting levels in the 
network. 

Using a similar strategy to assign wavelength 
channels to the ONUs as shown in Figure 2.39, 
a statistical performance analysis has been per- 
formed of the blocking probability of the system. 
It was assumed that the total network served 343 
cells, of which 49 were “hot spots”, i.e., generated 
a traffic load two times as large as a regular cell. It 
was also assumed that the system deployed seven 
wavelength channels, and that the calls arrived 
according to a Poisson process where the call 
duration and length were uniformly distributed. 
Figure 2.40 shows how the system blocking prob- 
ability depends on the offered load (normalized 
on the total available capacity, which is 7 times 
622 Mbit s~!), using various system architecture 
options. When wavelength re-allocation would 
not be possible (i.e., static WDM) and all the 49 
hot spots were positioned at cells served by ONUs 
assigned to the same wavelength channel, the 
blocking probability is obviously the worst case. 
On the other hand, in the static WDM case when 
the 49 hot spots were evenly spread over the seven 
wavelength channels, the blocking probability is 
much lower (i.e., best case). Unfortunately, a net- 
work operator cannot know beforehand where 
the hot spots will be positioned, so in this static 
WDM situation the system blocking probabil- 
ity will be anywhere between the best case and 
the worst case, and no guarantee for a certain 
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Figure 2.40 Improving the system performance 
by dynamic wavelength allocation. 


blocking performance can be given. When, how- 
ever, dynamic re-allocation of the wavelength 
channels is possible, the system can adapt to the 
actual hot spot distribution. Figure 2.40 shows that 
when the flexible wavelength router is positioned 
at the second splitting point in the network, the 
blocking performance is better than the best-case 
static WDM performance; but more importantly, 
it is also stable against variations in the hot spot 
distribution, and thus would allow an operator to 
guarantee a certain system blocking performance 
while still optimizing the efficiency of his system’s 
capacity resources. The blocking performance 
may even be better and stable when position- 
ing the flexible router at the third splitting point; 
however, this implies that the costs of the router 
are shared by less ONUs. Locating the router at 
the second splitting point is a good compromise 
between adequate improvement of the system 
blocking performance and system costs per ONU. 


2.5.7 Microwave signals over fiber 


Fiber-wireless systems may also carry microwave 
signals directly over fiber. In wireless local area 
networks (WLANs), the evolution towards larger 
capacities necessitates higher microwave carrier 
frequencies. For example, the current IEEE 802.11b 
WLAN systems transport up to 11 Mbit s“ per car- 
rier in the ISM 2.4GHz band. The upcoming IEEE 
802.1la systems carry up to 54 Mbit s" per car- 
rier in the 5.2 GHz band, and 60 GHz systems are 
under study for providing more than 100 Mbit s-!. 
With these increasing carrier frequencies, the 
microwave cells covered by the antenna of a radio 
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access point (RAP) become smaller. Thus more 
RAPs are needed to serve e.g., all the rooms in an 
office building, and hence also a more extensive 
wired network to feed the RAPs. Instead of gener- 
ating the microwave signals at each RAP individu- 
ally, feeding the microwave signals from a central 
headend site to the RAPs enables to simplify the 
RAPs considerably. The signal processing func- 
tions can thus be consolidated at the headend site. 
Due to its broadband characteristics, optical fiber 
is an excellent medium to bring the microwave 
signals to the RAPs. 

Carrying multi-gigahertz analogue signals over 
fiber requires very high frequency optical analog 
transmitters and receivers, including careful fiber 
dispersion compensation techniques. An attrac- 
tive alternative avoiding the transport of multi- 
gigahertz intensity-modulated signals through the 
fiber is to apply heterodyning of two optical signals 
of which the difference in optical frequency (wave- 
length) corresponds to the microwave frequency. 
When one of these signals is intensity-modulated 
with the baseband data to be transported, and the 
other one is unmodulated, by optical heterodyn- 
ing at the photodiode in the receiver the electrical 
microwave difference frequency signal is gener- 
ated, amplitude-modulated with the data signal. 
This modulated microwave signal can via a simple 
amplifier be radiated by an antenna; thus a very 
simple low-cost radio access point can be realized, 
while the complicated signal processing is con- 
solidated at the headend station. This approach, 
however, requires two light sources with narrow 
spectral linewidth and carefully stabilized dif- 
ference in optical emission frequency. An alter- 
native approach requiring only a single optical 
source is shown in Figure 2.41 [18]. The optical 
intensity-modulated signal from a laser diode is 


Data on 
IF subcarriers 


subsequently intensity-modulated by an external 
Mach-Zehnder modulator (MZM) which is biased 
at its inflexion point of the modulation character- 
istic and driven by a sinusoidal signal at half the 
microwave frequency. At the MZM’s output port 
a two-tone optical signal emerges, with a tone 
spacing equal to the microwave frequency. After 
heterodyning in a photodiode, the desired ampli- 
tude-modulated microwave signal is generated. 
The transmitter may also use multiple laser diodes, 
and thus a multi-wavelength radio-over-fiber sys- 
tem can be realized with a (tunable) WDM filter to 
select the desired wavelength radio channel at the 
antenna site. The system is tolerant to fiber disper- 
sion, and also the laser linewidth is not critical as 
laser phase noise is largely eliminated in the two- 
tone detection process. 

An alternative approach to generate microwave 
signals by means of a different kind of remote opti- 
cal processing, named optical frequency multiply- 
ing, is shown in Figure 2.42 [19]. At the headend 
station the wavelength A, of a tunable laser diode is 
swept periodically over a certain range AA,,, witha 
sweep frequency f,,. Alternatively, the wavelength- 
swept signal can be generated with a continuous- 
wave operating laser diode followed by an external 
phase modulator that is driven with the integral 
of the electrical sweep waveform. The intensity of 
the wavelength-swept signal is on/off modulated 
with low frequency chirp by the downstream data 
in a symmetrically driven MZM. After travelling 
through the fiber network, the signal transverses at 
the receiver an optical filter with a periodic band- 
pass characteristic. When the wavelength of the 
signal is swept back and forth over N filter trans- 
mission peaks, the light intensity impinging on the 
photodiode fluctuates at a frequency 2Nf,,. Thus 
the sweep frequency is multiplied, and a microwave 
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Figure 2.41 Generating microwave signals by heterodyning. 
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Figure 2.42 Generating microwave signals by optical frequency multiplying. 


signal with carrier frequency f,,,,=2Nf,y plus higher 
harmonics is obtained. The intensity-modulated 
data is not affected by this multiplication process, 
and is maintained as the envelope of the micro- 
wave signal. The microwave signal is subsequently 
transversing an electrical bandpass filter (BPF) to 
reject the unwanted harmonics. Simulations have 
shown that the microwave signal is very pure; its 
linewidth is nearly independent of the linewidth 
of the tunable laser. The periodic optical band- 
pass filter can be advantageously implemented 
by a Fabry-Perot filter with a free spectral range 
Adgsp Which is N times as small as the wavelength 
sweep range Ad,,. The microwave signal can also 
carry more advanced data modulation schemes; 
e.g., 16-level quadrature amplitude modulated (16- 
QAM) signals may be modulated on a subcarrier 
first, and then drive the MZM. The main advan- 
tage of this optical frequency multiplying method 
is that the fiber network is carrying only signals at 
moderate frequencies (up to the sweep frequency 
fw ©» up to 1GHz), while at the antenna site 
microwave signals with carrier frequencies in the 
tens of gigahertz region are generated. Thus dis- 
persion problems in the fiber network and other 


bandwidth limitations in the transmitter are effec- 
tively circumvented. The system does not rely on 
heterodyning, and thus may also operate on mul- 
timode fiber networks (such as polymer optical 
fiber). 

The system can also transport upstream data 
from the antenna station to the headend. When 
no data are sent downstream, the upstream micro- 
wave signal arriving at the antenna can be down- 
converted with the locally generated unmodulated 
microwave signal. Thus data can be conveyed bi- 
directionally in time-division duplex mode. 


2.6 CONCLUDING REMARKS 


Optical fiber is now generally recognized to be the 
most powerful medium for transporting informa- 
tion, due to its very low losses and extremely wide 
bandwidth. Next to space and time multiplexing, 
the wavelength dimension offers unprecedented 
opportunities to extend not only the data traffic 
transport capacity, but also the traffic routing pos- 
sibilities in networks. 

In core networks, optical amplifiers together 
with wavelength multiplexing techniques have 
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allowed transoceanic links to be bridged with tera- 
bits/second total transport capacity. In terrestrial 
mesh-shaped core networks, wavelength routing 
also provides link protection and thus improves 
network availability. IP packet streams may be 
very efficiently transported through mesh-shaped 
networks by wavelength-selective optical bypass- 
ing of the electronic processing in the packet rout- 
ers located in the nodes. 

In MANs, wavelength multiplexing techniques 
enable simultaneous broadband data communica- 
tion between many nodes. They also improve the 
network availability by offering alternative routing, 
which enables fast recovery from link or node fail- 
ures. Through wavelength add-drop multiplexers, 
multiple connections can be set up between nodes 
via a hubbing node, or directly between them using 
an appropriate wavelength assignment strategy. 

In access networks, fiber is penetrating steadily 
towards the end user. Infrastructure costs are the 
major nut to crack here. Shared-feeder concepts 
such as the passive optical network tree-and- 
branch one greatly reduce the installation costs of 
the fiber network, and can support various mul- 
tiple access techniques (a.o. ATM and Ethernet 
for time multiplexed access). In the past, operators 
have invested a lot in various last-mile networks 
(e.g., twisted pair, coaxial cable) to reach their resi- 
dential customers. In upgrading these networks 
to higher capacity and larger service variety, fiber 
can support a wide range of last-mile technolo- 
gies by hybrid combinations such as fiber-twisted 
pair, fiber-coax, and fiber-wireless. By means of 
wavelength multiplexing techniques, the fiber 
feeder part of such hybrid networks can very flex- 
ibly host different operators and service categories. 
Augmented with wavelength routing, capacity- 
on-demand can be realized, e.g., for handling hot 
spots, while respecting quality of service require- 
ments. Carrying microwave radio signals in analog 
form over the fiber offers clear advantages in the 
implementation of mobile communication net- 
works; the antenna stations can be considerably 
simplified and thus reduced in costs, while also 
the mobility functions can be consolidated in the 
headend station yielding improved efficiency of the 
signal processing. By combining radio over fiber 
techniques with flexible wavelength routing, the 
mobility offered by wireless communication can 
be powerfully complemented with the broadband 
dynamics of optical networks. 


2.7 FUTURE PROSPECTS 


With the ongoing improvements in fiber charac- 
teristics and development of novel optical amplifier 
structures, the wavelength range available for com- 
munication will stretch from below 0.8 to beyond 
1.6,1m, covering a bandwidth of some 200 THz. 
Further improvements in signal coding yielding a 
higher spectral efficiency, in ultra-dense WDM and 
in ultra-high speed OTDM will enable us to exploit 
this huge bandwidth, and will push the transport 
capacity of a single fiber beyond 100 Tbit s“'. 

Realizing these tremendous link transport 
capacities over fiber links is of little value if the 
network nodes cannot keep up with handling the 
data streams. Present-day electronic routing will 
be replaced by fast optical processing. All-optical 
packet routing will provide the ultimate in node 
throughput, by optically inspecting the header, and 
by making routing decisions at light speed by means 
of ultra-fast optical logic. Optical memories will 
provide the intermediate buffering in the nodes, and 
electronics may be expelled to the edges of the net- 
work, thus providing a fully-optical network path. 

The end user is to benefit from all these ultra- 
broadband communication possibilities. Therefore, 
the optical fiber will not only reach up to his house, 
but will penetrate into it as well, benefiting of the 
low installation cost techniques made possible by 
deploying e.g., large-core multimode (polymer) 
optical fiber. It will reach close to his personal area 
network, which due to the ongoing miniatuar- 
ization may consist of a myriad of small wireless 
power-lean terminals, sensors and actuators. These 
wireless devices will be incorporated not only in 
his residential living environment, but also in his 
clothes, his car, etc. Next to the traditional wired 
terminals such as the TV, these wireless devices 
will be connected to the fixed in-home and access 
network by a myriad of small intelligent anten- 
nas. Fiber-radio techniques, augmented with opti- 
cal routing to accommodate dynamically the hot 
spots, will provide the best match of the ultimate 
capacity of fiber with the user freedom of wireless. 
Even in the wireless domain, optics may penetrate 
by means of intelligently-steered free-space light 
beams providing the ultimate in wireless transport 
capacity. 

Which in the far foreseeable future will make the 
communication world an end-to-end globally trans- 
parent one, with nearly unlimited communication 
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capacity for anybody, anytime, anywhere, for any 
kind of service... the ultimate global crystal ball! 
Today we are now seeing such visionary statements 
(explained in previous paragraphs) being imple- 
mented as the ‘Internet of Things, though the final 
outcomes and potential are still not yet clear. 
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The optical packet switching technology has been 
identified as a promising technology to identify a 
new generation of systems and networks during the 
past few years. However, the lack of optical mem- 
ory, and the complexity of optical packet switching 
systems offering functionalities comparable to full 
electronic systems, has motivated the constructors 
of equipment to envisage new directions: recon- 
figurable optical add/drop multiplexers, currently 
deployed in the field or hybrid packet switching 
systems. In the early 2000s, the hybrid approach 
combining the best of optics and electronics has 
been then identified as a new promising direction 
for optical packet switching. Several research labo- 
ratories have investigated new concepts limiting 
this technology to fast space switches [based on 
pure spatial technique or exploiting the combina- 
tion of tunable lasers arrayed waveguide gratings 
(AWG)] and some processing functions. The capa- 
bility of the optical technology to switch ultrahigh 
capacities has concentrated its efforts on systems 
for the metro core or backbones where the capacity 
was very challenging. But the recent reorientation 
of information communication technologies (ICT) 
toward its cloudification forces a repositioning of 
this technology in new network segments, closer 
to the users where new key performance indicators 
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(KPIs) are asked. The new challenges are then to 
find a technology offering ultralow latencies, with 
systems more simple, cheaper, and less power con- 
suming than existing products. This new direc- 
tion then creates a need for new low-cost systems, 
ecodesigned at the convergence of new network 
concepts and of the emergent technologies driven 
in particular by the GPON2 and the DATA COM 
communities. 


3.1 INTRODUCTION 


This chapter gives a positioning of optical switch- 
ing technologies in the next generation of systems 
and networks. After giving two introduction sce- 
narios, one for the metro part and the other for 
the backbone part, this chapter addresses feasibil- 
ity issues. For the metro part, three introduction 
scenarios are presented. The first one exploits the 
well-known circuit switching techniques that can 
be introduced rapidly on the market. The second 
one proposes a packet switching technique to have 
a better bandwidth exploitation. Finally, with 
the emergence of new optical functions/devices, 
the third scenario describes how it is possible to 
propose a full flexible packet ring network that 
is really competitive with respect to other elec- 
tronic alternatives. For the backbone part, the 
first scenario will be probably in the core of large 
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routers, competing with current smart routers 
(router + cross-connects). The second scenario is 
for a new network concept, being disruptive with 
what exists but pushing toward a transparent com- 
pliant with a multiservice environment and fully 
evolutive in capacity. Finally, the last scenario 
describes how it is possible to go into an all-optical 
approach through the description of key optical 
functions required to make this concept realistic 
at a lower cost. 

With the introduction of the Internet protocol 
in the network, the telecommunication domain 
has turned a new corner. The broadband access 
to this new technology, opening the way to many 
residential applications, creates a revolution for the 
next generation of switching systems. The first rev- 
olution is the traffic volume. Personal computers 
becoming more and more powerful are generating 
a traffic through files that could not be envisaged 
even 2 years before. The second revolution is prob- 
ably the traffic profile evolution, moving from a 
constant bit rate to a variable bit rate, always driven 
by personal computer capabilities (video applica- 
tions, high definition TV (HDTV), net shopping, 
net courses, games, etc.). 

Optical technologies could appear in the next 
4 years as an important technology to grow the 
capacity of systems while preserving the simplic- 
ity, reliability, and performance of the systems. 
But, more importantly, optical packet switching 
technologies could become efficient techniques 
to really fit with the statistical behavior of the 
traffic profile to preserve bandwidth utilization 
as much as possible. One of the key issues in such 
packet-switched networks is the identification of 
the best packet format (variable packets or fixed 
packets). Several European projects have concen- 
trated their efforts on this important topic such 
as the RACE 2039 ATMOS project, the ACTS 
043 KEOPS project, and more recently the IST 
DAVID project. 

Thus, in this chapter, after a positioning of opti- 
cal switching in the next generation of systems 
and networks, the benefits of multiplexed architec- 
tures will be presented. Solutions for a progressive 
introduction of this technology in the metro are 
described, highlighting the required technology 
and addressing physical feasibility as well as perfor- 
mance issues. Opportunities for the backbone are 
also presented with the objective of highlighting 


the most promising approaches. For a pragmatic 
approach, criteria introducing this technology on 
the market are listed, but, more importantly, a 
basic cost approach leading to the winning solu- 
tion is mentioned. Finally, a conclusion is drawn. 


3.2 POSITIONING OPTICAL 
SWITCHING TECHNIQUES IN 
THE NEXT GENERATION OF 
SYSTEMS AND NETWORKS 


In this section, we will present the advantage of 
optics with respect to electronics, but, more impor- 
tantly, how optics can be exploited to complement 
electronic technology to really make the most of 
both technologies. 


3.2.1 Why optical switching? 


To give some arguments, we must list the advan- 
tages and drawbacks of optics. 
The main advantages of optics are as follows: 


e Low-power consumption: An example is a laser 
exploited in direct modulation. A laser operat- 
ing at 622 Mbit s“ or at 2.5 Gbit s“! will be 
electrically modulated with the same electrical 
modulation amplitude. 

e High reliability: For passive devices, it is 
evident that for active devices [lasers, EDFAs, 
semiconductor optical amplifiers (SOAs), etc.], 
the reliability is quite high because we exploit 
a carrier density dynamic, characteristic of the 
material used. 

© Good mode adaptation: The default on the 
coupling simply introduces losses, and the 
reflection can be easily managed by exploiting 
tilts. 

e Low-power dissipation: The photon-photon 
interaction dissipates less energy than the 
electron-electron interaction, mainly because 
of the mass. 

e High bit rate compliant: A passive guide is 
a priori a high bandwidth medium capable of 
supporting several terabits of capacity. 

e Management of large granularity: The switch- 
ing can be done at the wavelength level but 
also at the waveband (group of wavelengths) 
level. 
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The main drawbacks of optics are as follows: 


e Slow progress in integration: There is still 
a debate between monolithic and hybrid 
integration. 

e Slow progress in low-cost packaging: The cou- 
pling between passive and active guides still 
remains a costly technique. 

e Polarization sensitive: The characteristic of 
some materials often depends on the polariza- 
tion state of the light. 


In summary, optics is very interesting when the 
switching granularity is high, exploiting the WDM 
dimension to make simple structures. In electron- 
ics, we need to demultiplex at the wavelength level 
and then at the bit rate level; in optics, we simply 
need one device. 

To switch at the WDM granularity, we have 
commercially available devices such as opto- 
mechanical switches, thermo-optical switches, elec- 
tro-optical switches, and micro electro mechanical 
systems (MEMs) for slow switching applications and 
digital optical switches for fast switching SOA. 


3.2.2 Granularities of switching 


In optics, we can switch wavebands (group of 
wavelengths), wavelengths, or optical packets. 


3.2.2.1 SWITCHING OF WAVEBANDS 


The switching of wavebands is particularly inter- 
esting in the following cases: 


e To reduce the number of switching elements 
inside systems. This is the case for a large 
number of optical cross-connects or switches. 

e At the network level, when the traffic is aggre- 
gated enough to tolerate a waveband switching 
with a good bandwidth utilization. This is the 
case for pipes bridging networks and where the 
traffic matrix is quite stable. 


The waveband switching really exploits the poten- 
tial of optics because it reduces the complexity of 
the switching process with respect to electronic 
techniques. This technique has to be exploited as 
much as possible to make a system or a network 
concept really competitive to electronic techniques 
but only when the traffic matrix is stable enough 
not to penalize the average load of the waveband. 


3.2.2.2 SWITCHING OF WAVELENGTHS 


The switching of wavelengths is particularly inter- 
esting as follows: 


e To switch at the line bit rate without time 
demultiplexing inside optical systems, which 
is an advantage with respect to electronic tech- 
niques, especially when the bit rate is high (210 
Gbit s“!). Generally, the wavelength switch- 
ing is used to relax the power budget to offer 
higher switching capacity. In fact, with one 
important parameter being the optical signal- 
to-noise ratio, it is fundamental to preserve it 
to be able to address the higher throughputs. 
This is generally imposed when input powers 
launched into optical amplifiers cannot exceed 
a certain value. By this way, as the optical 
signal-to-noise ratio is always a function of the 
channel input power, if the total input power is 
the channel power, then we reach the maxi- 
mum optical signal-to-noise ratio value in the 
architecture. This is the case for a major part of 
the optical cross-connects or switches. 

e Atthe network level, when the traffic is aggre- 
gated enough and stable enough not to cope with 
traffic transient effects. This technique is still effi- 
cient when it is possible. The switching is done at 
the line bit rate. It is more efficient for backbones 
than for metro networks, for example, simply 
because of the traffic matrix characteristic. 


3.2.2.3 SWITCHING OF OPTICAL PACKETS 


The switching of optical packets can be processed 
at the wavelength level or at the waveband level. 
The only difference with respect to wavelength or 
waveband switching is that the ON state is rela- 
tively short, on the order of a few hundred nano- 
seconds or microseconds. The switching of optical 
packets is particularly interesting as follows: 


e To create a datagram connection or a virtual cir- 
cuit connection inside optical switching systems. 
The switching can be done at the wavelength 
level (classical optical packet switching) or at the 
waveband level (we define then a WDM packet) 
for a short time. Like for the previous cases, the 
WDM dimension is preferred where possible to 
reduce the number of active components in the 
architecture considered. 

e At the network level, when the traffic profile is 
sporadic, we have to cope with time constants 
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that cannot fit with seconds or minutes, and 
where the packet connection is the only one 
realistic to exploit efficiently the available 
bandwidth. This is the case for metro networks, 
in particular, but also for backbones if the 
application bit rate is growing at bit rate. 


3.2.3 Optical packet switching: An 
interesting approach context 


Among the different switching techniques, the 
optical packet switching technique is probably the 
most promising technique for the next generation 
of networks. The main indicator is the natural 
evolution of the traffic profile versus packet tech- 
niques. Driven mainly by the Internet protocol, we 
need to cope more with a traffic profile than a traf- 
fic matrix as could have been the case in the past. 
The main reason is the drastic change of telecom- 
munication applications moving from telephony 
to data. In addition, the rapid introduction of 
personal computers (PCs) at home as multimedia 
machines pushes telecom companies to find solu- 
tions to offer a higher quality of service at a lower 
cost. This new form of traffic imposes new infra- 
structures capable of handling the required capac- 
ity and to provide at the same time the required 
flexibility to offer low-cost connections. 


3.2.3.1 WHY OPTICAL PACKET SWITCHING? 


When analyzing the traffic profile at the output of 
a local area networks (LAN), the sporadic behav- 
ior of the traffic, often modeled with self-similar 
functions, clearly points to the problem. We then 
need to adapt the network concepts to the traffic 
nature coming from the access. And how do we 
adapt such a variability of the traffic profile with 


VOP: 
Follows the IP profile but 
difficult to manage in the optical network 


Header 


Continuous variable optical packet 


Lea 1 


circuit connections while having a good efficiency? 
The answer lies in the high aggregation level that 
requires a grouping of different LANs, and this 
is not always possible. Another solution is to cut 
the circuit into pieces (packets) trying to follow 
the traffic evolution at a scale comparable to the 
scale of the incoming traffic. Even if the technique 
is more complex to manage, it is undoubtedly the 
most efficient way to optimize the bandwidth utili- 
zation, more in the time domain today, than in the 
volume domain in the recent past. 


3.2.3.2 WHAT KIND OF PACKETS: FIXED 
PACKET OR VARIABLE PACKETS? 


There is still a debate on the choice of the packet 
size: for main arguments in favor of the variable 
packet, the optical packet size always follows the 
incoming packet size, whereas for the fixed packet, 
the optical packet format contributes to a better 
management of the performance. In both cases, 
arguments are acceptable, but the reality is more 
complex. 

It is evident to say that where the contention 
can be managed easily (a case of small or simple 
topologies), the variable packet has to be envis- 
aged. But in the case ofa large topology (meshed or 
other), the resolution of the contention is then local 
in each node, and the control of the traffic profile 
inside the network becomes fundamental. In that 
case, the fixed packet format is the only reasonable 
format. Another alternative, probably the best, is 
the adoption of a concatenated packet. For best 
effort, the concatenation is created to really follow 
the incoming packet profile. For high priority traf- 
fic where the delay is fundamental, small packets 
will always experience the smallest delay in the 
network. In this way, the technique can be adapted 
to a multiservice environment (Figure 3.1). 
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Figure 3.1 Different optical packet types that can be adopted. 
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Figure 3.2 Comparison: variable optical packets versus fixed optical packets versus concatenated 


optical slots. 


Figure 3.2 gives a comparison among fixed, 
variable, and concatenated packets. 


3.3 BENEFITS OF MULTIPLEXED 
ARCHITECTURES 


3.3.1 The WDM dimension, a 
dimension not only for 
transmission systems 


The WDM technique is often assimilated to the 
transmission domain. But, in fact, this technique 
is very useful in optics for many purposes. 

The WDM technique can be exploited for the 
following objectives: 


e To increase the capacity of a link without 
increasing the TDM bit rate at values difficult 
to manage. This first application was found 
rapidly. But there was also the requirement to 
reduce the number of active components. As a 
good example, the EDFA amplifying a group of 
wavelengths has rapidly replaced the classical 
single-channel repeater. 

e To facilitate the optical processing to avoid the 
interferometric noise when interleaving several 
pulses at high bit rate. The technique is used in 
optical time division multiplexing (OTDM), 
coding format techniques, multiplexing tech- 
niques (bit level or packet level), etc. 

e To ease the contention resolution. In fact, 
the wavelength becomes a new support, with 
different colors, and the color is selected in 
real time to avoid collisions. This technique is 


particularly used in optical packet switching 
network concepts. 

e To switch high granularities. It can be intro- 
duced to reduce the number of components in 
an optical architecture or in cases of low con- 
nectivity while addressing high throughputs. 


3.3.2 Benefits of the WDM 
dimension in multiplexed 
architectures 


In multiplexed architectures, the WDM dimension 
is exploited for different purposes. 

In the following, we will describe where the 
WDM dimension can be exploited efficiently and 
what kind of benefit we can expect. 


3.3.2.1 IN METROPOLITAN NETWORKS 


In the case of optical rings, the WDM dimension 
can be exploited first to provide an upgradability of 
the network in terms of allocated resources, simply 
by attributing progressively bands of wavelengths. 
The advantage of the band is mainly to relax the 
filtering constraints during the cascade of several 
nodes. This advantage was raised many times in 
circuit switching platforms. If we want to exploit 
the same WDM infrastructure while introducing 
packet techniques, the notion of band can then 
be advantageously exploited to reduce the latency 
in the transmitting parts. In fact, we can exploit 
the statistical time multiplexing of packets over 
a group of wavelengths to increase the chance to 
insert a packet in the line when we can have access 
to a group of wavelengths instead of one. 
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In summary, we can see that for optical rings, 
the WDM dimension can relax physical con- 
straints and, in addition, improve the performance 
in packet rings in terms of latency. 

To illustrate the benefit of the WDM dimension, 
the IST DAVID project is proposing a multiring 
optical packet MAN exploiting the WDM dimen- 
sion for these two main aspects. 


3.3.2.2 IN BACKBONE NETWORKS 


In backbone networks where the topology is gener- 
ally meshed, the WDM dimension is exploited for 
four main reasons: 


© Cost and compactness reasons: In optical cross- 
connect/packet switching architectures, the 
WDM dimension can be exploited to reduce 
the number of switching components to make 
the architecture compact and low cost. The 
objective is to switch in the WDM dimension 
as much as possible in the limit of the required 
power budget. In cross-connect architectures, 
the dynamic management of the wavelength 
dimension can, in addition, contribute to a 
drastic reduction of the number of intercon- 
nected fibers. 

e Power consumption reasons: By exploit- 
ing the WDM dimension, the processing is 
done at the WDM dimension. In electronics, 
a double demultiplexing is required: at the 
wavelength level and at the bit rate level. This 
is, for example, the case for coarse synchro- 
nization stages where the WDM dimension 
can be exploited efficiently at the WDM level 
to reduce the complexity of many electronic 
structures. 

e Physical reasons: The number of channels 
switched can also vary inside the architecture 
to preserve the optical signal-to-noise ratio at 
the output. For example, in the first stage of 
an optical architecture, the WDM switching 
can be done on a large number of channels 
(8 or 16), and then progressively reduced to a 
lower number, converging then to one channel 
switched. 

e Performance reasons: In packet architectures, 
the WDM dimension can be exploited to 
reduce the packet loss rate and the latency 
at the same time. When the packets can be 
addressed onto different wavelengths, the 
statistical multiplexing on these wavelengths 


let a freedom degree for the choice of the 
wavelength at the output of an optical packet 
switch. By this way, the contention can be 
avoided simply by reaffecting a new wave- 
length to a packet instead of putting it ina 
queue. The benefit is double because the packet 
loss rate can be improved, with the same 
amount of digital memory, and the latency is 
preserved because the packet will not experi- 
ence a queue. 


3.4 SPECIFICITY OF THE METRO 
AND PROPOSED SOLUTIONS 


3.4.1 Introduction scenario 


To take a pragmatic approach, it is fundamental to 
identify what could be the introduction scenario of 
a technology at the time. These paragraphs intend 
to position the optical switching technology for the 
metro in a timescale. 

Before developing these scenarios, it is also 
important to list the specificity of the metro part. 

In the metro, it is clear that the cost is prob- 
ably the most important parameter together with 
the performance. The cost addresses the hardware 
part but also the means adopted to exploit the 
bandwidth in the best way. Due to the traffic pro- 
file coming from the access, the important point 
is really to preserve the bandwidth. Thus, packet 
techniques will be preferred in this part of the 
network. 


3.4.1.1 WHAT ABOUT THE WDM 
DIMENSION? 


The WDM dimension is expensive in the metro, 
but there are also some arguments in favor of 
the introduction of the WDM in this part of the 
network. 

Due to the current traffic profile, not constrain- 
ing the bandwidth utilization too much (because 
the native bit rate is still quite low), a circuit plat- 
form is interesting. To reduce the cost of the WDM 
dimension while having enough resources to cope 
with the traffic volume and profile, probably the 
waveband approach is the best one. It guarantees 
upgradability (sub-band per sub-band), physi- 
cal performance (relaxing filtering constraints), 
and simplicity (circuit switching) with an existing 
infrastructure (fibers already installed). But if the 
native bit rate increases together with the variance, 


74 Optical switching and multiplexed architectures 


there will be a need to go into a lower granular- 
ity to have a better utilization of the optical band- 
width. Then the optical packet technique could be 
easily introduced making use of an existing infra- 
structure. The gap becomes natural. 

Thus, in the following presentation, we will 
introduce the circuit switching technique as a first 
step with a progressive migration toward optical 
packet switching techniques, which lead to a really 
efficient platform. 


3.4.1.2 SHORT-TERM INTRODUCTION: 
OPTICAL CIRCUIT SWITCHING 
TECHNOLOGY 


Due to the current traffic profile, optical circuit 
switching could be rapidly introduced in the mar- 
ket for many reasons: 


e The optical technology is mature enough and 
commercially available to envisage its utiliza- 
tion in optical platforms. It requires active 
devices such as integrated laser modulators 
(ILMs), receivers, and basic passive devices 
(demultiplexers, couplers, filters, etc.). 

e Itis avery simple technique that can be imple- 
mented rapidly. 

e The optical technology can exploit advan- 
tageously an existing WDM infrastructure. 
Currently, all the fibers installed are not 
exploited. 

e The management of such a network is mature 
enough to propose products. Management 
studies have been carried out leading to clear 
information models, protection scenario, and 
well-defined monitoring techniques. 

e ‘The traffic profile is not yet so critical to envis- 
age using such a technique in the metro. The 
bit rate at the output of personal computer (PC) 
is sufficiently low to have enough aggregation 
level at the output of the LANs. 

e This kind of platform can be easily upgraded 
with optical packet techniques to be fully com- 
patible with the future traffic profile driven by 
a powerful PC and fiber to the home (FTTH). 
This is also a strong argument for operators 
who want to invest with the possibility to 
upgrade their platform according to traffic 
constraints. 

Examples of implementation are ring topolo- 
gies, star topologies, and mesh topologies. In 
ring topologies, we can find the single ring or 


multiring approach. Figure 3.3a and b illus- 
trates these three interesting topologies. 


The particularity of this approach is in the opti- 
cal add/drop multiplexing structure. We can list 
mainly as follows: 


e Passive optical add/drop multiplexers 
e Dynamic optical add/drop multiplexers 


The passive structure can be used when the traffic 
matrix is very stable, whereas the dynamic struc- 
ture can allow some adaptation of the allocated 
resources to follow at least the envelope of the traf- 
fic matrix. This last type of the active structure is 
particularly interesting when the time constants 
are in the range of a few hours. 

Figure 3.4a and b describes the two main struc- 
tures of add/drop multiplexers. 

In the metro part, the WDM granularity for the 
upgradation of the network capacity will depend 
on the cost of the intervention. And in some cases, 
the upgradation of one wavelength is not the most 
cost-saving solution. For that purpose, it is impor- 
tant to identify the minimum WDM granularity 
for the upgradation. This minimum granularity, 
which can be on the order of few wavelengths (2 
or 4), can impact dramatically the network infra- 
structure. If we take into account the physical 
limitations in the cascade of filters, it is clear that 
the sub-band approach is an interesting approach. 
Figure 3.5 gives an overview of a ring MAN adopt- 
ing a sub-band strategy. 


3.4.1.3 MEDIUM-TERM INTRODUCTION: 
OPTICAL PACKET SWITCHING 
TECHNOLOGY 


The optical packet switching technology could be 
introduced as a required second step simply to face 
the traffic profile evolution. In that case, a packet 
technique will be required on the basis of the 
infrastructure already installed. The upgradation 
is made simply by changing the optical ring access 
node. Two new sub-blocks are then mandatory: the 
opto-electronic part and the electronic interface 
compliant with different classes of services. 

The more pragmatic approach is the adop- 
tion of a commercially available technology. 
Several concepts are proposed, all based on 
the adoption of ILMs and synchronous optical 
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Figure 3.3 (a) Conventional ring topology. (b) Multiring topology. 


NETwork (SONET)-like receivers. The resilient 
packet ring (RPR) concept is probably the most 
representative one. 


3.4.1.4 MEDIUM-TERM INTRODUCTION: 
FULL FLEXIBLE OPTICAL PACKET 
SWITCHING TECHNOLOGY 


In the case of long-term approaches, the adoption 
of an advanced technology is then mandatory. 

In particular, two important components are 
a fast tunable source and a fast wavelength selec- 
tor. These two components have already been 


demonstrated feasible (e.g., agility for the tunable 
sources, nippon telegraph and telephone (NTT) for 
the wavelength selectors). 

These components are: 


e At the transmission side (tunable laser): limita- 
tion of the latency in the output queue by 
exploiting the WDM dimension 

e At the receiving side (wavelength selector): 
the guarantee of the wavelength transpar- 
ency to receive any packet from the optical 
bandwidth 
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Figure 3.4 (a) Fixed optical add/drop multiplexer. (b) Dynamic optical add/drop multiplexer: 
an optical switching element guarantees the dynamic behavior. 
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Sub-band 1 


Figure 3.5 Sub-band introduction in an optical ring. 


e At the transit side (wavelength selector): the 
guarantee to introduce some fairness mecha- 
nisms when a high priority traffic with a mini- 
mum of latency is required 

e The structure of the optical add/drop multi- 
plexer is depicted in Figure 3.6a and b. 


3.4.2 Technology identification and 
feasibility 


The technology required to introduce these con- 
cepts can be split into two categories: 


© Commercially available technology 
e Advanced technology 


In the following, we will list the required com- 
mercial devices, and we will describe in more 
detail the potential new advanced technologies 
that open the way to really attractive system 
functionalities. 


3.4.2.1 COMMERCIALLY AVAILABLE 
TECHNOLOGY 


There are two types of devices: passive devices and 
active devices 


e For passive devices, we need couplers, attenua- 
tors, isolators, fixed filters, circulators, inter- 
connection fibers, connectors, (de)multiplexers, 


sa 


Sub-band 4 


access Sub-band 3 


etc. For filters and (de)multiplexers, we can 
have different specifications if we are address- 
ing channel filtering or band of channel 
filtering. 

@ Inall the cases, these components are products 
and today fit all the required specifications for 
system applications. 

e For active devices, we will need lasers, 

ILMs, slow tunable lasers, optical amplifiers, 
photodiodes, etc. Particular interest must be 
devoted to optical amplifiers because we can 
distinguish two types of optical amplifiers: 
fiber-based amplifiers and semiconductor 
amplifiers. For fiber-based amplifiers, largely 
introduced in point-to-point transmission 
systems, the main preoccupation is to find 
the best way to make these devices very cost- 
effective. For SOAs, even if they are on the 
market today, they currently suffer from a 
small market. However, the generic potential 
of such a component for different functions, 
such as optical gating or optical conversion, 
makes this component a promising one for 
future applications. 


3.4.2.2 ADVANCED TECHNOLOGY AND 
FEASIBILITY ISSUES 


This is the most promising technology to really 
propose something new and disruptive with 
respect to what exists today. 
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Figure 3.6 (a) Structure of the ring access node for an upgrade of the circuit switching platform into 
a packet switching platform. (b) Structure of the optical add/drop multiplexer including the required 
functionality to fully manage the transit traffic. 


In the following, we will illustrate thefourmain 3.4.2.2.1 SOA for gating 


opponents: A SOA is basically a laser in which the facets have 
been treated in order to eliminate the resonant cav- 
ity. Only the amplification medium is exploited. To 
be used as an optical gate, the SOA requires a high- 
frequency driver interconnected to the SOA. The 
driver sends a control signal, which can be forced 


e The SOA for gating functions 

The tuneable source 

e The wavelength selector, a strategic fast tune- 
able filter 

@ The packet mode receiver 
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at the ON or OFF state. Because of the short car- 
rier lifetime, a SOA can be switched with response 
times on the order of few tens of nanoseconds. At 
present, this component is exploited by many labo- 
ratories and has been demonstrated to be feasible 
for many applications. 

In the metro area, and according to the two net- 
work model described previously, different SOA 
structure are particularly interesting. 

There are roughly three types of SOA: SOA with 
high confinement factor or long active section to 
achieve wavelength conversion at high bit rate, 
SOA with low confinement factor or short active 
section to exploit more the linear characteristic of 
the gain, and SOA with an internal clamping to 
have a strictly linear response not to create distor- 
tion on the signal crossing the component. 

As for fiber amplifiers, there are three classes 
of SOAs: preamplifiers, in-line amplifiers, and 
boosters. 


3.4.2.2.2 SOA gate at the output 
of an ILM 


A SOA interconnected at the output of a modu- 
lated source is the basic schematic we can envisage. 

The advantage of this solution is that it is com- 
mercially available today. The main drawback is 
that there is a need to have control of the cross- 
gain modulation. One interesting solution is the 
use of a clamped-gain SOA to avoid any cross- 
gain modulation. The SOA used only in its lin- 
ear characteristic will provide enough gain to 
guarantee a sufficiently high ON/OFF ratio with 
no degradation of the pulse shape. This solu- 
tion is currently studied in different laboratories 
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(a) Optical 
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to analyze network concepts based on a packet 
transmission. 


3.4.2.2.3 Tunable source using a 
hybrid integration of a SOA 
gate array 


The SOA gate array is located in front of a laser array. 
At the output, a multiplexer and an integrated mod- 
ulator are interconnected. The SOAs see only contin- 
uous waves and can be switched, thus selecting the 
wavelength that must be transmitted. The SOAs are 
in a stable regime because the input power is a con- 
stant. Therefore, they do not experience any cross- 
gain modulation as could be the case in the previous 
use. The preservation of the signal quality (no deg- 
radation of the extinction ratio and no distortion of 
the bits) makes this SOA array an important device 
for the building of hybrid tunable sources. 

Figure 3.7a illustrates the structure of a tunable 
source based on a gate array, and Figure 3.7b shows 
an integrated four gate-array (OPTO+ realization). 


3.4.2.2.4 Tunable switching sources 
based on a sampled grating— 
Distributed Bragg reflector 
structure 


Another solution to build a fast tunable source is to 
use an Simple Grating Distributed Bragg Reflector 
(SG-DBR) laser while integrating a SOA section 
and an electro-absorption section. The following 
schematic illustrates the structure of the source. As 
for the hybrid tunable source, the SOA sees only 
a continuous wave that again prevents cross-gain 
modulation. The structure is currently studied by 
agility (Figure 3.8). 


Figure 3.7 (a) Structure of a hybrid tunable source using a SOA gate array. (b) Photo of a four gate- 
array, key building block of the hybrid tunable source. 
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Figure 3.8 Schematic of an integrated tunable source integrating a SOA section for amplification of 


the signal or optical gating. 


3.4.2.2.5 Wavelength selector 


The wavelength selector is probably one of the 
other key devices because it can be comparable 
to a fast tunable filter. The principle of operation 
is very simple. A first demultiplexer demulti- 
plexes the wavelengths, then each wavelength is 
selected or not, depending on the orders com- 
ing from the control part, and finally an output 
multiplexer regroups the wavelengths selected 
and contributes to reject the wideband amplified 
spontaneous emission coming from the SOAs. In 
principle, only one wavelength is selected among 
a group in a normal scenario. However, in the 
case of the third network scenario, the number 
of output wavelengths can vary from 1 to N (N 
being the total number of wavelengths at the 
input of the device). 

Figure 3.9a shows the basic structure of a wave- 
length selector. 


Optical gate 
, Multiplexer 


Demultiplexer 


Figure 3.9 Structure of a wavelength selector. 


3.4.2.2.6 Packet mode receiver 


In the case of the third scenario, another important 
technology is required, not at the optical level but 
more at the electronic level: the packet mode receiver. 

The packet mode receiver has to experience 
a continuous or noncontinuous packet stream 
exhibiting different packet phases (when aligned 
to a common reference clock) and suffering from 
a packet power dispersion. Such kinds of receivers 
are currently studied in different laboratories. We 
can cite NTT, Nichiden (NEC), Lucent, Alcatel, etc. 

Currently, this technology has been demon- 
strated to be feasible for use in different coding for- 
mats: Manchester but also return to zero (RZ) or 
non return to zero (NRZ). 

Figure 3.10a and b shows the performance 
obtained with a packet mode receiver operating at 
10 Gbit s*. 


3.4.3 Performance expected for the 
packet approach 


The performance will depend on the packet format 
adopted. 


e In the case of variable packets, the contention 
resolution becomes a bidimensional problem: 
to be able to insert a packet in the ring, we have 
to check if we have a void and if the void is 
large enough to insert the packet. 

e In the case of fixed packets, we need to also 
experience two dimensions, but in that case, they 
are not cojoined. The first problem to solve is the 
packet filling ratio, imposing some time out to 
have a good efficiency, whereas the second prob- 
lem is in the add part: check if there is a free slot. 
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Figure 3.10 (a) Characteristics of a packet mode receiver. Large power dynamic ranges and fully trans- 
parent to the packet phase fluctuation. (b) Eye diagram recorded: before and after the 10 Gbit s“ 
packet mode receiver. The phase is preserved and the amplitude completely equalized. 


Therefore, the problem is not the same for both 
cases, and the impact on the performance is not 
the same. 


@ In the case of variable packets, the main 
problem to solve will be the management of 
the voids in the ring so that the last ring access 
node in cascade is not blocked. 

e Inthe case of fixed packets, the main problem 
to solve is the choice of the good packet size 
to be compatible with the incoming traffic 
profile. 


Therefore, concerning the performance, the vari- 
able packet format exhibits a poorer performance 
compared to the fixed packet. This is the reason 
why the concatenated approach is probably the 
optimum solution, providing performance and 
reliability (packet rhythm present in the ring to 
ease monitoring aspects). 


3.5 OPPORTUNITIES FOR THE 
BACKBONE 


3.5.1 Introduction scenario 


In this part, the objective is to draw a progressive 
introduction of optical switching techniques for 
the backbone. For the short term, a cross-connect 
interconnected to a router could be envisaged 
to reach high-throughputs and high throughput 
routers. Both approaches are of prime importance 
because they correspond to two reality cases and 
two classes of products required to cover different 
network specificity. As a medium-term approach, 
we will present a multiservice Opto/Electro/Opto 


(O/E/O) network concept based on new features. 
Finally, we will describe how an all-optical packet 
switching network could become a reality in the 
longer term because of attractive features. 

Short-term introduction: smart router (router + 
cross-connect) versus multi-terabit class routers/ 
switches. 


3.5.1.1 WHERE SMART ROUTERS AND 
WHERE HIGH-THROUGHPUT 
ROUTERS? 


Smart routers are required for the dorsal net- 
work and where the aggregation is forced by the 
poor connectivity of the nodes and by the huge 
amount of traffic that must be transported. This 
type of product is particularly interesting when 
the traffic matrix is stable enough to make semi- 
permanent connections realistic and efficient. 
This is particularly the case in the United States 
when connections have to be established between 
states. The router is mandatory to collect the traf- 
fic coming from regional networks or national 
networks. 

High-throughput routers are required when 
it is not possible to process a part of the traffic 
with semi-permanent connections through opti- 
cal cross-connects. It can be the case for metro 
core networks, where the dynamicity of the traffic 
could require a transfer mode at the packet level 
to increase the network efficiency while optimizing 
the resource cost. 


3.5.1.1.1 Smart router 


Figure 3.11 shows the global structure of a smart 
router. The router collects the traffic at the packet 
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Figure 3.11 Smart router schematic. 


granularity. Packets are put into queues and are 
sent on a specific wavelength. The optical cross- 
connect has to manage high throughputs. The 
structure can be based on a MEM technology. 
The approach is very interesting when the traffic 
is strong enough to open large pipes, thus enabling 
the establishment of a waveband. The cross- 
connection at a waveband level is the best guar- 
antee of simplicity and reliability without creating 
breakthrough between the transmission system 
and the switching cross-connect. 


3.5.1.1.2 High-throughput routers 


Figure 3.12a shows the generic structure of a high- 
throughput router (multi-terabit-class router) 
exploiting an optical core in its center part, whereas 
Figure 3.12b and c shows a prototype realized and 
the bit error ratio (BER) curve. The optical matrix 
is basically a fast space switch, creating connec- 
tions at the packet level. The burst card is respon- 
sible for the packet format adaptation, whereas the 
line card is used to manage the incoming traffic. 
Buffers are at the input and the output of the opti- 
cal matrix, and in the burst and line cards. An 
internal speedup can be exploited to guarantee the 
full functionality even in the case of failure of one 
of the switching planes. This approach is currently 
adopted by many constructors. 

Both approaches are really important and dem- 
onstrate the potential of optical switching for basic 
functions, mainly focused on space switching: 
slow or fast. 


3.5.1.2 MEDIUM-TERM APPROACH: 
NETWORK CONCEPT BASED ON 
O/E/O NODES 


In the previous cases, solutions are based on a 
traffic profile assumption, which enables the cir- 
cuit switching technique for long-haul networks 
or enables the packet technique for small-scale 
backbones. However, the problem will occur when 
the application bit rate is increased together with 
the variance. This can rapidly happen if merging 
low bandwidth connections coming from mobile 
phones and high bandwidth coming from more 
and more powerful PCs reinforced by an optical 
connection giving access to very high bit rates per 
user. In this particular case, the variance can be 
increased leading to huge problems for efficient 
aggregation and forcing the telecommunications 
companies to think differently toward an optical 
packet platform for the backbone. 

This means that, due to the traffic profile 
evolution, the packet will be used extensively. 
And the key question could be as follows: how to 
realize an efficient network capable of handling 
the required capacity while providing a manda- 
tory flexibility? 


3.5.1.2.1 The generic approach 


In the network concept, we mainly exploit the 
edges to prepare the traffic in such a way that the 
traffic constraints inside the core of the network 
are relaxed. This means that all the complex func- 
tions are located in the edges such as aggregation, 
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Figure 3.12 (a) High-throughput router schematic. A fast optical switching matrix could be adopted 
in the center of the architecture. (b) Photo of a 640 Gbit s~' throughput optical matrix. (c) BER 


performance. 


switching per destination, classification, packeti- 
zation, traffic shaping, load balancing, and admis- 
sion control. The traffic profile, having a better 
shape, is then sent to the network. The core nodes 
will be responsible for the synchronization, the 
contention resolution, and the switching. In this 
case, the packet being created in the edges only 
simplifies the structure of the core router. 


3.5.1.2.2 What type of packet format? 


As a first introduction, and if possible, it can be 
envisaged to introduce an existing packet format. 
The G709 framing is currently being investigated 


to identify the potential of the concept, but other 
packet formats could be considered. 

For a second introduction, it seems clear that 
a more smaller packet size is required to relax the 
problems of aggregation. This also imposes a stan- 
dard on a packet that does not exist today. 

It must be noted that some universities and labo- 
ratories are currently studying the possibility of 
managing variable packets called bursts. The advan- 
tage of this approach is that the edge part is simpli- 
fied in its functionality, but the core nodes are more 
complex to control, and the overall performance is 
affected by a highly sporadic traffic profile. 
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Figure 3.13 Schematic of a core router in the concept of a packet network. 


3.5.1.2.3 Structure of the core node 


The core node is strongly simplified with respect 
to the first approach because all the complex func- 
tions are located in the edge nodes. 

Figure 3.13 describes a representative structure 
of such a core router. 

It can be noticed that the particularity of this 
architecture is to have synchronization stages and 
memory stages before and after the optical matrix 
in the core of the high-throughput routers. This 
optical matrix has been introduced in the first 
introduction scenario, so the step is quite easy to 
cross. The challenge is greater at the management 
level than at the node level. 


3.5.1.3 LONG-TERM SOLUTION: ALL- 
OPTICAL NETWORK 


In the previous scenario, we still needed a lot of 
costly O/E conversion. One key question is as fol- 
lows: can we efficiently reduce the number of O/E 
conversion stages in the core routers? 

For this, we need to solve three key problems: 


e The synchronization (to realign the packets 
before the switching) 

e The regeneration (to enable the cascade of 
several optical core nodes). 

e@ The contention resolution (to be able to offer 
the required packet loss rate and delay with 
respect to the Class of Services requirements) 


3.5.1.3.1 Synchronization stage 


In electronics, we have bit memories making the 
synchronization process very simple. The informa- 
tion is stored at the distant clock rhythm, and it 
is extracted at the local clock rhythm. However, 
the structure is quite complex because the process 
is done at low bit rate, thus imposing one stage of 
WDM demultiplexing and one stage of bit rate 
demultiplexing. 

As in optics, we do not have bit memories, 
and therefore, it is important to think differently. 
By imposing in the packet format a sufficiently 
large guard band, we simply need to preserve the 
phase between consecutive packets in order not to 
have collisions. This means that we need simple 
synchronization structures capable of having a 
resolution so as to avoid the problem mentioned. 
Typically, the resolution that can be handled is in 
the range of few nanoseconds. It is exactly what we 
will adopt for the synchronization. 


3.5.1.3.2 What kind of problem we need 
to face? 


The first problem is the thermal effect in the fiber 
modifying the index and creating variable delays 
during the propagation of data depending on 
the average temperature of the fiber. This means 
that all the WDM multiplexes will be affected. 
A WDM structure could bring a solution to this 
problem. 
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The second problem is that we do not have con- 
trol of the time jitter created in any optical switch- 
ing fabric. This packet jitter can be a blocking point 
in the cascade of several nodes. We need to control 
the packet jitter packet per packet, which indicates 
that the control must be done at the wavelength 
level but not at the WDM level. 

Therefore, in summary, we can easily solve 
the problem of the synchronization by using one 
or two stages and combining a processing at the 
WDM level and a processing at the wavelength 
level. In both cases, we operate at the line bit rate. 
The gain is in the simplicity of the synchronization 
process and in the complexity of the structure, 
making this synchronizer reliable. 


3.5.1.3.3 Regeneration stage 


The regeneration stage is mandatory if we want to 
exploit the maximum throughput of the optical 
switching matrix. The regeneration separates the 
two systems: the transmission and the switching 
in order to lead to a maximum throughput for the 
nodes. It is also the only way to cascade nodes when 
the line bit rate is high. Once again, as the process- 
ing is done at the line bit rate level, the structure of 
the optical regenerator is really simple, being a guar- 
antee of simplicity and robustness. 


3.5.1.3.4 Contention resolution 


The contention resolution is still an issue in optical 
architectures because we do not have any efficient 
optical memory. To solve the problem, we will 
exploit the WDM dimension and more particularly 
the statistical multiplexing over the different avail- 
able wavelengths. Therefore, the technique adopted 
is to avoid collision by reaffecting the wavelength 
to the packet at the output of the switching fab- 
ric. As the number of wavelengths per fiber can be 
limited, a recirculation buffer is then mandatory 
to solve the contention properly. By combining 
both techniques, the performance can easily reach 
the performance of a classical electronic switch 
but offering here all the switching capacity in one 
unique stage. 


3.5.1.3.5 Optical matrix adapted to 
optical interfaces 


If photodiodes have very low sensitivity, it is not the 
case when introducing all-optical interfaces. 


Therefore, the optical switching fabric must be 
adapted to these optical interfaces by providing the 
required power. Once again, different techniques 
can be proposed to achieve this goal. 


3.5.1.3.6 Generic structure of optical 
core nodes exploiting optical 
techniques 


A generic structure for an all-optical packet core 
is described in Figure 3.14. The particularity of 
this architecture is that there is no O/E conver- 
sion except for the electronic control and the 
memory, making this architecture cost-effective. 
Based on the previous concept, this architecture 
is simply an evolution of the core node exploit- 
ing optical functions for a better efficiency, 
and a potential line bit rate increases at a lower 
cost. This approach is fully compatible with 
future point-to-point transmission systems at 
40 Gbit s71. 


3.5.2 Required technology available 
technology 


For the short- and medium-term approaches, we 
need the following: 


e An optical technology for space switching 
For optical cross-connects, the MEM 
technology is probably the most promising 
technology. 

For fast optical matrix, we need the 

following: 

- For the optical matrix itself: free space- 
like technologies (Chiaro-like), tunable 
source-based technologies (Lucent), 
and SOA-based technologies (NEC, 
Alcatel, etc.). 

— For the receivers (still in the labora- 
tory): packet mode receivers capable of 
being fully transparent to the packet 
phase and capable of absorbing packet 
power variations arriving at the packet 
rhythm. Several companies have pro- 
posed such a kind of receiver: Lucent, 
NTT, NEC, Alcatel, etc. 

- Burst cards (still in the laboratory): fully 
electronic adaptation interfaces for the 
packet format used inside the fabric. 
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Figure 3.14 Structure of an optical core router exploiting the optical resources but including an 
electronic memory stage in recirculation to guarantee the performance. 


3.5.2.1 ADVANCED TECHNOLOGY 


To realize compact systems, there is first a need for 
an integrated technology. 

To realize the key sub-blocks presented previ- 
ously, we need the following: 


e For the fast optical switching matrix: compact 
tunable lasers, SOA gate arrays, and integrated 
wavelength selectors 

e For the optical regenerators: integrated Mach- 
Zehnder, self-pulsating lasers, etc. 

e For optical synchronization: optical gates 


3.5.3 Technological feasibility 
viability of advanced sub- 
blocks and feasibility issues 


3.5.3.1 OPTICAL MATRIX 


In the case of opto-electronic interfaces, the fast 
optical switching matrix (introduction scenario 
for the short and medium terms), the constraints 
are quite relaxed because the sensitivity of the 
receivers allows the design of switches with small 
output powers. Typically, reception powers as 
low as —10 dBm can be considered at the output 
of the switching fabrics. This also means that the 
amplification is limited in the core of the switch, 


leading to very compact and less power-consum- 
ing architectures. 

One typical matrix is the SOA-based matrix, 
requiring simply an amplification stage before the 
splitting stage is the broadcast-and-select archi- 
tecture. Another one is using tunable lasers and 
a wavelength router in the center. Both are repre- 
sented in Figure 3.15. 

The first architecture (Figure 3.15a) takes advan- 
tage of broadcasting functionalities and exploits 
robust devices such as ILMS or SOAs. However, 
it is limited in capacity, mainly due to large losses 
that have to be compensated by amplification. 
The optical signal to noise ratio (OSNR) affected 
mainly limits the capacity. 

The second architecture (Figure 3.15b) has a pri- 
ori a larger potential in terms of capacity because 
the architecture simply includes a tunable source 
and a passive wavelength router. However, this 
architecture is not adapted to the broadcast of the 
packets, and the fast tunable laser is probably the 
most challenging switching element. 

In the perspective of the long-term scenario, 
with optical interfaces, the constraint comes from 
the output power that must be high enough to be 
compatible with optical interfaces. In addition, the 
polarization is responsible for problems in the opti- 
cal regenerative structures, it is then fundamental 
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Figure 3.15 (a) Optical matrix-based on SOAs. (b) Optical matrix-based on tunable lasers. 
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Figure 3.17 A 32 SOA gate-array module. 


3.5.3.2 SOA TECHNOLOGY FEASIBILITY 


The SOAs have been used in different system appli- 
cations for amplification but also for wavelength 
conversions or for optical gating. To realize large 
systems as described previously, there is a need 
for a large amount of components. In this case, 
the integration is then mandatory to make such 
a matrix very compact. OPTO+ has designed and 
realized 32 SOA gate array modules. The module 
shown in Figure 3.17 includes 32 SOAs and their 
respective drivers. It has been used to realize a 640 
Gbit s! switching matrix. 


3.5.3.3 TUNABLE SOURCE FEASIBILITY 


The tunable source is a key component for many 
system applications. 

In the case of slow switching, we can identify 
the tunable wavelength conversion to provide the 
required flexibility to achieve a best utilization 
of the wavelengths in a network. Another evi- 
dent application is the replacement of ILMs with 
tunable sources. The advantage is mainly in the 
spare cost: instead of duplicating all the sources, 
the objective is to have only one source capable of 
emitting at any wavelength of the comb exploited 
in the WDM system. Finally, another application 
is the monitoring of optical switching systems. 
In this particular case, we need a compact struc- 
ture capable of testing the different wavelengths 
and paths of a switching system. To be compatible 
with the system constraints, the requirements are 
switching times in the range of milliseconds or 
more (for monitoring or for sources), large tunabil- 
ity, high output power, and good extinction ratio. 

In the case of fast switching, the main appli- 
cations are for the metro and the backbone. 


The tunability is fundamental in providing the 
required flexibility to exploit the WDM dimension 
in optical packet switching network concepts. The 
requirements are fast switching time in the range 
of a few nanoseconds, small tunability (four or 
eight channels), high output power, good extinc- 
tion ratio, and high ON/OFF to guarantee no 
impact of the cross talk on the signal quality. 

For slow structures, a DBR laser has been tested 
by different laboratories, and feasibility is not an 
issue. 

For fast structures, the main problem is the sta- 
bility of the wavelength. DBR can be considered 
if the tunability is small. These components have 
been demonstrated to be feasible, with switching 
times in the range of a few tens of nanoseconds. 
Another alternative is the selective source. Based 
on the cascade of a laser array, a SOA gate array, 
a phasar, and an integrated modulator, this struc- 
ture has been demonstrated to be feasible. 


3.5.3.4 OPTICAL SYNCHRONIZATION 


The optical synchronization is probably the most 
challenging function. The objective is to process 
the signal, if possible, in the WDM regime or at 
the wavelength level. The second important point 
is probably the lack of digital memory that forces 
designers to think differently. In that context, the 
synchronization cannot be done at the bit level. We 
assume that the synchronization can be efficient in 
a resolution of a few nanoseconds. When we have 
said that the other point is to identify the source of 
desynchronization with respect to a reference clock. 

The first source of loss of synchronization is 
the thermal effect in the transmission fiber. With 
a value between 40 and 200 ps km“, depending on 
the mechanical protection scheme adopted for the 
fiber, thermal effects can dramatically affect the 
phase of the packet streams. The WDM dimension 
can be advantageously exploited to make the syn- 
chronization stage compact and low cost. 

The second source is the loss of synchronization 
in switching fabrics due to a nonideal path equal- 
ization. This occurs at the packet level, imposing a 
synchronization at the wavelength level. 

The structure adopted is shown in Figure 3.18. 


3.5.3.5 OPTICAL REGENERATION 


The optical regeneration is one of the fundamental 
functions to make the approach realistic. To build 
all-optical networks while having optical switches 
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Figure 3.18 Optical synchronizer as proposed in the frame of the 1ST DAVID project. 


capable of handling terabits/second throughputs, 
the optical regeneration is then mandatory at the 
periphery of switching architectures. The main 
functions are the total reshaping of the pulses 
in the amplitude and in the time domains. To 
achieve this reshaping, several techniques can be 
adopted. We will retain one, particularly adapted 
to the characteristic of switching fabrics creating 
strong impairments between pulses or between the 
groups of pulses. The technique adopted is a total 
reshuffle of the pulses adopting nonlinear elements 
such as Mach-Zehnder or Michelson structures. 
The main distortions identified are the follow- 
ing: bits affected at the periphery of packets due 
to the switching regime, nonlinear effects such 
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as cross-gain modulation and four-wave mixing, 
cross talk (in-band and out-of-band), patterning 
effects by crossing active devices, noise accumula- 
tion, and jitter accumulation. 

To overcome these effects, a structure has been 
proposed in the frame of the DAVID project. This 
structure, presented in Figure 3.19, has the follow- 
ing characteristics: by using a cascade of two non- 
linear elements, the convoluted function creates a 
much more nonlinear transfer function, thus lim- 
iting the noise transferred in the first cascade. This 
has an important impact on the OSNR specifica- 
tion, which can be close to the back-to-back value 
(before the first stage) even in the case of a large 
number of cascades. 


Figure 3.19 Structure of the Re-amplification—Reshaping—Retiming (3R) regenerator as it is studied in 


the frame of the DAVID project. 
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To really reshuffle the pulses, different tech- 
niques will be adopted. We can retain an amplitude 
and a phase modulation creating an amplitude 
modulation in interferometric structures to really 
enhance the extinction ratio and remove the noise. 
The second technique adopted is a sampling tech- 
nique of each pulse with a clock to remove the jit- 
ter. The wavelength conversion technique will then 
be preferred to reallocate the wavelength in the 
correct wavelength comb of the new system. 


3.5.3.6 FEASIBILITY OF NETWORK 
CONCEPTS 


The feasibility of the approach was demonstrated 
for the first time in 1998, at the end of the ACTS 
KEOPS project. 

In this project, we have cascaded 40 network 
sessions error free at 10 Gbit s"! per wavelength 
demonstrating for the first time the possibility of 
building an all-optical network at a backbone scale. 


Figure 3.20 gives the network session tested and 
put in a loop to demonstrate the concept. 


3.5.4 Performance expected 


The performance is probably one of the most 
important indicators for the feasibility of such con- 
cepts. When the physical aspects are verified, the 
challenge becomes the performance in a real traffic 
environment. 


3.5.4.1 ENVIRONMENT AND SPECIFICITY OF 
THE BACKBONE 


If, in the metro, the capacity is limited, in the back- 
bone this is the major characteristic. To provide 
the capacity with a technology limited today to 10 
Gbit s“!, the only solution is in the exploitation of 
the WDM dimension. 

Therefore, the WDM dimension will be fully 
exploited to provide the required capacity but also 
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Figure 3.20 (a) Network session put in a loop to test the feasibility of an all-optical network. (b) Photo 
of the demonstrator. (c) BER curves giving the physical performance. 
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to avoid collisions due to the natural statistical 
multiplexing of packets on the wavelengths. 

The second particularity is the aggregation. 
Depending on the traffic profile, circuit switching 
or packet switching will be preferred. 


3.5.4.2 CIRCUIT SWITCHING TECHNIQUES 
FOR AN IMMEDIATE INTRODUCTION 


Circuit switching techniques can be envisaged in 
the first scenario as a transport layer to provide the 
capacity of transport. 

To be compatible with a DATA traffic, the cou- 
pling of a cross-connect with a packet router is, 
even today, the more pragmatic approach. This is a 
subject of strong interest for products that are used 
at present. 

However, this solution is not really cost-effec- 
tive because only two alternatives can be adopted: 


e All the wavelengths are connected to the packet 
router, and in this case, the number of TX/RX 
dramatically increases the cost per port. 

e Only a part of the wavelength is connected to 
the packet router, and in this case, the traffic 
matrix needs to be very stable. The determin- 
istic approach for the number of connections 
becomes nonrealistic when the traffic profile 
becomes highly statistical. 


3.5.4.3 OPTICAL PACKET SWITCHING 
TECHNIQUES TO COPE WITH A 
TRAFFIC PROFILE EVOLUTION 


In this case, what could be the benefit of the con- 
cept proposed? 


e The packetization at the edge level with an opti- 
mized size can reduce the latency in the creation 
of the packets. A second technique to acceler- 
ate the filling ratio is in the upgrade of the best 
effort in a premium class of service. Therefore, 
the advantage of this solution is that latency can 
be controlled to reduce the latency in the rest of 
the network. Calculations show that there is a 
global benefit in terms of end-to-end latency. 

e The exploitation of the WDM dimension once 
again reduces the latency. The packets cross 
the architecture, and they see only a transmis- 
sion path, even if it is switched. No buffers are 
crossed so the resultant latency is minimum. 


Figure 3.21 shows a table summarizing the per- 
formance in terms of packet loss rate established 


Total CoS 1 CoS2. BEnorecir. BE recirc 
load PLR PLR PLR 16\, PLR 
0.5 <10e-6 <10e-6 ~10e-04 <10e—06 
0.7 <10e-6 <10e-—6 0.012 <10e—06 
0.8 <10e-6 <10e—6 0.03 <10e—04 
0.9 <10e-6 ~10e-5 0.065 0.007 


Figure 3.21 Performance of the all-optical packet 
switching network concept. 


in the frame of the Reseau Optique Multiservice 
(ROM) project. It appears that on the three class of 
services considered, the end-to-end performance 
can be obtained. From dimensioning issues, it 
appears that for the WAN, a sum of 30% for Class 
of Service 1 (CoS 1) and CoS 2, and a best effort 
(BE) lower than 80% is tolerated. 

This demonstrates the viability of an all-optical 
concept and as a consequence the viability of the 
opto-electronic scenario. 


3.6 INTRODUCTION ON THE 
MARKET: CRITERIA 


3.6.1 Criteria of selection for a new 
technology 


To select a technology to task is not easy, but we 
can draw some conclusions: 


e Bit rate evolution at the user part creating a 
convergence of the bit rate in all the layers of 
the network and forcing a transfer of the traffic 
profile even in the backbone. This will create 
a need for high-flexible networks to cope with 
a traffic profile and not with a traffic matrix. 
Packet technique is today the only pragmatic 
solution with a coexistence of circuit switching 
techniques. 

e The key bit rate is 10 Gbit s. All the com- 
panies are focusing on 10 Gbit s™ that will 
develop a volume to make this technology 
compact and cost-effective. This also reinforces 
the packet technique because the bit rate is now 
totally independent of the physical bit rate; the 
granularity is offered by the packet size and not 
by the bit rate of the wavelength. 

e Ifa circuit switching technique is adopted 
today, it must be compatible with a migration 
toward packet switching. 
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3.6.2 Cost approach 


For the cost approach, everything will depend on 
the aggregation efficiency. In the following, we have 
computed the relative cost of different approaches, 
comparing mainly packet switching and circuit 
switching. 


3.6.2.1 METRO PART 


If the average load of a wavelength is high enough, 
due to an efficient aggregation process, then circuit 
switching is probably viable. However, if the load 
is low, below 20%, even if the cost of switches are 
more expensive, the gain in statistical multiplexing 
creates a real opportunity for packet techniques 
making them less expensive than circuit switching 
techniques. 

The main reason for this gain is probably the 
high cost of the wavelength due to expensive infra- 
structure costs, pushing all telecommunications 
companies to prefer an increase of the bit rate rather 
than an exploitation of the WDM dimension. 

Therefore, the tendency is probably packet tech- 
niques to decorrelate the bit rate from the granu- 
larity of switching and high-bit rates to adopt 
the most cheap technology while providing the 
required capacity. 

Figure 3.22 shows the areas where optical packet 
switching is better than circuit switching. 

From the figure, the numbers 2, 4, 6, and 8 indi- 
cate the ratio in terms of cost per port (wavelength) 
between an optical packet switch and a cross-con- 
nect targeting the same size (256 x 256) and the 
same technology. 

The load of a wavelength is the average load. 
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Figure 3.22 Need to exploit packet techniques in 
the near future. 


The ratio on the horizontal axis is a ratio between 
the wavelength transmission cost (including the 
installation costs) and the cross-connect port. 

For example, if the ratio between the cost of a 
wavelength in the transmission system and the cost 
of a cross-connect port is equal to 1 (red bar), opti- 
cal packet switching techniques are interesting: 


e 2: always, whatever the load is 

e 4: ifthe average load of a wavelength in circuit 
switching is lower than 33% 

6: if the average load of a wavelength in circuit 
switching is lower than 20% 

8: if the average load of a wavelength in circuit 
switching is lower than 13% 


Therefore, the tendency is the following: If the cost 
of a wavelength in a transmission system is high 
(the case of the metro where the installation cost is 
not negligible), or if the aggregation is not efficient 
enough forcing an average load very low (this is a 
serious tendency with the increase of the applica- 
tion bit rate and the sporadicity of the traffic pro- 
file), packet switching techniques always exhibit a 
better performance than circuit switching. 


3.6.2.2 BACKBONE PART 


As an example, we have computed, for two levels 
of aggregation, the load of a network with respect 
to the distance of the network (for the access to 
the backbone). It appears that in major cases, if 
the aggregation process is not enough, even in the 
backbone, the packet switching technique is the 
cost-effective solution. 

Figure 3.23 shows the importance of packet 
switching techniques, also for the backbone. 
This is one of several curves that could be drawn. 
However, it once again shows a tendency. 

The grooming or the aggregation efficiency 
depends on the traffic profile in large part. 
Therefore, we plot two indicative curves: 


@ One curve exhibits an efficient aggregation 
(a realistic case is when the constant bit rate 
(CBR) is higher than VBR or when the number 
of connection points is high to facilitate the 
grooming process). 

e The other curve exhibits a less efficient aggre- 
gation (a realistic case is when the variable bit 
rate (VBR) becomes dominant). In that case, 
we cannot have a stable traffic matrix, and we 
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Figure 3.23 Curve showing strong interest to 
introduce packet switching techniques even in 
the backbone. 


are addressing a sporadic traffic profile (a real- 
istic case if we have a bit rate convergence from 
the access to the backbone). 


The vertical axis indicates that the required average 
load of a wavelength is cost-effective. The horizon- 
tal axis indicates the average distance of a trans- 
mission system with respect to an average network 
session representative of the network considered. 
Therefore, the WAN starts for transmissions higher 
than 100 km. The calculations show the impor- 
tance of the time multiplexing. If the distance is 
long, there are a large number of clients sharing 
the same network infrastructure. Therefore, the 
cost per client is reduced. In addition, the cost of 
the installation of a wavelength is considered lower 
than that of the metro. The reason is that in the 
WAN, natural infrastructures are exploited to 
reduce the installation costs (such as highways or 
railways). If we are under the curve in bold, there 
is an advantage of introducing packet switching 
techniques. The grooming tendency gives values 
for the required load in circuit switching. 

For example, in the case of a good grooming effi- 
ciency, if the average distance of propagation of a rep- 
resentative network session is lower than 300 km, you 
will have a cost gain by exploiting packet techniques. 
In the case of a low grooming efficiency, packet 
switching techniques are always more efficient. 


3.7 CONCLUSIONS 


In this chapter, we have suggested the optical 
switching technique as a potential technique 
for the next generation of systems or networks. 


However, more importantly, an evolution scenario 
is given for the metro part and the backbone part 
describing what could be the most promising solu- 
tions. Optical packet switching techniques appear 
very attractive because they really offer a solution 
compliant with the traffic constraints. 

Circuit switching techniques will be introduced 
as a first step, but we must not forget optical packet 
switching techniques that will improve the band- 
width utilization. 

We have seen that there is no problem building 
any of the network concepts proposed, because all 
the functions have already been demonstrated to 
be feasible. The solution is now in the availability 
of the technology and in the cost. The progress on 
this integrated/low-cost optical technology will 
be fundamental for the future systems and could 
really provide new advantages with respect to clas- 
sical solutions exploiting electronics only. 

Today, we can imagine two scenarios: 

The first one will consist of the introduction ofa 
circuit switching platform to give a concrete answer 
to an immediate need at a lower cost. Circuit 
switching is probably the best today. However, we 
cannot forget the evolution of the traffic profile to 
increase the bit rate at the access part. Therefore, 
the migration scenario is an important argument 
to propose solutions that can be rapidly adapted to 
packet switching techniques with the best flexibil- 
ity and upgradability. 

The second scenario is in the adoption of packet 
switching techniques such as RPR for the metro or 
routers for the backbone. And we then need to think 
about competitive solutions with serious added val- 
ues to justify the introduction of optical techniques 
in the network. Optical packet switching is probably 
one technique that can emerge. In the metro part, 
the benefit is mainly in the exploitation of the WDM 
dimension and in the very simple in-line processing 
(without any buffer) to reduce the latency and the 
number of Transceiver (TRX). In the backbone part, 
the benefit is probably in the adoption of large pack- 
ets assimilated to containers in order to be able to 
exploit techniques to reshape and manage the traffic 
profile in the edge nodes and WDM techniques to 
reduce mainly the latency without constraining the 
capacity expansion in the core nodes. 

However, to build subsystems, there is also a 
need for an advanced technology. Without any 
advanced technology such as tunable sources or 
tunable filters, there will be no chance to provide 
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the functionality required to be really competitive 
on other aspects. Therefore, the development of 
this new technology (components and systems) is 
then fundamental and will position a constructor 
of equipment as a leader in the future market. 
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4.1 THE CAMERA TUBE AND 
CAMERA 


41.1 Introduction 


The history of photoconductive camera tubes using 
the internal photoelectric effect began in 1950 with 
the Vidicon camera tube [1]. A photoconductive 
camera tube performs both photoelectric con- 
version and signal storage on a photoconductive 
target, which is a vapor-deposited film of Sb,S, in 
Vidicon tubes. Although the Vidicon tube boasts a 
simple, small, and lightweight structure compared 
with the Image Orthicon tube that used the exter- 
nal photoelectric effect, it also suffers from several 
weak points such as large lag and dark current. 
The Vidicon, as a result, has not found much use in 
broadcasting-type television cameras that require 
high levels of picture quality. These weak points 
stem from the fact that excited carriers are easily 
trapped in Sb,S,-deposited film and that charge is 
injected into the target from external electrodes 
(injection-type target). However, the Plumbicon 
camera tube announced in 1963 features a target 
formed by a PbO film with a p-i-n structure that 
blocks the injection of charge from external elec- 
trodes (blocking-type target) [2]. With this type 
of target, the Plumbicon became the first photo- 
conductive camera tube to feature low lag and low 
dark current among other superior features. In the 
1970s, the Plumbicon rode the wave of change to 
color television broadcast facilities and became the 
leading tube for broadcast-class color television 
cameras replacing the Image Orthicon tube that 
used the external photoelectric effect. The research 
and development of photoconductive camera tubes 
was quite active. The 1970s, for example, saw the 
back-to-back development and commercialization 
of various blocking-type photoconductive cam- 
era tubes, including the Chalnicon using CdSe, 
CdSeO,, and As,S, as targets [3] and the Saticon 
using Se-As-Te [4,5]. 

In the 1990s, the solid-state imaging device such 
as the charge-coupled device (CCD) became the 
mainstream image sensor even for cameras used 
in the field of HDTV broadcasting. There is still a 
demand, though, for the camera tube, which was 
once the predominant type of image sensor in the 
form of the Plumbicon, Saticon, etc., for purposes 
of camera maintenance. As a consequence, mod- 
ern camera tubes normally employ past technology. 


An exception, however, is the high-gain avalanche 
rushing amorphous photoconductor (HARP) 
camera tube developed by the Japan Broadcasting 
Corporation (NHK) and Hitachi, Ltd, whose novel 
technology has become the focus of attention [6]. 
This camera tube achieves a level of sensitivity higher 
than that of CCDs and conventional pickup tubes 
by converting light to an electric signal in a photo- 
conductive target and simultaneously amplifying 
that signal by an avalanche multiplication effect. It 
features higher quality pictures than past ultrahigh- 
sensitivity image sensors using image intensifiers. 
The HARP camera tube can be used in high-defini- 
tion (HD) cameras and for a wide range of applica- 
tions, including scientific and medical fields. 

Section 4.1.2 describes the mechanism of the 
photoconductive camera tube covering past types 
and the ultrahigh-sensitivity HARP tube based on 
a new operating principle. 


4.1.2 Basic configuration and 
operating principle of 
camera tubes 


Camera tubes come in two main types: an image 
camera tube that uses an external photoelectric 
(photoemission) effect and a photoconductive 
camera tube that uses the photoconducting effect, 
a type of internal photoelectric effect. For the 
image camera tube, a typical example is the Image 
Orthicon developed during the monochrome 
television age. For photoconductive pickup tubes, 
there is the Vidicon, which can be called the 
original tube of this type, and tubes such as the 
Plumbicon and Saticon that played a great role 
in improving the performance of broadcast color 
cameras and achieving handheld video cameras. 
The HARP camera tube to be described here is also 
a photoconductive pickup tube. 

The basic configuration of the photoconduc- 
tive camera tube is shown in Figure 4.1. The tube 
consists of a photoconductive target that performs 
photoelectric conversion and charge storage, anda 
scanning electron beam system for reading stored 
charge. The operation of this tube is described later. 

Referring to Figure 4.2, light incident on the 
target generates electron-hole pairs in the film 
(using, for example, a blocking-type target as 
described later). Here, for an ordinary pickup tube, 
the scanning electron beam irradiates the target at 


4.1 The camera tube and camera 99 


Face plate 
Signal electrode 
Photoconductive film 
| | Beam-landing layer 
y 


a 


Photoconductive target 


Mesh electrode (G4) 


Electron gun 


Heater 
Cathode 


Beam limiting aperture 


Scanning electron beam system 


Figure 4.1 Basic structure of photoconductive camera tube. 
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low velocity, and voltage is applied in such a way 
that the signal-electrode side takes on a positive 
potential with respect to the scanning surface. 
This causes electrons to move toward the transpar- 
ent signal electrode and holes to move toward the 
target’s scanning surface. The material used for the 
target’s photoconductive film, however, generally 
has a high value of resistance with resistivity at 10” 
Q cm, and these charges accumulate at both ends 
of the target as a result. Meanwhile, the scanning 
electron beam system works to focus and deflect 
the electron beam emitted from the electron gun 
so as to make it incident on the target at low veloc- 
ity (low-velocity beam landing). This causes stored 
holes to recombine and disappear and an equiva- 
lent amount of signal current to flow from the 
transparent signal electrode. 

We can make this operation even easier to 
understand by focusing on a single pixel and using 
the equivalent circuit shown in Figure 4.3 consist- 
ing of resistors, a capacitor, and other elements. 
In the figure, the symbols r and C correspond to 
the resistor and capacitor making up the pixel. 
Here, the value of resistor r changes according to 
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Figure 4.3 Equivalent circuit for one pixel in 
camera tube. 


the intensity of the incident light. In addition, R, 
denotes the equivalent resistance of the scanning 
electron beam, S the scanning switch, R, the load 
resistance provided externally, and E, the power 
supply for applying voltage to the target. The arrival 
of the scanning electron beam on a certain pixel 
corresponds to the closing of switch S at which 
time C charges via R,. Conversely, departure of the 
beam from that pixel corresponds to the opening 
of switch S at which time the charge accumulated 
in C discharges via r. The discharge period is deter- 
mined by the number of frames per second and is 
1/30 (one frame’s worth) in principle in the NTSC 
system. In actual camera tubes, however, the beam 
is broad compared to the scanning line interval and 
beam scanning overlaps the odd and even fields. 
As a consequence, the discharge period becomes 
1/60s, or one field’s worth, despite interlaced scan- 
ning. Because the value of r changes according to 
the intensity of incident light, the amount of dis- 
charge is large for a bright subject and small for 
a dark one. This means that the charging current 
flowing to C from the power supply when S is closed 
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Figure 4.4 Focusing and deflection system of camera tube. 


is equivalent to the current, ie., signal current, 
modulated by the brightness of the subject. 


4.1.2.1 TYPES OF SCANNING ELECTRON 
BEAM SYSTEMS AND THEIR 
FEATURES 


The scanning electron beam system of the cam- 
era tube described here is classified in terms of 
electric-field/magnetic-field combinations used 
for focusing and deflecting the beam. There are 
the —_ electromagnetic-focusing/electromagnetic- 
deflection (MM) type, the electrostatic-focusing/ 
electromagnetic-deflection (SM) type, the 
electromagnetic-focusing/electrostatic-deflection 
(MS) type, and the electrostatic-focusing/electro- 
static-deflection (SS) type as shown in Figure 4.4, 
respectively. Of these, the MS type features high 
resolution up to the corners of the screen in prin- 
ciple. The SS type, moreover, requires no coil for 
focusing and deflection and can therefore achieve a 
compact, light, and low-power configuration. 


4.1.2.2 INJECTION AND BLOCKING TYPES 
OF TARGETS 


Targets can be divided into injection type and 
blocking type. The structure of the injection-type 
target is such that charge comes to be injected into 
the photoconductive film from both the signal- 
electrode side and electron-beam-scanning side 


or from either one of these sides. As a result, an 
amplification effect called “injection amplification” 
occurs within the target and high sensitivity with a 
quantum efficiency of 1 or greater can be obtained. 
(Quantum efficiency is defined here as the num- 
ber of output electrons per number of unit incident 
photons in the target; it is denoted as 1.) This injec- 
tion amplification effect is described here using the 
target shown in Figure 4.5 in which electrons are 
injected from the beam scanning side. Now, one 
hole created by one incident photon will come to 
be stored on the beam scanning side as shown in 
the figure. Then, when the scanning electron beam 
comes to read this hole, the hole will not immedi- 
ately recombine with an electron but will instead 
have to wait until N electrons are first injected into 
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Figure 4.5 Operating principle of electron- 
injection-amplification target. 
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the target until it can recombine with the (N+1) 
th electron. This means that N+1 electrons flow 
out of the target from the signal electrode. In 
other words, this operation provides an amplifica- 
tion effect with a gain of N+1 in which electrons 
greater than the number of incident photons can 
be read out to an external circuit. Before the inven- 
tion of the HARP target, the injection-type target 
was researched as the only photoconductive target 
that could achieve high sensitivity of 7 > 1. This 
target, however, suffers from sharp increases in lag 
and dark currents under high-sensitivity opera- 
tion and consequent drops in picture quality, and 
could not, as a result, be viewed as a new approach 
to camera tubes. 

On the other hand, a blocking-type target has a 
structure in which both the signal-electrode side 
and electron-beam-scanning side block the injec- 
tion of charge from the outside. As an example, 
Figure 4.6 shows a Saticon target whose main 
component is amorphous selenium (a-Se). Here, 
the injection of holes is blocked at the junction 
between the first layer consisting of Se+As (arse- 
nic) film and the SnO, (tin oxide) signal electrode 
and CeO, (cerium oxide) film. The injection of elec- 
trons, meanwhile, is blocked by the fifth layer con- 
sisting of Sb,S; (antimony trisulfide) on the beam 
scanning side. In addition, the second layer in the 
figure is a layer to increase sensitivity; layer 3 plays 
the role of conducting holes created in the first and 
second layers to the fourth layer; and the fourth 
layer acts to decrease storage capacitance in the 
target and reduce capacitive lag. A target such as 
this that blocks the injection of charge from exter- 
nal electrodes means that increase in dark current 
will be small even for an increase in applied volt- 
age. A sufficient electric field (1.25107 Vm"! in 
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Figure 4.6 Structure of Saticon target. 


the Saticon) can therefore be given to the photo- 
conductive film. As a result, most electron-hole 
pairs excited by incident light can be separated by 
a strong electric field to form a signal current, and 
a relatively high level of sensitivity can be obtained 
as a consequence. This kind of target also features 
little dark current and low lag. For the above rea- 
sons, targets for recently developed photoconduc- 
tive camera tubes have been of the blocking type 
for which high picture quality can be obtained. Ina 
target of this type, however, a scanning-beam elec- 
tron immediately recombines with a hole stored 
on the scanning side and subsequently disappears, 
as shown in Figure 4.2. To put it another way, the 
number of scanning-beam electrons landing on 
the target per one hole is simply one, which in turn 
means that no more electrons than the number of 
incident photons can, in principle, be read to the 
outside, ie., the limit of sensitivity is 7=1. In the 
past, this was referred to as the sensitivity barrier 
in blocking-type targets. 


4.1.3 Ultrahigh sensitivity 
photoconductive camera tube 


The more sensitive imaging devices are, the bet- 
ter they are able to produce clear pictures even 
in low lighting conditions. Consequently, achiev- 
ing increased sensitivity has always been the 
most important theme throughout the more than 
80-year history of research into imaging devices, 
and even today it is a matter of fierce competition 
between researchers. 

From the 1960s to the 1980s, NHK researched 
and developed a variety of high-sensitivity imag- 
ing devices such as secondary electron conduction 
tubes and I-CCDs, which are made by combining 
image intensifiers with CCDs. 

But because these conventional high sensitivity 
devices had problems associated with their picture 
quality, such as high noise levels and poor resolu- 
tion, demand grew in the 1980s for the develop- 
ment of imaging devices that combine high 
sensitivity with high picture quality. During this 
period, HDTV camera using camera tubes such as 
Saticon began to be used in practical applications. 
However, their sensitivity was still rather poor, and 
as reporting breaking news programs and science 
programs began to attach increasing importance 
to camera sensitivity even for standard TV broad- 
casts, it became even more important to develop a 
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TV camera with high sensitivity and high picture 
quality, capable of producing clear images even 
from poorly lit subjects. 

Consequently, from about 1980, NHK began a 
fresh study with the aim of realizing a high sen- 
sitivity and high picture quality imaging device 
suitable for use in HDTV applications. This study 
focused on using the amplification effect of an a-Se 
to obtain a high level of sensitivity. 

In 1985, it was found that when an a-Se target in 
the camera tube is operated in a strong electric field 
of about 10° Vm-!, continuous and stable avalanche 
amplification takes place, allowing high sensitiv- 
ity to be obtained with little picture degradation. 
Based on this discovery, NHK and Hitachi, Ltd, 
went on to develop a new kind of imaging device 
called HARP camera tube, which has been studied 
to this day to achieve further increases in sensitiv- 
ity and a wider range of applications. 

HARP camera tubes, which have achieved sen- 
sitivities roughly 100 times greater than CCDs, are 
used not only for standard TV broadcasts, but also 
in HDTV handheld cameras, and are used in the 
production of night-time news flashes and special 
programs such as imaging the aurora. 

The following section summarizes the research 
conducted so far into HARP camera tubes and 
describes the features of ultrahigh sensitivity cam- 
eras that use them. 


4.1.4 The development of HARP 
camera tubes 


4.1.4.1 THE INVESTIGATION OF HIGH 
SENSITIVITY AND HIGH PICTURE 
QUALITY IMAGING DEVICES 


The basic technique employed in conventional 
high-sensitivity imaging devices involves direct- 
ing the incident light toward a photocathode and 
accelerating the photoelectrons emitted from this 
surface with a large voltage inside a vacuum. For 
example, in the I-CCD mentioned earlier, these 
accelerated electrons impinge on a fluorescent 
surface where they form a bright picture that is 
imaged using a CCD. Using a photocathode has 
the advantage that a high level of sensitivity can 
be obtained quite easily, so it has also been devel- 
oped in various other types of high-sensitivity 
imaging devices, such as the silicon intensifier 
target tube [7]. 


But conventional high-sensitivity imaging devices 
that use a photocathode also suffer from drawbacks, 
such as the following: 


1. Because they use the external photoelectric 
effect for photoelectric conversion, it is difficult 
to increase their conversion efficiency to values 
close to 100%. A low conversion efficiency 
results in increased picture quality degradation 
due to shot noise. 

2. Picture quality can also be degraded by other 
forms of noise that are characteristic to the 
device, such as ion feedback noise that arises 
from residual gases inside the tube. 

3. It is difficult to achieve the high resolution 
needed for HDTV cameras with a compact 
imaging device. 


To address these problems, NHK decided to work 
on developing high sensitivity and high image 
quality imaging devices that do not rely on the use 
of a photocathode and to investigate the possibility 
of achieving substantial increases in the sensitivity 
of the photoconductive target in Saticon tubes that 
were also used in HDTV cameras. 

In the mid-1980s when this investigation got 
underway, the mainstream of imaging devices had 
started to shift from the camera tubes to solid-state 
devices (CCDs). Not only are CCDs compact, light- 
weight, easy to use, and highly reliable, but it is also 
possible to suppress the noise from their internal 
amplifier circuits to a much lower level than can 
be achieved with the external amplifiers used with 
camera tubes. CCDs, therefore, seemed to have 
greater potential than camera tubes in terms of 
sensitivity. 

Nevertheless, NHK decided to take a fresh 
look at photoconductive camera tubes because 
it was considered the targets in these camera 
tubes to have the best potential for meeting the 
conditions necessary for realizing the ultimate 
ultrahigh-sensitivity imaging device, i.e., the 
conditions for obtaining a high S/N ratio at the 
theoretical limit. 

To achieve the ultimate ultrahigh-sensitivity 
imaging device, the following three conditions had 
to be met: 


1. All of the incident photons must be guided to 
the photoelectric conversion part (100% fill 
factor). 
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2. All the photons must be converted into 
electrons in the photoelectric conversion part 
(100% photoelectric conversion efficiency). 

3. It must be possible to amplify the converted 
electronic signal without adding any noise. 


A camera tube has a fill factor of 100% and thus 
satisfies condition (1). Also, because the target uses 
the internal photoelectric effect, it is also easier to 
increase the photoelectric conversion efficiency 
than in imaging devices based on the external pho- 
toelectric effect, such as image devices using inten- 
sifiers. In other words, it can also satisfy condition 
(2). Accordingly, if a way can be found to satisfy 
condition (3), then it will be possible to obtain 
imaging devices with unparalleled sensitivity and 
picture quality. To achieve this, it is first necessary 
to bring about some form of amplification within 
the target. Based on this reasoning, NHK began to 
research targets with in-built amplification capa- 
bilities, as described later. 


4.1.4.2 TARGET TYPES AND AMPLIFICATION 
EFFECTS 


As described earlier, targets can be classified 
into two types: an injection type where electrical 
charge is injected into the film from outside and 
a blocking type where the injection of electrical 
charge is blocked. In the injection type, an ampli- 
fication effect is obtained whereby the external 
circuitry extracts a greater number of electrons 
than the number of photons incident on the tar- 
get. Although blocking types result in good picture 
quality with low lag and low dark current, it has 
not been possible to produce amplification effects 
in such targets. We, therefore, concentrated our 
studies on injection-type targets. 

However—to cut a long story short—the HARP 
camera tube target is actually a blocking type, 
not an injection type. Injection-type targets suf- 
fer from drawbacks such as a susceptibility to 
increased dark current and the amplification of 
lag by the same gain. However, the reason why we 
applied ourselves to the study of injection-type tar- 
gets is because at that time there were thought to be 
no other ways of conferring amplification proper- 
ties to the target. 

Although NHK’s research initially focused on 
injection-type targets for these reasons, in 1985, 
an unusual experiment was conducted involv- 
ing making a blocking-type target behave like an 


injection-type target by forcibly applying a large 
voltage. This led to the discovery of a phenom- 
enon whereby the sensitivity is increased within 
the photoelectric conversion film in a manner that 
could not be explained in terms of charge injec- 
tion. This was the starting point for the develop- 
ment of the HARP camera tube. The experiment is 
described below. 


4.1.4.3 EXPERIMENTAL DETAILS 


At the time, NHK was working on an injection- 
type target using a-Se, which can produce an 
amplification effect with a relatively weak electric 
field (510° Vm-!). A weak electric field reduces 
the efficiency with which photons are converted 
into electrons, and thus gives rise to problems such 
as increased shot noise. 

It was deduced that this noise could be reduced 
by subjecting the target to a suitably strong electric 
field before injecting the charge, so the target was 
initially designed for an experiment to inject elec- 
trons as shown in Figure 4.5 by forcibly applying 
a very high voltage to a blocking type target that 
has a structure in which the injection of charge is 
blocked. Figure 4.7 shows the structure of the pro- 
totype target we produced for this experiment. The 
photoconductive film in this target is a 2- m-thick 
a-Se film formed by vacuum deposition (vacuum: 
1.33X10-*Pa). The target is of the blocking type. 
Like the Saticon, it blocks the injection of holes at 
the junction formed between the a-Se film and the 
transparent signal electrode (indium-tin oxide) 
and CeO, layers and blocks the injection ofelectrons 
through the use of an Sb,S, layer. But, in contrast to 
the Saticon, it does not include high-concentration 
Te- and As-doped layers to concentrate an electric 
field near the signal-electrode interface, which 
means that even better hole-injection-blocking 
characteristics can be expected. Also, for the Sb,S, 
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Figure 4.7 Structure of prototype target. 
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layer, inert-gas (Ar) pressure at the time of deposi- 
tion was set to 31.9 Pa considering the porous-film 
fabrication conditions that would suppress the 
emission of secondary electrons even when target 
voltage is exceptionally high and promote stable 
low-velocity beam landing. The thicknesses of the 
CeO, and Sb,S, films in the target are 20 and 100 
nm, respectively, indicating that these two films 
are considerably thinner than the Se layer. Target 
thickness can, therefore, be regarded as essentially 
the same as that of the Se layer. 


4.1.4.4 CURRENT-VOLTAGE 
CHARACTERISTICS 


Figure 4.8 shows target current-voltage character- 
istics of the experimental tube with the prototype 
target. Blue light (center wavelength: 440 nm) is 
used here as incident light. From Figure 4.8, we see 
that signal current increases rapidly as target volt- 
age increases from 0V but comes to saturation, at 
least temporarily, starting at about 20 V. This satu- 
ration region is thought to correspond to the state 
where most electron-hole pairs excited in the a-Se 
film by incident light have come to separate under 
a strong electric field within the film becoming 
signal current as a result. As target voltage contin- 
ues to increase; however, we see the phenomenon 
of signal current again rising dramatically beyond 
this saturated region. 
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Figure 4.8 Signal current and dark current versus 
target voltage. 


Quantum efficiency 1 with respect to blue light in 
a-Se film is estimated to be 0.9 for an operating elec- 
tric field of 8x 10” Vm" [8]. In the experimental tube, 
this electric-field strength corresponds to a target 
voltage of 160 V and this fact enables us to establish 
a scale for 1 on the right vertical axis in Figure 4.8. 

This scale tells us that n exceeds 1 at a target volt- 
age of 180 V and reaches 10 at 240 V. Furthermore, 
at a target voltage of 260, n is 40 and extremely 
high sensitivity occurs in the experimental tube. 
As for dark current, it also becomes large in the 
high-voltage region, but it is nevertheless quite 
small at 0.2nA under target-voltage operating 
conditions of 240 V (n=10). As described earlier, 
the phenomenon of increased sensitivity with n 
exceeding 1 has been observed when operating an 
a-Se photoconductive target with a blocking-type 
structure in a very strong electric field. 


4.1.4.5 THE ORIGIN AND OPERATING 
PRINCIPLE OF THE HARP CAMERA 
TUBE 


Because 1 was greater than 1 in the prototype 
target, it was thought that an amplifying action 
occurred due to the expected blocking type 
target behaving as an injection type. However, 
we found that this target exhibited hardly any 
dependence of lag on the applied voltage, which 
one would normally expect to see in an injec- 
tion-type target. Specifically, according to the 
operating theory of injection-type targets, the 
effective storage capacitance of the film increases 
by an amount corresponding to the magnitude 
of the gain, so when the applied voltage exceeds 
180 V the lag ought to increase steeply. However, 
such phenomenon was not seen in the prototype 
target. We, therefore, performed several new 
experiments. As a result, it became clear that the 
phenomena exhibited when ny > 1 conform to the 
following properties: 


1. The effective storage capacitance of the target is 
constant and does not increase even when n is 
greater than 1. 

2. The amplification effect is dependent on the 
direction of the incident light, and compared 
with the face plate side, the degree of amplifica- 
tion is smaller when light is incident from the 
beam scanning side. That is, the target has a 
higher gain for hole transport than for electron 
transport. 
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3. When the electric field inside the target has 
a constant intensity, the amplification gain 
increases as the a-Se layer gets thicker. 


Based on these findings, it was concluded that the 
amplification action obtained with the test targets 
is not due to the injection of charge but is due to 
an avalanche amplification effect that occurs sta- 
bly and continuously in blocking-type targets for 
imaging devices. This marks the origin of HARP 
camera tubes that use the avalanche amplification 
phenomenon, and in this way HARP camera tubes 
were born out of research into completely different 
targets where charge injection activity is taken into 
consideration. 

Figure 4.9 schematically illustrates the operat- 
ing principle of this target. Electrons and holes pro- 
duced by the incident light are accelerated inside 
the target, to which a strong electric field of about 
10° Vm~! is applied, and new electron-hole pairs 
are then generated successively by impact ioniza- 
tion. As a result, a large number of electrons are 
extracted from the signal electrode for each inci- 
dent photon. The high sensitivity of HARP camera 
tubes is due to the avalanche multiplication effect in 
the a-Se target and the fact that this multiplication 
results in hardly any added noise. Furthermore, a 
HARP camera tube also has superior lag character- 
istics and resolution as mentioned later. 


4.1.4.6 BASIC STRUCTURE OF TARGETS FOR 
PRACTICAL USE 

Figure 4.10 shows the basic structure of a HARP 

camera tube target for practical use. Like the pro- 

totype target as shown in Figure 4.7, it uses layers of 

a-Se, CeO,, and Sb,S,. However, the target for prac- 

tical use also contains arsenic (As), lithium fluoride 
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Figure 4.9 Operating principle of the HARP 
target. 


(LiF), and tellurium (Te). The arsenic suppresses 
crystallization of the a-Se, thereby preventing the 
generation of defects. The lithium fluoride serves to 
control the electric field inside the target, and pre- 
vents the generation of defects by decreasing the 
electric field near the interface between the a-Se film 
and the CeO). The tellurium increases the target’s 
sensitivity to red light and is added to the target for 
the red channel. The parts to which LiF and Te are 
added are exceedingly thin and are no more than a 
few per cent of the overall target film thickness. 


4.1.5 The evolution of HARP camera 
tubes 


When HARP camera tubes were first devel- 
oped, they had a target film thickness of 2pm 
and their sensitivity was about 10 times that of 
conventional Saticon camera tubes (Figure 4.11). 
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Figure 4.10 Structure of the HARP target. 


Figure 4.11 Appearance of the ultrahigh- 
sensitivity HARP camera tube. 
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Table 4.1 The evolution of HARP camera tubes 


Year in which developed 


1985 


Target film thickness (1m) 2 
Sensitivity (relative to a 
Saticon tube) 


Lag (50 ms) 4.6% 


About 10x 60-80x 


1990 1995 
6-8 25 
About 600x 


Below measurable 
limit (theoretical 
value: 0.09%) 


1.5%-1.2% 


By taking advantage of the fact that these targets 
also have high resolution, HDTV camera tubes 
were developed, which were put to use at the 
Seoul Olympics [9]. But as they came to be used 
increasingly for TV programs, there was a demand 
for even higher sensitivity. Furthermore, because 
CCDs, which had by then become the most com- 
mon imaging devices, have virtually no lag prob- 
lems, there was also a demand for improving the 
lag characteristics of HARP camera tubes. 

Ascan beseen from the operating principle shown 
in Figure 4.9, the avalanche multiplication factor 
of a HARP camera tube increases as the target gets 
thicker, resulting in greater sensitivity. The lag also 
decreases as the film thickness increases. This is due 
to reduction of the target storage capacitance, which 
dominates the lag characteristics. Consequently, by 
increasing the thickness of the layer consisting pri- 
marily of a-Se in the target to around 6-8 1m, it was 
able to develop a practical HARP camera tube with 
improved lag characteristics that had 60-80 times 
the sensitivity of a Saticon tube. 

Furthermore, experiences such as news 
coverage of the Kobe earthquake disaster in Japan 
(1995) resulted in increased demand for the devel- 
opment of an ultrahigh-sensitivity imaging device 
capable of producing, for example, aerial night- 
time shots of the stricken region, which had been 
plunged into darkness due to power failures. NHK, 
Hitachi, Ltd, and Hamamatsu Photonics K.K., 
therefore, studied ways of making the HARP cam- 
era tube even more sensitive, as described later, and 
developed an ultrahigh-sensitivity HARP camera 
tube with the target film thickness increased to 
25m whose sensitivity is 600 times greater than 
that of a Saticon tube. The lag of this camera tube 
was reduced to a level below the measurable limit. 
Note that because modern CCDs are about six 
times as sensitive as Saticon tubes, the sensitivity of 


this HARP camera tube is about 100 times greater 
than that of a CCD. 

Table 4.1 shows how the target film thickness, 
sensitivity (relative to a Saticon tube), and lag (the 
value 50 ms after the incident light is cut off) of 
HARP camera tubes have changed over the years. 
In the following, characteristics of the HARP 
camera tube with a target film thickness of 251m, 
whose sensitivity exceeds that of the naked eye, 
will be mentioned. 


4.1.6 Principal characteristics of 
the ultrahigh sensitivity HARP 
camera tube 


This section describes the principal characteristics 
of the ultrahigh-sensitivity HARP camera tube 
with a target film thickness of 25m (2/3 in MM 
type, shown in Figure 4.4) [10,11]. 


4.1.6.1 SENSITIVITY 


Figure 4.12 shows how the signal current (which 
represents the sensitivity) and dark current vary 
with the applied voltage. By way of comparison, 
this figure also shows the signal current mea- 
sured using an ordinary (Saticon) camera tube 
subjected to the same amount of incident light. 
With an applied voltage of 2500V, the HARP 
camera tube is over 600 times more sensitive than 
the Saticon tube. The dark current in this case is 
about 2nA. 

Note that since the HARP camera tube’s sen- 
sitivity can be varied greatly by controlling the 
applied voltage, it can be adjusted to the sensitiv- 
ity of ordinary imaging devices by reducing this 
voltage. In other words, it can also be used to take 
pictures in very bright situations such as day lit 
outdoor scenes. 
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Figure 4.12 Signal current and dark current 
versus target voltage in the ultrahigh-sensitivity 
HARP camera tube. 


4.1.6.2 SPECTRAL RESPONSE 
CHARACTERISTICS 


Figure 4.13 shows the spectral response character- 
istics of an ordinary HARP target and a Te-doped 
target with increased sensitivity to red light (for use 
in the red channel). In the a-Se layer of the HARP 
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Figure 4.13 Spectral response characteristics of 
the ultrahigh-sensitivity HARP camera tube. 


target, there is little cancellation of charge due to 
recombination, even close to the junction interface 
with the CeO,, so a high photoelectric conversion 
efficiency is obtained for short-wavelength (blue) 
light that is absorbed in this part. On the other 
hand, because the band gap of a-Se is about 2.0 eV, 
the limit of sensitivity to light of longer wave- 
lengths (red light) is the corresponding wavelength 
which is about 620 nm (standard type). Because 
the red channel of a color camera should exhibit 
sensitivity up to about 700 nm [12], the target for 
the red channel is made more sensitive to red light 
by doping it with Te. 


4.1.6.3 LAG CHARACTERISTICS 


The lag characteristics are determined by the 
storage capacitance of the film and the electron- 
beam temperature of the scanning electron 
beam. When the target film thickness is 25 pm, 
the theoretical value of the lag in the third field 
after turning off the incident light is 0.09%. This 
value is calculated from a target-layer storage 
capacitance of 130 pF and an electron-beam 
temperature of 3000 K. As shown in Table 4.1, 
the lag of the HARP camera tube is below the 
measurable limit when the target film thickness 
is 25m. 


4.1.6.4 RESOLUTION CHARACTERISTICS 


The HARP camera tube has a limiting resolution 
of more than 800 TV lines, and no degradation 
of resolution due to the avalanche multiplication 
action was observed. Because the resolution char- 
acteristics of the camera tube are controlled by the 
spot diameter of the scanning electron beam, even 
higher resolution can be obtained by combining 
with electron optics having a smaller beam spot 
diameter. 


4.1.6.5 NOISE 


The magnitude of the noise added as a result of 
avalanche multiplication is represented by the 
excess noise factor, but in the case of a HARP 
camera tube its value is approximately 1. In other 
words, the amplification achieved with this cam- 
era tube is almost noise free. The reason why this 
result is obtained is thought to be because of the 
large ratio of the respective ionization coefficients 
of holes and electrons in the a-Se film and because 
the beam scanning side of the photoelectric con- 
version film is in a floating state (its potential is not 
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fixed), whereby a type of negative feedback action 
takes place which controls the noise. 

In this way, the HARP camera tube combines the 
characteristics needed for high sensitivity with the 
characteristics needed for superior picture quality. 


4.1.7 Ultrahigh sensitivity HARP 
cameras and their applications 


A handheld HARP color camera has been pro- 
duced using HARP tubes with a 25-y1m-thick 
target. Its appearance is shown in Figure 4.14. 
Table 4.2 shows major specifications of the camera. 
This three-tube color camera with the target volt- 
age of 2500 V has greater sensitivity than the naked 
human eye and can obtain clear color images even 
in lighting conditions equivalent to moonlight. 
Figure 4.15 shows an example of how the images 
taken with such a camera compare with the images 
taken by a CCD handheld camera under the same 
conditions (subject illumination: 0.3 lux, lens aper- 
ture: F1.7). The dark subject was difficult to view 
with the CCD camera even when its gain was 
boosted by 18dB, whereas the HARP camera was 


Figure 4.14 Appearance of the ultrahigh- 
sensitivity HARP color camera. 


able to produce a clear image. It was confirmed 
by the color camera test that the HARP camera is 
about 100 times as sensitive as the CCD camera. 
It goes without saying that the sensitivity of the 
HARP camera can be decreased by decreasing the 
target voltage, so that the camera is capable of pro- 
ducing excellent picture quality over a wide range 
of shooting conditions from daylight to moonlight. 


(a) 


Figure 4.15 Monitor pictures produced by color 
cameras with the ultrahigh-sensitivity HARP 
tubes and CCDs. Illumination is 0.3 lux and lens 
irises are at F1.7. (a) lmage taken with the HARP 
camera and (b) image taken with a CCD camera 
(+18 dB). 


Table 4.2 Specifications of the ultrahigh-sensitive HARP camera 


Maximum sensitivity 
Minimum scence illumination 
Signal-to-noise ratio 
Resolution 

Amplifier gain selection 
Weight 

Power consumption 


11 lux, F8 

0.03 lux (F1.7,+24dB) 
60dB 

700 TV lines 

0, +9dB, +24dB 

5kg 

25 W 
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In addition, the HARP tube offers other excellent 
features, such as insensitivity to burning, com- 
pared with the Saticon camera tube. It was also 
confirmed that the additional noise produced by 
the avalanche multiplication was negligibly small. 

We have also developed a handheld HDTV 
HARP camera as shown in Figure 4.16. Figure 4.17 


ay. 


Figure 4.16 Appearance of the hand-held HDTV 
HARP camera. 


Figure 4.17 Monitor pictures produced by HDTV 
cameras with HARP tubes and CCDs. (a) Image 
taken with the HDTV HARP (25-um-thick) camera 
and (b) image taken with a HDTV CCD camera 
(+42 dB). 


shows the results of comparing the HDTV HARP 
camera with an HDTV CCD handheld camera 
when viewing a night-time scene (lens aperture: 
F2). Here, the image from the CCD camera has 
been made brighter by boosting the gain by 42 dB. 
However, this has caused a corresponding increase 
in noise, and in the darker parts of the image (e.g., 
the trees in the park) the details are completely 
swamped by noise. On the other hand, the HARP 
camera produces a clear image with low noise. In 
this case, the difference in sensitivity manifests 
itself as a difference in the S/N ratio. 

This camera was developed for use by the 
broadcast. Therefore, the HARP camera was not 
only used for emergency night-time broadcast, 
but was also used in the making of a wide variety 
of programs including subjects such as nocturnal 
animals, the aurora, and the rainbow at night in 
falls. However, the high sensitivity and superior 
picture quality of HARP cameras has also led to 
a considerable amount of interest from scientific 
and medical fields. This section describes how it is 
applied to deep-sea exploration and research into 
X-ray medical diagnosis. 

In oceanic research involving studies of deep-sea 
organisms, mineral resources on the sea floor, and 
so on, underwater TV cameras are an indispensable 
means of gathering information. In particular, the 
TV cameras carried onboard unmanned deep-sea 
vehicles perform an essential role as the eyes of the 
remote control pilot. Consequently, the cameras 
used on such equipment must have superior char- 
acteristics such as highly sensitive and high picture 
quality. Artificial lighting equipment is essential 
for taking pictures deep underwater where no 
sunlight reaches. However, when developing an 
underwater camera it is important to bear in mind 
that light propagates very differently underwater 
because it attenuates much faster than in air. Red 
light attenuates particularly quickly, whereas blue 
light can penetrate somewhat further. This means 
that to obtain images of distant objects it is neces- 
sary to use a highly sensitive imaging device that 
can cope efficiently with blue light. 

Because HARP camera tubes exhibit just such 
characteristics, in 2000, the Japan Agency for 
Marine-Earth Science and Technology developed 
an HDTV HARP camera for deep sea use, which 
was carried on the 3000m class Hyper Dolphin 
unmanned deep-sea vehicle in conjunction with 
NHK. In test dives, this camera obtained clear 
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Figure 4.18 Deep-sea organism captured with 
the HARP camera. 


pictures of deep-sea organisms as shown in Figure 
4.18 and was confirmed to be a highly sensitive 
HDTV camera. 

We are also researching the use of HARP 
cameras in the medical applications. A notable 
example is the potential use of HARP cameras 
in next-generation X-ray medical diagnosis sys- 
tems. This research was done in collaboration with 
other research institutions such as the National 
Cardiovascular Center Research Institute, the 
Tokai University School of Medicine, and the 
High Energy Accelerator Research Organization 
in Japan. 

The X-ray equipment currently used in hospi- 
tals is only able to see large blood vessels with a 
diameter of at least 0.2-0.5mm, but this study 
aims to make it possible to obtain clear images of 
blood vessels that are several times smaller. It has 
been said that if narrow blood vessels with a diam- 
eter of 0.1 mm or less can be imaged, then it should 
be possible to detect cancer earlier and make better 
diagnosis of conditions such as heart disease and 
cerebrovascular disorders. 

For this purpose, it is necessary to have special 
X-rays that are absorbed well by a tiny quantity of 
contrast medium inside the narrow blood vessels 
to be imaged, and a TV camera that can clearly 
reproduce the image formed on a fluorescent 
screen (placed behind the subject being viewed) 
due to this absorption. For the special X-rays, we 
have been using monochromatic X-rays with a 
specific energy obtained from the synchrotron 
radiation. The TV camera is required to have supe- 
rior sensitivity and resolution. This is because the 
image on the fluorescent plate is finely detailed and 
very dark (so as to restrict the exposure of the sub- 
ject to X-rays). 


Neogenesis blood vessels 
(50 um in diameter) 


Figure 4.19 Minute blood vessels (mouse cancer). 


We have, therefore, conducted experiments 
involving the use of an ultrahigh-sensitivity and 
high-resolution HDTV HARP camera in the 
imaging section ofa next-generation X-ray medical 
analysis system. Figure 4.19 shows a photograph 
(obtained using this system) of tiny blood vessels 
called neogenesis blood vessels that developed in 
cancerous parts of a mouse. This image also shows 
narrow blood vessels of a characteristic shape with 
a diameter of about 50pm, which it has not been 
possible to see hitherto. This technology is attract- 
ing interest as an X-ray diagnosis technique that 
can lead to the early detection of cancer. 

In addition, because a HARP target can convert 
X-rays into electrons directly, it should be possible 
to exploit this capability to produce X-ray imag- 
ing devices with unparalleled levels of resolution 
and sensitivity. Consequently, this technology is 
attracting high levels of interest for applications 
such as the early detection of cancer and diagnosis 
of heart disease as a clinical diagnostic study. 


4.1.8 Conclusion 


This section has provided a description of the mech- 
anism of the photoconductive camera tube cover- 
ing past types and the ultrahigh-sensitivity HARP 
camera tube based on a new operating principle. 

In general, ultrahigh-sensitivity imaging devices 
have so far been regarded as special-purpose 
devices, in a separate class to ordinary high picture 
quality imaging devices such as CCDs. But if we can 
develop the ultimate ultrahigh-sensitivity imag- 
ing device—with noise-free internal amplification 
and extremely high gain, and whose fill factor and 
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photoelectric conversion efficiency are 100%—then 
we will have a device with ultrahigh picture qual- 
ity and an S/N ratio close to the theoretical limit. 
Such a device will be able to take clear pictures with 
less noise than any other existing device under all 
lighting conditions. Although HARP camera tubes 
are coming very close to this ultimate goal, we 
would still like to make further improvements to 
the photoelectric conversion efficiency. 
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5.1 Vacuum tube devices 


5.1.1. CRT structure and its operation 
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5.1.1 CRT structure and its operation 


The cathode ray tube (CRT) is a luminescent dis- 
play invented by K. F. Braun in 1897. The display, 
which is inexpensive but can show resolute pic- 
tures on its screen, has been the leading technol- 
ogy in the display field over more than 100years. 
But the heavy weight and long depth are the CRT’s 
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weaknesses. The liquid crystal display (LCD) and 
other new flat panel displays that have no weak- 
nesses as such are expanding their market share in 


recent years. 


The CRT comes in several types—direct view, 
monochrome and projection. Based on the tele- 
vision broadcast specifications, the CRT with the 
screen’s length-height ratio of 4:3 was the most 
common but the CRT with the ratio of 16:9 is now 
popularized. Even the 1:1 ratio CRT is produced for 
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some special customers. The CRT screen was once 
round to intensify the glass strength but a series of 
recent technological innovations has made it possi- 
ble to design more varieties of flat-screen-type CRT. 


5.1.1.1 CRT OPERATION 


Figure 5.1 shows the monochrome CRT structure. 
Video signal is fed to the cathode which is a part 
of the electron gun. The cathode generates free 
electron. The electron is focused by the electron 
gun like light is focused by a lens. Electrons travel 
freely in its evacuated inside, while glass is used 
to “envelope” the CRT. Electro-conductive sub- 
stance is applied to the funnel’s inside to form a 
film that keeps the inside’s electric potential sta- 
ble. The film builds as a high-voltage condenser 
in-between with another film made on the outside 
of the funnel. The condenser stabilizes the supply 
of anode voltage. Emitted by the electron gun and 
accelerated with the voltage of 20-30kvV, electrons 
travel fast in the form of a beam onto the panel 
coated with phosphor. The deflection yoke creates 
a magnetic field that bends the electron beams 
and makes them scan the entire panel. The elec- 
trons hit against the phosphor that emits light. 
Individual components and their function are 
described below. 


5.1.1.2 ELECTRON GUN 


Figure 5.2 shows the cross-section of the electron gun. 
This device is equipped with a cathode and operates 
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Figure 5.1 CRT structure. 


in an electric lens system. The cathode is usually an 
oxide composition of barium (Ba), calcium (Ca) and 
strontium (Sr). Activated by heat at about 800°C, the 
cathode emits electrons. When varying picture signal 
voltage with an amplitude of about 100 V is applied to 
the cathode, the volume of electrons going through 
the grit 1 (G1) changes: the bigger the volume, the 
brighter the picture and vice versa. But to focus the 
dispersed electrons emitted from the cathode in large 
current and to create pictures on the panel, we must 
carefully design the structure and shape of the gun’s 
electrode and the arrangement of applied voltage. 
Using 100V signals, the electron gun can control 
electrons of 30 keV of energy. This function as a noise 
free amplifier is unique and not found in any other 
device of flat panel displays. 


5.1.1.3 DEFLECTION 


An electron beam put out from the gun trav- 
els straight toward the center of the CRT. To let 
it reach all over the screen, we usually use two 
methods—electrostatic deflection and electro- 
magnetic deflection. The former method requires 
two flat-plate electrodes facing each other. The 
beam travels between the two. When electrical 
potential between them varies, the beam changes 
the direction. In spite of its lower deflection effi- 
ciency, the method is quite effective in deflecting 
high frequency. Meanwhile, the latter method of 
electromagnetic deflection characterized by its 
higher deflection efficiency is used for television 
and many other CRTs, and the deflection yoke per- 
forms electromagnetic deflection, using two pair of 
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Figure 5.2 Electron gun cross-section. 
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Figure 5.3 Deflection yoke. 


coils. As a pair of coils facing each other generates 
one magnetic field, the two pairs create two mag- 
netic fields, directing the beam both horizontally 
and vertically. See the structure in Figure 5.3 for 
magnetic deflection yoke. 


5.1.1.4 PHOSPHOR SCREEN 


The inside of the CRT panel is coated with layers 
of phosphor particles. Each particle is 3-10 pm in 
diameter. The aluminium film covers and protects 
the phosphor. Gasses still remaining in the CRT 
would be hit by electrons and become ionized. 
Without the film, the ion would crash against the 
phosphor screen and damage the phosphor. This 
film is also effective in raising CRT luminance 
by reflecting light coming from the phosphor. It 
stabilizes electric potential around the screen as 
well. 


5.1.2 Monochrome CRT 


Monochrome rather than color CRTs are mainly 
used in the medical field where high resolution 
and high brightness pictures are required. The 
electromagnetic focusing method is applied to the 


electron gun to achieve high resolution. Electric 
current running through the coils attached around 
the CRT neck induces magnetic field and works as 
electro-magnetic lens. Being put outside the neck, 
the “lens” has a very small aberration, making the 
beam spot extremely tiny. 


5.1.3 Projection CRT 


The color picture projection CRT is a combination 
of three monochrome CRTs in red, green and blue, 
respectively. Pictures on each of the three single 
CRT panels are expanded through each optical 
lens and projected onto the outside screen where 
the pictures are combined. The system is depicted 
in Figure 5.4. The phosphor screen of the single 
CRT is usually 7-9 in. in diagonal. The size of pro- 
jected pictures is as large as 40-60 in. in diagonal, 
and higher brightness (10,000 cd-m~) and higher 
resolution (the spot size is 0.2 mm in diameter) are 
requisite. If larger electric current is applied to illu- 
minate phosphor, however, the phosphor becomes 
extremely hot; CRT brightness is saturated; and 
phosphor quality degrades. Then, the CRT fails to 
operate. To prevent this problem, the CRT panel is 


116 Vacuum tube and plasma displays 


Red 
Green 


Blue 


| 


CRT 


Figure 5.4 CRT projection system. 


kept cool with coolant, while a kind of phosphor 
strong enough to bear such heavy load is applied. 


5.1.4 Color CRT 


5.1.4.1 PRINCIPLE 


Phosphor in three different colors is applied to pro- 
duce color pictures on the CRT screen. Figure 5.5 
shows the mechanism for exciting phosphor. The 
color selection device (mask) is put in front of the 
phosphor-coated panel. Three pieces of cathode 


Color selection mask 


Figure 5.5 Color CRT structure. 


Optical lens 


Screen 


in the electric gun put out electrons, which travel 
through the mask and hit onto red, green and blue 
phosphor and let it illuminate. 


5.1.4.2 COLOR SELECTION MECHANISM 


Among several types of color selection masks, the 
shadow mask and the aperture grille are widely 
applied to the CRT these days. The former is 
steel plate of 0.2mm in thickness with round-or 
rectangular-shaped holes 0.2-1.0mm apart. The 
mask’s opening must be put in place to let a beam 
from one particular color cathode reach the same 
color phosphor. Hit by electron beams, the mask 
becomes hot and expands and the position of the 
opening moves. Therefore, the material Invar that 
has the very small thermal expansion coefficient is 
sometimes used for the mask. To match more vari- 
eties of flat-faced CRT designed these days, new 
shadow masks are being developed. 

The aperture grille has a shape of a vertical reed 
screen. Both the shadow mask and the aperture 
grille have light transmittance of about 20%. This 
means 80% of the electrons out of the gun do not 
go through the mask. Figure 5.6 shows the aper- 
ture grille system and Figure 5.7 shows the shadow 
mask system. 


5.1.4.3 PHOSPHOR SCREEN 


Slurry that contains phosphor and photo sensi- 
tizer is applied to the panel to create the phosphor 
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Figure 5.6 Aperture grille system. 
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Figure 5.7 Shadow mask system. 


screen. After being dried, the coated slurry is 
exposed to ultraviolet (UV) radiation light emit- 
ted from the lamp. When the panel is washed with 
water, the material except the light-exposed and 
hardened phosphor flows away. After this process 
is repeated three times, red, green and blue phos- 
phor stripes appear; black stripes between each 
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Phosphor stripes 


different color phosphor are also created by a simi- 
lar method. Figure 5.8 illustrates the cross-section 
of the phosphor screen. 


5.1.4.4 ELECTRON GUN 


The modern electron gun for a color picture tube is 
quite complicated compared to the basic structure 
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Figure 5.9 Electron gun for color CRT by Wada 
et al. [6]. 


shown in Figure 5.2. Figure 5.9 depicts an exam- 
ple of the electron gun for color CRT [6]. As one 
gun is equipped with one piece of cathode and its 
electrons are designated to illuminate the corre- 
sponding color phosphor, the color CRT generally 
requires three electron guns to illuminate three 
colors to produce color pictures. 

The trinitron gun is equipped with one large 
electric lens for three electron beams. A large lens 
generally shows better performance than a small 
lens. 


5.1.4.5 DEFLECTION YOKE 


Unlike the monochrome CRT, the color CRT 
needs to deflect three electron beams at once. 
Those beams are required to converge into every 
spot spread all over the phosphor screen in order to 
realize quality pictures. The deflection yoke helps 
distribute needed magnetic field. 
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5.1.4.6 PURITY 


The CRT is designed for electron beams to travel 
through the mask and hit the designated phosphor. 
But this is disturbed when the path is affected by 
terrestrial magnetism. To prevent this problem, 
the CRT is equipped with a magnetic shield. When 
the CRT power switch is turned on, attenuating 
alternating current runs through the coil installed 
around the CRT to produce attenuating magnetic 
field. This “degauss” process is designed to magne- 
tize the shield in the intensity opposite to that of the 
outside magnetic field and to alleviate the influence. 


5.1.5 Contrast 


Several measures are taken to prevent ambient 
light reflection off the surface of the panel and the 
phosphor screen and to maintain good picture 
contrast. For instance, the swath between color 
stripes (dots) is blackened to halve the light reflec- 
tion without blocking phosphor light. 

Glass of lower transmittance is applied to the 
CRT panel. Incident rays come through the panel 
glass, reflect against the phosphor screen and go 
out through the glass again. Glass of low light 
transmittance can reduce light reflected through 
the two passages substantially. Phosphor light trav- 
els through the glass but the passage is only once. 
As the lower the glass transmittance, the better the 
picture contrast, the transmittance rate of about 
50% is the most favourable (see Figure 5.10). 

However, 4% of incoming light is still reflected 
at the surface of the panel. Computer display CRTs 
are equipped with glass whose surface is treated in 
the non-reflection process. 
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Figure 5.10 Light from phosphor emission and ambient light reflection. 


5.1.6 Safety 


Glass is used for the CRT and air in its inside is 
exhausted. Being exposed to the atmospheric pres- 
sure, even a small fault of the CRT could lead to 
a dangerous implosion, dispersing glass pieces all 
around. A metal band is applied around the CRT 
to prevent such accidents. The band that is a little 
smaller in circumference than the CRT reinforces 
the glass strength by canceling out glass stress 
caused by the atmospheric pressure. The band pre- 
vents damage even if the glass is broken. 


5.1.7 Other CRTs 


5.1.7.1 FLAT CRT 


A big challenge for the CRT is how to shorten the 
long depth. As the deflection yoke bends electron 
beams, the CRT depth becomes shorter if the yoke’s 
deflection angle is wider. Most TV CRTs these days 
have the yoke with the angle of 110° and new CRT 
models even wider 120°. But the larger deflection 
angle requires much more deflection power to 
bend the beam more sharply. The magnetic field 
gets distorted and the beams running through it 
end up producing poor pictures. 

Maeda [7] designed a thin monochrome CRT 
(Figure 5.11). So far, no color CRT adopting this 
technology has been sent to the market. 

Unlike the traditional CRT, the field emis- 
sion display (FED) is shorter in depth. It has no 


deflection device. Now, engineers are intensifying 
R&D effort to produce this type of product too. 


5.1.7.2, BEAM INDEX CRT 


Only 20% of electrons are transmitted through the 
color selection mask and the rest are absorbed into 
the mask. Many years ago, the color CRT without 
the mask to raise power efficiency was developed 
and sold in the market. This CRT is designed to 
make one electron beam scan all over the phosphor 
screen. The beam should have a very small beam 
spot size not to strike more than one stripe of phos- 
phor. That stripe stands next to another stripe of 
phosphor, which emits none of the three colors 
but UV as a signal when the beam moves onto it. 
Detecting that signal and finding its own location, 
the beam immediately changes its picture color 
information signal. The problem of this system is 
that the beam spot size becomes bigger when high 
beam current for high luminance is generated. 
CRT engineers want to solve this problem and suc- 
ceed in developing this CRT some day. Their effort 
will continue until that day (see Figure 5.12). 


5.1.8 Recent developments 


Efforts to improve performance and cost reduction 
are continuing. Some of the recent developments 
are described below. 

Okano developed a 21-in. CRT having very 
high resolution. Aperture grille pitch at the screen 
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Figure 5.11 Flat monochrome CRT. 
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Figure 5.12 Beam index CRT. 


center of CRT measures 0.126 mm. Horizontal res- 
olution of 2800 dots was realized [8]. 

Beam index tube is a CRT which has no mask. 
A new idea was proposed by Bergman et al. [9]. 
Primary function of the mask, color selection, is 
taken over by an electronic control system. This 
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CRT (called F!T tube) employs the system which 
has phosphor stripes parallel to electron beam 
scanning lines. Figure 5.13 shows the tracking 
principle of this new beam index tube. 

One of the major problems of CRT is its weight. 
Most of the weight comes from the glass envelope. 
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Figure 5.13 Tracking principle of F!T tube by Bergman et al. [9]. 


Sugawara [10] reduced funnel weight by redesign- 
ing the shape of the funnel shown in Figure 5.14. 

The cathode is another component that needs to 
improve its performance. Oxide cathode is used for 
most CRTs. The current density from oxide cath- 
ode is limited. The new cathode called a hopping 
electron cathode (Figure 5.15) was proposed by van 
der Varrt et al. [11]. It is based on a self-regulation 
secondary emission process enabling transport of 
electrons over insulation surface. The cathode uti- 
lizes this mechanism to compress electrons com- 
ing from a large conventional cathode into a small 
funnel structure of insulating material. The exit 
of the funnel serves as a high-brightness electron 
source for a CRT and can be used to reduce the 
spot size. 


Figure 5.14 Novel funnel shape to reduce its 
weight by Sugawara et al. [10]. 


5.2 PLASMA DISPLAY 


5.2.1 Introduction 


Plasma displays have greatly advanced in the 1990s 
and are getting a position in the mainstream of 
the large area flat panel television and display. The 
road to the development was long and not peaceful 
in 30 years. Both successes of the color moving pic- 
ture presentation on plasma display panels [12,13] 
and production of a 42-in. diagonal PDP [14] 
promise great business growth. The further devel- 
opments of the interlaced and progressive displays 
from 32 to 60in. HDTVs are making a new value- 
added market in addition to the market replac- 
ing the conventional TV of CRT because PDP has 
made a new large area and beautiful displays pos- 
sible and is giving people a new impression. 


5.2.2 Development of color 
plasma displays 


There were two ways researched to present color 
image on PDPs. One was to use visible light caused 
by the discharge. The other was to use visible light 
from excited phosphors by UV rays or electrons in 
discharge. Asa result of these researches, phosphor 
system with UV ray excitation has been applied 
to the recent color PDPs, because the system was 
superior to the other methods due to high lumi- 
nance and color purification. In particular, the gas 
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Figure 5.15 Hopping electron cathode by van der Vaart et al. [11]. 


system of Ne and Xe including 4-5% of Xe contents 
showed the excellent results to achieve high lumi- 
nance and luminous efficiency. Vacuum UV rays 
of 147, 152 and 172 nm are radiated from Xe and 
Xe-dimers as shown in Figure 5.16. 

At the early stage of color plasma display devel- 
opment, both AC and DC PDPs were carried out 


Net 
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to accomplish computer monitors and color televi- 
sion. Most of the color PDP researches to achieve 
color television were DC ones, because color AC 
PDPs had difficulty in attaining long lifetime due 
to the degradation of color phosphors caused by 
the ion sputtering [15]. Because phosphors in DC 
color PDPs were deposited around the anodes and 
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Figure 5.16 Energy level of transition for Ne+Xe mixture gas. 
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then no ion bombardment, color DC PDPs had an 
advantage of longer phosphor lifetime compared to 
that of AC ones at that stage. 

Color DC PDPs for television were investigated 
by the application of negative glow and positive 
column region. Even though the color DC PDPs 
using positive column had higher luminous effi- 
ciency compared to those of negative glow region, 
the luminance and luminous efficiency were still 
not sufficient to be released into the market. 

In 1983, NHK developed 16-in. diagonal color 
DC PDPs whose luminance and luminous efficacy 
were 21 cd-m® and 0.05 Im-W"™ respectively [16]. 
The performance of luminance and luminous effi- 
cacy by NHK was not able to cross the value of 150 
cd-m~ and 0.4 1m-W"! respectively [17]. 

The practical panel structure for a color plasma 
display panel is called a three-electrode surface 
discharge belonging to a kind of AC-type plasma 
display [18]. AC PDP was invented by Bitzer and 
Slottow in 1966 [19] and the monochrome display 
has been put into practical use with opposed dis- 
charge technologies. The color PDP, however, did 
not succeed in practical use although there were 
many researches with the opposed discharge tech- 
nologies. There were evolutional developments 
in structural and operational technologies from 
monochrome PDP to put the color AC PDP into 
practical use. The new direction to develop the 
color PDP was opened by introducing surface dis- 
charge technologies. 

Table 5.1 shows the summary of the develop- 
ment of the color PDP technologies with surface 
discharge. Takashima [20] firstly reported the 
application of surface discharge technology on the 


color PDP for segment-type display in 1973. Dick 
[21] has also reported the surface discharge tech- 
nology of a matrix-type monochrome PDP in 1974. 
The matrix-type color PDP with surface discharge 
technologies has been proposed and followed by 
authors from 1979 [22,23]. From the beginning 
of the research and development, the surface dis- 
charge color AC PDPs showed extremely excellent 
luminous efficacy performance of 0.75, 0.4 and 0.15 
Im W! for green, red and blue phosphors, respec- 
tively [24]. Figure 5.17 shows the comparison 
between opposed discharge and surface discharge 
PDPs. The phosphors were deposited on the sur- 
face of the MgO layers on the opposed electrodes 
as shown in Figure 5.17a and alternated pulses 
were applied between the electrodes to ignite the 
discharge. Then the phosphors exposed to the dis- 
charge resulted in the rapid degradation due to the 
ion bombardment in the discharge. On the other 
hand, the phosphors were deposited on the front 
cover glass substrate placed away from the dis- 
charge area as shown in Figure 5.17b; therefore, the 
surface discharge color PDPs have got an advan- 
tage of long life. All of these surface discharge color 
PDPs employed two electrodes. The two-electrode 
surface discharge system, however, did not succeed 
in developing the practically available color PDPs. 

The research has finally resulted in a new 
structure with the three electrodes as shown in 
Table 5.1 [17,25]. Shinoda invented both essential 
technologies such as the three-electrode struc- 
ture and a new greyscale driving technology, or 
address-, display-period separation method (ADS 
method), which enabled the realization of practi- 
cal color plasma displays. Figure 5.18 summarizes 


Table 5.1 Development of color PDP technologies with surface discharges 


Color technologies history 


Electrode Electrode 
Year Researcher Phosphors Electrode On Type Configuration Ribs 
1973 Takashima Green Two Single Transmitting Segment Glass sheet 
substrate 
1974 Dick Non Two Single — Matrix Stripe 
1979 Shinoda RGB Two Single Transmitting Matrix Non 
1984 Shinoda RGB Three Single Transmitting Matrix Stripe+mesh 
1985 Dick Non Three Double — Matrix 
1987 Shinoda RGB Three Double Reflecting Matrix Stripe+mesh 
1992 Shinoda RGB Three Double Reflecting Matrix Stripe 
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Figure 5.17 Comparison between surface discharge and opposed discharge. Phosphors are depos- 
ited on the front cover glass substrate placed away from the discharge area in the surface discharge. 


Problems to be solved at early stage of color PDP development 
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Figure 5.18 Summary of the technical issues to develop the surface discharge color PDP. 


the technical issues to develop the color PDP, such 
as realizations of color PDP, long operating life, 
high luminance, high resolution and full color 
operation when the research for the surface dis- 
charge color PDP was started in 1979. Developing 
a three-electrode panel structure has solved these 
first four issues. And development of the new driv- 
ing technology has solved the last one. Finally, the 
three-electrode PDP structure with stripe rib and 
phosphor structure has been completed and the 
practical 21-in. diagonal color PDP has been devel- 
oped with these technologies in 1992 as shown in 
Figure 5.19. The larger 42-in. diagonal plasma dis- 
play shown in Figure 5.20 was put onto the market 
in 1996, which started the era of plasma television. 


5.2.3 Essentials of color AC PDP 


5.2.3.1 OPERATING PRINCIPLE OF 
COLOR PDP 


The color PDP is the display using the luminance 
from the combination of the phosphors and gas dis- 
charge. It is possible to think of a color PDP model 
as if some millions of miniature florescent lamps 
are arranged between the glass plates of an area of 
1m’. Figure 5.21 shows the luminance model in the 
each lamp, which is usually called a cell. When the 
voltage is applied to the gas, discharge is ignited 
in each cell making ions and electrons from the 
atoms. The ions and electrons lose energy by emit- 
ting a UV ray. The plasma display is designed to 
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Figure 5.19 Specification of 21-in. diagonal color plasma display. 


Aspect ratio 


Number of pixels 


Number of colors 260,000 
Luminance 180 cd m-2 
Viewing angle >160° 
Power consumption | 100 W,,,. 
Weight 4.8 kg 

Item Performance 
Display area 920 mm x 518 mm 


16:9 


852 (R,G,B) x 480 


Pixel pitch 1.08 mm x 1.08 mm 
Number of colors 16.7 million 
Luminance 350 cd m-2 
Viewing angle >160° 
Power consumption | 300 W, 

(1996) P max 
Weight 18 Kg 

Figure 5.20 A 42-in. diagonal color plasma display. 
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Figure 5.21 The luminance model of each cell in plasma display. 


irradiate UV rays of Xe resonance emission (147 
nm) and Xe molecular emission (173 nm). The irra- 
diated UV rays stimulate the phosphors and visible 
lights are emitted. The three prime color phosphors 


Phosphors 


are arranged in each of the discharge cells that are 
at the cross point of the electrodes. 

To display a color image on PDP, the discharge 
needs to be controlled between the ON and OFF 
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Figure 5.22 The operating principle with the opposed discharge structure without phosphors. 


states in each discharge cell (sub-pixel) that has 
the three prime color phosphors. The operating 
principle is simply explained with the opposed 
discharge structure without phosphors as shown in 
Figure 5.22. The electrodes are arranged orthogo- 
nally on each opposite glass plate and covered by a 
dielectric layer. The dielectric on the front substrate 
is also covered by an MgO protecting layer. The 
plates are assembled with a gap of about 100 pm and 
a Ne+Xe gas system is introduced between them. 
The operating waveform is composed of the 
write pulses, the sustain pulses and the erase 
pulses. The write pulse whose voltage is higher 
than the firing voltage (V,), i-e., a threshold voltage 
to ignite the discharge, is applied to the X electrode 
and then the discharge is ignited. The ions and 
electrons generated by the discharge are absorbed 
to the opposed dielectric surfaces by the electric 
fields. As a result, the internal electrical field of the 
cell is reduced rapidly by these absorbed charges 
and then the discharge is stopped. Therefore, the 
discharge has a pulse shape. The charge deposited 
on the dielectric is called a wall charge and the 
voltage through the capacitance between the sur- 
face of the dielectric and the electrode under the 
dielectric is called a dielectric voltage. The voltage 
due to the wall charge across the gas is called a wall 
voltage and the voltage across the gas due to both 
the wall voltage and externally applied voltage is 
called a cell voltage. The width of the write pulse 
is adjusted to accumulate a sufficient wall charge 
on the dielectrics. The sustain pulses are applied 
on Y electrodes successive to the write pulse. The 


polarity of the voltage is reversed to the write pulse 
and the voltage is superimposed on the formerly 
accumulated wall voltage and then the cell voltage 
can exceed the firing voltage to ignite discharge, 
although the applied sustain voltage itself is lower 
than the firing voltage. The discharge also has a 
pulse shape similar to the discharge by the write 
pulse. In the same manner the successive sustain 
pulses sustain the discharge state and make display. 
The luminance of the display is proportional to the 
number of the sustain pulses and from 30,000 to 
50,000 pulses per second are usually applied for 
sustaining. The narrower width than the sustain 
pulse is applied to erase the discharge. Although 
the discharge is ignited, the wall charge cannot be 
accumulated on the dielectric because of the nar- 
row width of the erase pulse. The successive sustain 
pulses cannot sustain the discharge because the 
cell voltage is lower than the firing voltage. 

In this manner the discharge state (ON state) 
and the non-discharge state (OFF state) are main- 
tained. This phenomenon is called the memory 
effect of AC PDP. 


5.2.3.2 FEATURES OF COLOR AC PLASMA 
DISPLAY PANEL 


The developed color AC PDP has all the advantages 
of conventional monochrome AC PDP. 

The following are the advantages: (1) good non- 
linearity, (2) memory function, (3) high addressing 
speed, (4) wide viewing angle, (5) high luminance 
for large area and large display capacity, (6) high 
contrast ratio, (7) high greyscale, (8) full color, 
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(9) digital display, (10) flat panel, (11) simple struc- 
ture and (12) large area. 

The good nonlinearity means that the relation- 
ship between an applied voltage and the lumi- 
nance has a clear nonlinearity. That is, when the 
applied voltage is increased, the discharge current 
is increased rapidly and then bright illumination 
begins at a certain voltage, and when the applied 
voltage is decreased, the discharge current decreases 
rapidly at a certain voltage and then illumination is 
eliminated. As the luminance levels are quite dif- 
ferent between ON and OFF states, the high quality 
display with high contrast ratio is possible. 

The memory effect is the function to maintain 
the ON and OFF states on the panel and is a great 
advantage for realizing a large area or a large dis- 
play capacity. For example, in the case of the CRT 
without memory effect, the luminance decreases 
by increasing the display size or the display capac- 
ity because the electron beam stimulating the 
phosphors stays for a short duration at the phos- 
phor surface of a display spot. In contrast, as all of 
the cells can be illuminated at the same time when 
the common sustain pulses are applied to the elec- 
trodes in the AC PDP, the high luminance level can 
be kept not depending on the display size or the 
display capacity. 

The high speed addressing is due to the gas 
device. As the discharge is finished within 1 ps and 
the wall charge is accumulated within 2 1s after the 
pulse is applied, the data input is possible within 
2 ps. There is a report that the data input is pos- 
sible within 1.5 ps for one scan line. As a result, 
the PDP can display a beautiful moving image with 
16.8 million colors. 

The PDP is expected to play an important role 
in the future digital society because it is essentially 
a digital device. The digital ON and OFF states of 
the discharge cell are easily controlled by the digi- 
tal signals of the computer and the network. 


5.2.3.3 PRINCIPLE OF THREE-ELECTRODE 
PLASMA DISPLAY PANEL 


Although the color PDP has been researched 
since the end of the 1960s when the AC PDP was 
invented, it was not successful with the two-elec- 
trode structure because of the phosphor and MgO 
degradation and a difficulty in the operation [26]. 
One of the key breakthroughs that solved the issues 
was the introduction of the surface discharge and 
three-electrode structure. Figure 5.23 shows the 
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Figure 5.23 The principal electrode configura- 
tion of three electrode PDP. 


principal electrode configuration and operation. 
There are three kinds of electrodes, such as two 
display electrodes (sustain and scan electrodes) 
and an address electrode. The sustain electrodes 
are connected to each other with a common elec- 
trode and the scan electrodes are independent. 
And then the one electrode terminal is added to 
the conventional two-electrode system. There are 
two kinds of operation method. One is write-in 
operation, which means the discharges are ignited 
to make wall charges between the selected address 
electrodes and a scan electrode for input display 
data and then sustain the discharge in the cells to 
be displayed by applying the sustain pulses between 
the display electrodes. Another one is erase opera- 
tion, which means discharges are ignited to make 
the wall charges in all of the cells between display 
electrodes, scan and sustain electrodes, along a dis- 
play line at once and then ignite discharges between 
the address electrodes and a scan electrode to erase 
the wall charges resulting in elimination of the dis- 
charges in the cells not to be displayed. 


5.2.4 Practical panel structure and 
fabrication process 


The first successful full color PDP has a diagonal 
size of 21in. and the fabrication processes were 
completed while developing the panel. The new 
simple panel structure with a shape of stripe ribs 
and phosphors was developed to realize the fine 
pixel pitch (0.66 x 0.22 mm) for high resolution and 
was the most suited structure for mass production. 
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Figure 5.24 The practical panel structure of the 
three-electrode surface discharge PDP with the 
stripe structure. 


Figure 5.24 shows the practical panel structure 
which is called the three-electrode surface dis- 
charge PDP with the stripe structure [27]. Paired 
parallel display electrodes, sustain electrode and 
scan electrode, are formed on the front glass sub- 
strate. Each display electrode is composed of a 
transparent ITO (indium-tin oxide) and a narrow 
bus electrode of multi-layered Cr, Cu and Cr, to 
emit a luminance effectively through the transpar- 
ent electrode and reduce the electrode resistance. 
These electrodes are covered by a dielectric layer, 
which is made of low melting point glass materi- 
als. These are also covered with a thin MgO layer. 
On the other rear substrate, the striped address 
electrodes are arranged. Striped barrier ribs are on 
both sides of the address electrodes to separate the 


adjacent discharge cells and to eliminate the opti- 
cal cross talk between them. Three primary color 
phosphor materials for red, blue and green colors 
are deposited in the neighboring channels made by 
the ribs to cover both on the side walls of the ribs 
and on the dielectric layer. The structure has real- 
ized good performances such as a high luminance, 
a high luminous efficacy and a wide viewing angle. 
Phosphor materials are BaMgAl,,O,,:Eu for blue, 
(YxGa)BO,:Eu for red and Zn,SiO,:Mn for green. 
The substrates are assembled to each other with 
about 150m gap. A Ne+ Xe gas mixture is intro- 
duced between the substrates. The panel structure 
developed for the 21-in. diagonal color PDP is the 
simplest one of conventionally researched color 
PDPs. And the fabrication process is also sim- 
ple enough for mass production. So the PDP has 
advantages such as a low cost process, and an easi- 
ness in the manufacture of the large area panels 
and the high resolution panels. 

The essential fabrication process as shown in 
Figure 5.25 is also completed to develop the 21-in. 
PDP. The transparent conductive ITO film is made 
on the front glass. The plural paired display elec- 
trodes are formed by a photolithography technology. 
The metal electrode film ofa Cr/Cu/Cr multi-layer is 
sputtered on these transparent electrodes. The bus 
electrode is also formed by a photolithography tech- 
nology. These electrodes are covered with a frit glass 
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Figure 5.25 The essential fabrication process of color plasma display panel. 
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layer with a printing technology and then heated at 
about 600°C to make a transparent dielectric layer. 
The seal glass layer with a width of about 3mm is 
made on the outside of the display area and then 
pre-heated. An MgO protecting layer is evaporated 
on the dielectric layer over the display area of inside 
of the seal layer. The front plate is completed with 
these processes. 

A small hole of a diameter of about 1mm is 
made on a corner of the rear plate. The Ag address 
electrodes are printed and heated. The frit glass is 
printed on the electrodes in the display area and 
then heated at about 600°C to make a dielectric 
layer. The barrier ribs are made by sandblasting the 
frit glass on both sides of the address electrode. The 
red, blue and green phosphors are printed inside 
the channel between the barrier ribs. Each color 
phosphor is printed at a same time and then the 
printing is repeated three times and then dried. 
The rear plate is completed with these processes. 

Both plates are assembled and fixed with clips. 
The assembled plates are heated to melt the seal 
layer and the plates are glued resulting in a panel. 
At the next gas filling process, the panel is con- 
nected to an evacuation/gas-filling system through 
the evacuating tube. After the baking, the dis- 
charge gas is introduced. Finally, the PDP is com- 
pleted after melting off the evacuating tube. The 
driving pulse is applied to the panel and discharges 
are ignited in every discharge cell to reduce and 
make the operating voltage stable. 
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5.2.5 Improvement of the cell 
structure 


Fujitsu. Hitachi Plasma Display (FHP) had 
reported on the alternate lighting of surfaces 
(ALIS) technology as shown in Figure 5.26 [28]. 
The ALIS method does not have a non-luminous 
area. Discharge takes place between adjacent 
display electrodes, instead of scan and sustain 
electrode pairs one by one. This system, how- 
ever, does not permit line progressive scanning 
by upper and lower discharge cells with the use 
of shared cells. This interlaced scanning can 
be operated by the drive circuits, which are the 
same as those in existing PDPs with 480 scan- 
ning lines. It does not require special high speed 
addressing technology or dual scanning, which 
requires twice the usual number of ICs. It is pos- 
sible to double its definition by using the same 
manufacturing and driving technology as that 
for the conventional method. In other words, 
they can apply conventional methods to turn a 
VGA color PDP into an SXGA PDP. 

The ALIS system itself does not improve lumi- 
nous efficacy. However, the adoption of the method 
makes a non-luminous area unnecessary and 
raises the aperture rate to 65%. As a result, ALIS 
improved the luminance by 150% of that of the con- 
ventional method. By adopting the ALIS system, 
FHP made a 32-in. color PDP with 852 x 1024 pix- 
els capable of displaying high definition pictures. 
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Figure 5.26 The alternate lighting of surfaces (ALIS) technology developed by FHP. 


130 Vacuum tube and plasma displays 


Method Alternate 
Display size 

Number of cells 
Sub-pixel pitch 

Colors 

Luminance 


Contrast ratio 


Power consumption 


Figure 5.27 A 42-in. HDTV with ALIS technologies. 
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Figure 5.28 Three-electrode surface discharge plasma display with waffle rib and T-shaped electrode 


structure developed by Pioneer. 


Figure 5.27 shows the 42-in. diagonal HDTV with 
ALIS technologies. 

Pioneer devised PDPs with a high lumi- 
nous efficacy and a high contrast ratio by adopt- 
ing a T-shaped electrode structure as shown in 
Figure 5.28 [29]. The T-shaped electrode structure 
produces a favorable effect on luminous efficacy 
and contrast ratio. 

In the conventional system, the ribs on the 
rear plate side, to which the RGB phosphors are 
applied, adopted a stripe structure. The waffle 
structure arranges these ribs in parallel crosses 
[30]. The waffle rib structure eliminates light leaks 
vertically, to reproduce sharp image contours. At 
the same time, the arrangement can widen the 
per cell area of applied phosphors. The adoption 


of the T-shaped electrode structure and the waffle 
rib structure enables a luminance of 560 cd m”, 
which is 60% higher than that of the conventional 
system, raises luminous efficacy by 40% and pro- 
duces a color PDP with optimum high resolution. 


5.2.6 Gradation 


The display is performed by controlling the two 
states, ON and OFF, in AC PDP. Changing the 
number of sustain pulses changes the luminance. 
The wall charge especially plays an important role 
to control the ON and OFF states in AC PDP and a 
sufficiently wide pulse width is needed to get a suf- 
ficiently wide operating margin. The ADS method 
is adopted for operating the AC PDP to meet the 
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Figure 5.29 The ADS method, address-, display-period separation method, adopted for operating 
the AC PDP to realize an HDTV image with 256 greyscales. 


requirement and realizes an HDTV image with 256 
greyscales. Figure 5.29 shows the ADS method [31]. 
An image is constructed with 60 fields and each field 
is divided into eight sub-fields. Each of the eight sub- 
fields has different luminous level and the example 
of the luminance ratio for the eight sub-fields is 
1:2:4:8:16:32:64:128 to realize the 256 greyscales. 
The luminance level is determined by setting the 
number of sustain pulses for each sub-field. The 256 
greyscales are realized with the combination of the 
sub-fields in which the ON and OFF states are con- 
trolled by depending on the display data. 

Each sub-field is divided into two periods, such 
as address period and display period, as shown in 
Figure 5.30. In the address period the discharge is 
ignited between selected address electrodes and a 
scan electrode sequentially depending on the dis- 
play data. And then the wall charges are formed in 
the selected cells all over the display area and the 
successive sustain pulses are applied in the display 
period between all display electrodes. The width 
of the address pulse is less than 21s, and then the 
addressing speed is sufficiently high to realize the 
256 greyscales for HDTV format. 

The address period is further divided into 
the reset step and address step. The reset step is 


important for operating AC PDP. The wall charges 
accumulated in the previous display period are 
eliminated and then the pre-condition is made 
to ignite the discharge stable in the address step. 
Although the display period and address step 
are indispensably important, the reset step is 


Y 
AY 
at 
Setup Address 
step step 
Address __ period Display 
period 


Figure 5.30 Detailed waveform for ADS method. 
Each sub-field is divided into two periods, such 
as address period and display period. 
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also important for stable addressing, high speed 
addressing and controlling the contrast. 

The ADS method has advantages, such as an 
easiness in setting the width of sustain pulses, the 
number of sustain pulses, low power consump- 
tion, controllability of the greyscale and stable 
operation. 

The ADS has been improving to realize a high 
quality display image and stable operation. 


5.2.7 Improvement of drive systems 


Characteristically, many color PDPs now adopt 
software-based improvements in addition to 
the hardware-based improvements such as the 
improvement of cell structures. 

False contour issue was one of the essential 
issues to degrade the image quality when PDP 
adopts a driving method for greyscale using the 
luminous combination of different luminance 
sub-fields as shown in Figure 5.29. They have 
always been one of the most serious problems that 
degraded the display image on PDPs. To diminish 
the false contours, engineers previously resorted 
to methods that made false contours as invisible 
as possible. There are many methods to reduce the 
false contours, such as error diffusion, dithering, 
duplicated sub-field method, and so on [32]. With 
the combination of the methods, the false contour 
issue is improved as an acceptable level for the 
television application. Figure 5.31 shows the dupli- 
cated sub-field method. 

Pioneer achieved a high contrast ratio of 
560:1 and solved the problem of false contours 
by adopting a new ADS sub-field method called 
the Hi-Contrast and Low Energy Address and 
Reduction of False Contour Sequence (CLEAR) 
system, to achieve a Hi-Definition progressive dis- 
play with excellent picture quality [33]. 

The CLEAR method resolved this problem by 
preventing, in principle, the generation of false con- 
tours by using the luminance accumulation of dif- 
ferent luminance sub-fields as shown in Figure 5.32. 
Although one TV field is divided into sub-fields, 
which have different luminance in each the same 
as the conventional one, the reset period is only 
once. The discharges are ignited at once in all cells 
of the panel and form wall charges in the reset step. 
The wall charges are erased at the address period in 
the selected sub-fields according to the display data 
and then eliminate the sustaining discharge in the 
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Figure 5.31 Suppression of the dynamic false 
contour by the duplicated sub-field method. 


display period of the sub-field. So the luminance of 
unselected cells is accumulated from the first sub- 
field to the sub-field just advanced to the sub-field 
in which the erase pulses are applied. When the 
luminance is gradually varied, the light emission 
pattern of the sub-field does not change largely as 
in the case of the conventional sub-field method. 
Then the false contour issue is essentially solved. 
If the principle described above is applied simply, 
the grey levels are insufficient to display a beauti- 
ful image. To get more grey levels, the number of 
sustain pulses of the sub-field is changed in each 
TV field. When a TV field is composed of the m 
sub-fields and the sustain pulse number is changed 
between the n TV fields, an [mxn-+]] -step greyscale 
is obtained. And dither method and error diffusion 
method are also applied; then, the CLEAR system 
yields a color PDP with the same gamma character- 
istic as that of CRTs. Consequently, a display capa- 
bility of over 256-step grey levels for each RGB cell 
can be realized. 


5.2.8 Future prospects of color PDP 


One of the latest innovative developments is the 
delta arrangement PDP with meander barrier 
ribs reported by Fujitsu Labs [34]. This improves 
luminance and luminous efficacy by increasing the 
discharge cell size and also the area of phosphor 
application as shown in Figure 5.33. The luminance 


References 133 


Relative luminance for each subfield 
is determined by the factor 2” 


1:2: 4:8: 16: 32: 64: 128. 
Low 


Luminance 


af oe ll ee 
a 
LC] 
—= 
= 


= 
ga 
=m 


Luminance 


Input 


Conventional methods 


On-subfield 
MM Off-subfield a 


Relative luminance for each 
subfield is determined freely to 
improve greyscale 


Luminance 


a 
3 
oO 


Luminance 


Input 


Clear driving 


Figure 5.32 The CLEAR method resolved the problem by preventing, in principle, the generation of 
false contours by using the luminance accumulation of different luminance sub-fields. 


and luminous efficacy of this structure are about 
double those of the conventional method. 

The next approach to improving performance 
was the idea of raising the concentration of Xe in 
the conventional rate of 4 or 5%. It is well known 
that this method did not attract much atten- 
tion, because it did not solve the problem of the 
extra cost for the driver ICs that must handle the 
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Figure 5.33 Delta arrangement PDP with 
meander barrier ribs. This improves luminance 
and luminous efficacy by increasing the discharge 
cell size and also the area of phosphor. 


additional driving voltage. Nevertheless, it cer- 
tainly was effective in improving the luminous 
efficacy and luminance of PDPs. 
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OBJECTIVE 


Over the past half century, liquid crystal displays 
(LCDs) have grown to be one of the most suc- 
cessful optoelectronic technologies, becoming an 
integral part of communication devices and often 
an enabling technology. This success has required 
many adaptions to meet the requirements of ever- 
increasing complexity and performance. Indeed, 
it is the adaptability of liquid crystal devices that 
grounds their success. After outlining the basic 
physics of liquid crystals and device construction, 
the various modes used in commercial displays 
will be reviewed, both for mainstream and niche 
markets, together with a summary of the impor- 
tant complimentary technologies. Finally, liquid 
crystal devices that have promise for future appli- 
cations in optoelectronics will be discussed. 


6.1 INTRODUCTION 


On May 28, 1968, the cinema audience at the Loews 
Capitol Movie Theatre on Broadway, New York, 
USA, would have been astounded by the vision of 
the future presented to them in Stanley Kubrick’s 
masterpiece 2001: A Space Odyssey [1]. Among 
the technological advances envisaged, from an 
International Space Station, video telephony to the 
omnipresence of computers, the viewers will have 
been no less captivated by the tablet computers that 
astronauts Dave Bowman (Keir Dullea) and Frank 
Poole (Gary Lockwood) apprise themselves with 
from the BBC12 podcast, after being awakened 
from hibernation. These devices had flat panel dis- 
plays that are able to present full color, video infor- 
mation, something that would be as unfamiliar to 
the audience in 1968, as it is familiar to us now. Just 
one block away, at the Headquarters of RCA in the 
Rockefeller Plaza on 6th Avenue, George Heilmeier 
was at a press conference [2] to announce his 
recently published patent of the technology that 
would eventually lead to Kubrick’s vision becom- 
ing reality: the first practical LCD [3,4]. 

Although the first device utilizing liquid crys- 
tal electro-optical effects dates back to 1934 [5], 
the era of LCD research had begun in earnest with 
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the groundbreaking inventions at RCA in the mid- 
1960s [2,6,7]. This work was driven by RCA CEO 
David Sarnoff’s dream for “hang on the wall” 
television displays [6] and inspired by the work 
of George Gray [8] of Hull University in the UK, 
Glenn Brown at Kent State University in the US 
[9], and Richard Williams [10,11] at RCA’s Sarnoff 
Laboratories in Princeton. LCDs offered both 
reflective mode and backlit transmissive modes, in 
displays with a simple construction, light weight, 
and flat form factor. Display power consumption 
and operating voltages were attractively low and 
suited for being driven by Complementary metal- 
oxide-semiconductor (CMOS) circuitry. 

Soon after Heilmeier’s invention of the Dynamic 
Scattering Mode LCD [3,4], RCA terminated their 
LCD investment but the seed had been sown. 
Research groups from around the world, notably 
from Europe [12] and Japan [7,13], entered the field, 
stimulated by the RCA Press announcement. Having 
teamed up with Martin Schadt at Hoffman La 
Roche in Switzerland, ex-RCA researcher Wolfgang 
Helfrich invented the twisted nematic (TN) LCD 
in 1970 [14], a device with a more attractive opti- 
cal appearance than its Dynamic Scattering Mode 
predecessor and a design that was to become the 
mainstay of flat panel displays for the following 
three decades. In the UK, Gray’s group had formu- 
lated the first stable room temperature liquid crystal 
compounds [15] allowing mixtures with operat- 
ing temperatures below zero degrees centigrade 
[16] to be formulated by Peter Raynes at the Royal 
Signal and Radar Establishment in Malvern, UK. In 
Japan, Tomio Wada at Sharp led a joint collaboration 
with Dainippon Ink and Chemicals that launched 
the world’s first commercial LCD, incorporated in 
Sharp’s electronic calculator EL-805 in early 1973. 

The promise of flat-screen, low-power, high- 
information content displays that met Sarnoff’s 
goal for television displays seemed imminent [17]. 
Developments in the fabrication of LCD devices 
[7,12,13] led to early success in applications such 
as wristwatches and pocket calculators. However, 
it would take a further three decades of invest- 
ment, invention, and development before the LCDs 
replaced the dominant cathode ray tube (CRT) 
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displays used in televisions. Today’s state-of-the- 
art ultrahigh-resolution LCDs for Ultra-high 
Definition Television (UHDTV) combines tech- 
nologies such as thin-film transistors behind each 
color subpixel, optical compensation layers to give 
the widest angle of view, new alignment modes 
and addressing methods to achieve 120Hz frame 
rates, liquid crystal mixtures that can operate from 
—20°C to +80°C to produce displays from several 
millimeters to over 100” diagonal, at a price afford- 
able for mass market adoption. Although it took 
this period to achieve Sarnoff’s goal, the road map 
includes many consumer products that have been 
enabled by the emerging LCD: flat screen desktop 
monitors, laptop computers, mobile phones, and 
tablets. LCDs are the purveyors of the Internet age. 

There is a plethora of different LCD types, each 
with properties that have been optimized for dif- 
ferent markets. After reviewing some basics of 
liquid crystal science and display construction, a 
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selection of typical LCD modes is described in this 
chapter, together with some of the more esoteric 
devices that meet the needs of niche markets or are 
yet to achieve commercial success. 


6.2 LIQUID CRYSTALS AND THEIR 
DISPLAY RELEVANT PHYSICAL 
PROPERTIES 


6.2.1 Orientational order 


Although discovered by Friedrich Reinizer in 1888 
[18], the true nature of liquid crystals, and the oxy- 
moronic term liquid crystal itself, was described a 
year later by Otto Lehmann [19,20]. The type of liq- 
uid crystal used in all devices to date is formed from 
rigid rod-like (calamitic) organic compounds, such 
as those shown in Figure 6.1. There are a variety of 
liquid crystal phases that may appear between the 
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Figure 6.1 Some liquid crystals basics: (a) Schematic representation of (a) the nematic phase and the n 
director; (b) the smectic A phase; and (c) the smectic C phase. The principal axes for (d) the cylindrical 
nematic and smectic A phases and (e) the monoclinic smectic C. The spontaneous twist of the chiral 
nematic or cholesteric and definition of the helical pitch length P are shown in (f), and the spontane- 
ous twist/bend of the chiral smectic C and ferroelectric spontaneous polarisation P, are shown in (g). 
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isotropic liquid and solid crystal of certain organic 
compounds. Another, less common expression for 
liquid crystals is the term mesogenic, with mol- 
ecules capable of forming liquid crystals being 
called mesogens, and the various phases termed 
mesophases. The simplest of the liquid crystal 
phase is the nematic (denoted N for shorthand, 
Figure 6.1a), which has no positional order, as with 
a conventional isotropic liquid, but the liquid crys- 
tal phase has long-range orientational order of the 
long molecular axes. The molecular axes tend to 
orientate in a common direction called the direc- 
tor, represented by the unit vector n. This orien- 
tational order leads to crystal-like anisotropy of 
physical properties such as the refractive indices, 
dielectric constants, and conductivities. However, 
the lack of positional order means that the nematic 
is fluid and can flow when perturbed. The statisti- 
cal distribution of the molecular axes is quantified 
by order parameters. For a cylindrically symmet- 
ric phase, such as the nematic, which is composed 
of cylindrical molecules, the degree of order is 
defined using the S order parameter: 


$ == (3c0s*E-1), (6.1) 


where the brackets <> represent the statistical 
average over the coherence volume of the material 
and the deviation from the director of an individ- 
ual long molecular axis is §. The anisotropic physi- 
cal properties of the liquid crystal are related to S, 
which varies with temperature with the approxi- 


mate form [21]: 
s-(-2] , 
Tu 


where v is a material constant, typically 0.15 < v 
< 0.2, Ty, is the nematic to isotropic transition 
temperature, and T is the temperature below Ty. 
Above Ty, S = 0 by definition. This expression 
neglects the weakly first-order nature of the nem- 
atic to isotropic phase transition, which jumps 
immediately from 0 to about S = 0.43 at Ty, in actu- 
ality. However, Equation 6.2 is satisfactorily a cou- 
ple of degree celsius below Ty, predicting typical 
nematic materials to have S ~ 0.60-0.80 for typical 
nematic material at ambient temperatures. 


(6.2) 


A phial containing a nematic liquid crystal is 
milky in appearance. Unlike the common colloidal 
fluids with this type of behavior (such as milk itself), 
this appearance occurs for a pure, single compound. 
It arises because, although the liquid crystal mol- 
ecules tend to point in the same direction locally, 
this direction changes over micron length scales 
due to thermal fluctuations, causing strong Tyndall 
scattering of the light. When heated to the isotropic 
liquid, the scattering disappears suddenly and the 
liquid is clear; for this reason, the temperature of the 
liquid crystal to isotropic transition (for example, 
Ty) is often termed the clearing point. 

Almost all LCDs sold are based on the simple 
nematic type of liquid crystal because it is the easi- 
est to handle and most well understood. However, 
there are many other phases that can form that are 
intermediate between the liquid and crystal states, 
from those with higher degrees of order (lay- 
ered smectics, helical cholesterics, and columnar 
phases), those using different shaped molecules 
(discs, banana-shaped, bowl-shaped, main-chain, 
and side-chain polymeric) to those systems where 
the liquid crystal nature is due to changes in con- 
centration, rather than, or in addition to, tem- 
perature (lyotropic and chromonic phases). The 
interested reader is referred to Reference [22] for 
a comprehensive review on all aspects of liquid 
crystal science. For the present chapter, only the 
smectic and cholesteric phases are considered 
because of their (limited) application to displays. 
Smectic liquid crystal phases have some degree of 
positional order in addition to orientational order. 
The simplest smectics are termed smectic A and C 
(for which the shorthand SmA and SmC is used, 
respectively) and they exhibit one-dimensional 
positional order to form layers of nematic-like 
material. The layers of such phases are described by 
the unit vector a. SmA have n parallel to a (Figure 
6.1b), whereas the director n is at an angle 0, to 
a in the SmC phase (Figure 6.1c). Both the nem- 
atic and SmA phases have cylindrical symmetry 
about n (Figure 6.1d), whereas the SmC has mono- 
clinic symmetry. Nematics comprised, at least in 
part, of chiral molecules undergo a spontaneous 
twist of the director to form a helical structure 
(Figure 6.1f). Chiral nematics, represented by N*, 
are often called cholesterics after the first liquid 
crystal behavior was observed for this phase in a 
compound derived from cholesteryl benzoate [18]. 
A full 2x rotation of the n director is defined as the 
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pitch length, P, which may vary from submicron 
in some pure compounds, to several hundreds of 
microns in a nematic material doped with a small 
concentration of a cholesteric dopant. In the chi- 
ral smectic C (SmC*) phase (Figure 1g), the loss 
of mirror symmetry induced through the addi- 
tion of chirality has two notable effects. First, the n 
director undergoes a spontaneous twist and bend 
deformation (so that the c director forms a twist in 
a similar way to the cholesteric), again with pitch P. 
Second, the combination of the loss of the mirror 
plane with the monoclinic symmetry of the SmC 
leads to a spontaneous ferroelectric polarization 
P, in the plane of the layers and normal to the tilt 
plane [23]. As the c-director precesses about the 
layer normal in a helical fashion, the net ferroelec- 
tric polarization cancels out throughout the bulk 
of the phase because the c-director precesses about 
the layer normal in a helical fashion. 

Liquid crystal phases are formed for molecules 
that comprise both a rigid core structure and pen- 
dant flexible hydrocarbon chains. The occurrence, 
temperatures, and phase sequences of liquid crys- 
tals depend on the relative proportion of the flex- 
ible chains and rigid core: the flexible groups are 
needed to space the cores sufficiently to prevent 


crystallization but sufficiently small to allow ori- 
entational ordering of the cores. Predicting the 
temperature range over which liquid crystal 
phases form is a complex subject, requiring skill 
and experience for the synthetic chemist. For the 
simple rod-shaped calamitic molecules used in 
commercial displays, the transition from isotro- 
pic to liquid crystal phase is related to the length 
of the molecule. However, viscosity is also depen- 
dent on molecular size and so most practical liquid 
crystal compounds are formed from two or three 
ring structures to keep viscosity low and switching 
speeds high. The ring structures in the core may 
be benzene, cyclohexane, pyrimidine, thiophene, 
etc., joined together with parasubstitution (e.g., 
1,4 benzene) or with near-linear linking groups 
(such as esters, ethyl groups), and terminated on 
one or both sides with alkyl chains. In each case, 
the molecular core retains its near-cylindrical 
symmetry with each of any low-energy conform- 
ers allowed by the core, such as rotations about the 
ester or ethyl linkages. Table 6.1 lists a range of dif- 
ferent compounds that are typical for nematic liq- 
uid crystal mixtures, together with some of the key 
physical parameters [24] related to typical device 
behavior. 


Table 6.1 Typical nematic liquid crystal compounds and their physical properties 


An Ae Kyi: 
(20°C, (20°C, Kp0; ¥, (Pa-s, 
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6.2.2 General anisotropic physical 
properties 


All LCDs operate because of the combination 
of the anisotropic physical properties combined 
with fluidity exhibited by liquid crystals. Tensorial 
anisotropic physical properties such as the refrac- 
tive indices and electric permittivities can be 
diagonalized to give principal values parallel (im) 
and perpendicular (,) to the director [25]. Most 
commercially available devices use reorientation 
of the director induced by an applied electric field 
E coupling to the dielectric tensor e, to reduce the 
electrostatic free energy G,, given by 


Gp =-5DE =—5e (£.e.E) =— 5 erde(n.B)” ; 


(6.3) 


where the dielectric anisotropy Ae is the dif- 
ference of the principal components €—€,. 
Equation 6.3 predicts that the director tends to 
reorient parallel to the applied field if the material 
has a positive Ae and perpendicular to the field 
if Ae is negative. Similarly, many devices rely on 
changes in optical retardation from the mate- 
rial birefringence An (= n)—n, = n,—1,), where 
e and o refer to the extraordinary and ordinary 
rays, respectively, to give the perceived opti- 
cal modulation. For example, if viewed between 
crossed polarizers, a device will appear isotropic, 
and therefore black, when viewed along the optic 
axis (parallel to n,) and will exhibit some white 
light transmission when viewed in a direction at 
an angle to the optic axis due to the effect of the 
optical retardation. 

The physical parameters depend on both 
chemical structure and temperature. Anisotropic 
properties, such as An and Ae, are also related to 
the order parameter S, whereas the fluid proper- 
ties, such as density, bulk viscosity, and the mean 
refractive indices and permittivities, are primar- 
ily related to absolute temperature. The uniaxial 
refractive indices are given by [25] 


ie 14 PME fa as ; 
M£&o 3 
(6.4) 
me=l+ PNAbF {a+ Z Aas}, 
Me£o 3 


where p is the density, N, is the Avogadro number, 
M is the molecular weight, & is the mean molecu- 
lar polarizability, and Aw is the difference between 
the molecular polarizability parallel to the long a, 
and short 0, axes. The cavity field factor h and the 
Onsager reaction field F relate to the internal field 
experienced by an average molecule, approximated 
for a spherical cavity by 


ae es es 1 - 
2+1 = ONE + 
3 M-€\2€+1 


(6.5) 
where mean anisotropic properties are expressed 


using a bar above the symbol, such as for the mean 
polarizability on the molecular scale: 


(6.6) 


and permittivity and refractive index on the mac- 
roscopic scale: 


— 1 1 
€=—(2e€, +€));n=—(2n, +n 6.7 
3(28. +e))s7=3(2m, +n.) 6.7) 
The birefringence is given by 
ig DAE | = \s (6.8) 
Me (3n-n 


and is largely dictated by the polarizability anisot- 
ropy Aa of the constituent molecules. The density 
p is usually close to that of water at 20°C, with a 
selection of nematic compounds p = 990 + 50 
kgm [26], and with a near linear temperature 
dependence through the liquid crystal (LC) phases 
that is typically 1 kgm~3-K"!. Thus, the temperature 
dependence of the birefringence is dominated by 
the order parameter S. A calamitic liquid crystal 
has a positive birefringence because Aa is positive 
for the cylindrical molecular core structures. If the 
rigid core is short and comprises weakly polarizable 
saturated moieties such as cyclohexanes, the bire- 
fringence will be low. It will be higher for longer, 
unsaturated groups such as phenyls and ethynes, 
reaching An = 0.4, but more typically being maxi- 
mum at An = 0.26 for light stable compounds. 
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Similarly, the uniaxial electric permittivities are 
given by [25] 


NAF? 1 
e, =ni+ PA {1+ 1—3cos” s|, 
eer 2{1-3e0s"B) 
(6.9) 
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where B is the angle between the molecular dipole 
uu and the long molecular axis. The dielectric 
anisotropy is 
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There is more freedom for controlling the dielectric 
anisotropy than for birefringence by placement of 
strong dipole moments onto the core structure. 
Strongly positive materials result from polar moi- 
eties such as cyano-terminal groups, or 3, 4, 5 sub- 
stitution of fluoro-groups onto a terminal phenyl 
group. Such placement ensures that f is kept low, 
with Ae ~ +50 being readily achievable. It is harder 
to make strongly negative materials for a number 
of reasons. First, the n; — n> term of Equation 6.10 
is always positive and leads to Ae ~ +2 even if there 
is a negligible dipole moment. Equation 6.10 pre- 
dicts that transverse dipoles, where B ~ 90°, are 
only half as efficient at contributing to a negative 
Ae due to the cylindrical symmetry. Moreover, 
bulky polar side groups detract from the rod-like 
shape of the molecule that leads to liquid crystal- 
linity. Nevertheless, mixtures with Ae ~ —6 have 
been achieved for modern LCD TV applications. 
Note, Equation 6.10 also predicts that a dipole 
moment at B = 48.2° contributes equally to € and 
€,; even a large dipole moment oriented at B ~ 52° 
gives Ae = 0. 

Implicit in the definition of liquid crystal 
behavior is the concept of orientational elasticity 
and the energetic cost associated with deforming 
the director field. Whereas a crystal solid has elas- 
ticity associated with the positional translation of 


the constituent molecules, the liquid crystal has 
elasticity associated with changing director ori- 
entation. The curvature strain tensors of phases 
with cylindrical symmetry contain terms in splay, 
twist, and bend deformations [27] (as shown in 
Figure 6.2a). The elasticity of the chiral nematic 
is equivalent to that of the achiral nematic but 
includes 2z/P subtracted from the twist term to 
represent the spontaneous twist of director. The 
elastic bulk free-energy density for nematics and 
cholesterics G, is given by 


el 


2 
k,(V.n)' + k(n xn “= | 


+k33(nxV xan) 


(6.12) 


where P = ce for the usual achiral nematic case and 
k,, (i = 1, 2, 3) are the splay, twist, and bend elastic 
constants, respectively. 

The elastic constants are important to the dis- 
play engineer because they dictate the amount of 
deformation induced by the applied electric field. 
Both splay and bend elastic constants contribute to 
distortions in the plane of the director and deform- 
ing torque, whereas twist occurs where the distor- 
tion is perpendicular to the director and torque. 
There have been extensive studies of the relation- 
ship between molecular structure and the elastic 
constants and because of this important role they 
play in the operation of LCDs. To a first approxi- 
mation, the temperature dependence of the elastic 
constants follows S?, a relationship that holds well 
for k,, and k,,, where the ratio k,,/k,, is relatively 
insensitive to temperature and chemical structure 
with 0.5 <k,,/k,, < 0.8 and usually k,,/k,, = 0.5. The 
behavior of the bend elastic constant k,, is more 
complex [28], with k,,/k,, being both temperature 
dependent and showing a strong dependence on 
structure, with values ranging from 1.0 < k,,/k,, < 
2.2. Approximating the constituent molecules of 
the liquid crystal to rigid, hard rods of length L and 
diameter W lead to the approximate relationship: 

ky/ky, ~ L/W’, (6.13) 
which provides a useful rule of thumb for the LC 
mixture designer. However, there is a strong influ- 
ence on this elastic ratio from short-range local 
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Figure 6.2 Nematic elasticity and disclinations: (a) Splay, twist, and bend deformations; (b) line dis- 
clinations of strength +1, +%, —%, and -1. In each case, the disclination continues normal to the page, 
and the director remains invariant in this direction (cylindrical symmetry). The defect core is indicated 
in gray; (c) flexoelectric behavior from banana-like molecules in a bend deformation; and (d) flexoelec- 
tric behavior from wedge-like molecules in a splay deformation. 


ordering in the nematic phase due to intermolecu- 
lar dipole correlations or local smectic ordering. 
For example, smectic local ordering of positive Ae 
materials leads to a higher k,,/k,,. 

Liquid-like fluidity of a liquid crystal is dictated 
by the viscosity, which determines the switching 
speed of LCDs. How quickly the director reori- 
ents with respect to a deforming torque is deter- 
mined by the ratio of the viscosity and the elastic 


constants. However, even for the simplest nem- 
atic case, there are five viscosity coefficients [29], 
describing flow of the director in directions par- 
allel and perpendicular to the director and shear 
force directions. The effect of fluid flow can play 
an important part in device behavior, such as the 
optical bounce that occurs in twisted-nematic 
displays due to backflow [30]. Full understanding 
of these effects requires numerical modeling [31]. 
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For simplicity, backflow is often ignored and the 
time dependence of director reorientation without 
coupling to mass flow is represented by the single 
twist viscosity y,. Viscosity has a strong tempera- 
ture dependence related to the change in order 
as well as the Arrhenius dependence common to 
liquids [32]: 


U 

VY =(aS +a, Je #, (6.14) 
where a, and a, are material-dependent constants, 
U is the activation energy for molecular move- 
ment, k is the Boltzmann constant, and T is the 
absolute temperature. Most commercial liquid 
crystal mixtures are based on aromatic systems, 
where a, >>a) for compounds with low birefrin- 
gence, a, >a, for highly birefringent compounds, 
and a, ~ a, ~ 1074 sm~ and U = 0.3 eV for An = 
0.18. At low temperatures, the viscosity often devi- 
ates from this Arrhenius type of behavior as it 
diverges toward a low temperature glass transition. 
The standard Vogel-Fulcher-Tammann equation 
has been applied successfully to a range of nematic 
compounds [26]: 


B 
] =A+ ; 
oe eae | 


(6.15) 
where A and B are material-dependent constants 
and T, is a temperature that is typically about 20°C 
below the liquid crystal glass transition. 
Continuum descriptions of the director field 
are not always satisfactory because the director 
field often also includes topological discontinui- 
ties, called defects or disclinations [27]. Indeed, it 
was the thread-like appearance of nematic samples 
containing such defects that originally led to the 
naming of the phase, from the Greek for thread 
(nematos). Both point defects and line defects are 
common; examples of nematic line disclinations 
with different strengths are shown in Figure 6.2b. 
At each discontinuity in the director field, there 
is local melting of the phase and the liquid crys- 
tal order is zero at the disclination core, which is 
typically of the order of 10nm in diameter. For 
the nematic phase, disclinations of strengths +1, 
—l, +4, and —”% are common, with +% and —” 
appearing as pairs and usually forming defect 
loops or may terminate at interfaces such as the 
containing substrates of a device. Avoiding defects 


is a key part of the design of most LCDs, although 
recently there has been an insurgence of inter- 
est in the potential applications for liquid crystal 
devices with controllable formation of topological 
defects, such as the zenithal bistable display (ZBD) 
described in Section 5.3. 

The constituent molecules for the liquid crys- 
tal do not exhibit perfect cylindrical symmetry 
but have more complex shapes. For example, they 
may not have a twofold rotational symmetry about 
either short or long axes, leading to “wedged” or 
“banana” molecular shapes, respectively. Normally, 
the nematic includes all possible orientations of 
the molecules that maintain the symmetry of the 
phase. However, with bend and splay deforma- 
tions, the distribution of asymmetric molecules is 
shifted slightly as the molecules pack. For example, 
banana-like molecules will tend to orient so that 
molecular curvature follows that of the director 
field (Figure 6.2c). Similarly, the apex of wedge-like 
molecules points into the splay (Figure 6.2d). In 
either case, any molecular dipole will then contrib- 
ute to a macroscopic polarization [33] and flexo- 
electric terms are required in the free energy. The 
degree of polarization for splay and bend deforma- 
tions is represented by the flexoelectric coefficients 
e, and e, respectively, which are dependent on the 
material shape, dipole moments, and density. 


6.2.3 Formulating liquid crystal 
mixtures 


Table 6.1 includes some of the important display- 
related physical properties for a number of com- 
mon nematic compounds, including temperatures 
for the nematic to isotropic transition T,,, as well as 
the room temperature values for the optical, elec- 
trical, elastic, and viscous properties. Each of these 
compounds is stable at operating temperatures and 
when exposed to light, allowing device lifetimes of 
scores of years to be achieved. As shall be shown 
in the following sections, the properties of each of 
the compounds might be suited to different display 
technologies. For example, devices using optical 
scattering effects require a high birefringence to 
become strongly scattering, whereas polarized light 
displays using the retardation effect of the liquid 
crystal often require a lower birefringence to enable 
higher device spacing (and hence higher yields). 
Modes such as the supertwist nematic (STN) 
require k,,/k,, to be high, whereas the simpler TN 
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display requires k,,/k,, to be kept low. Such require- 
ments are rarely isolated, being combined with a 
range of target physical properties, such as appro- 
priate temperature range of the required liquid 
crystal phase, low viscosity, to more complex needs, 
such as low temperature dependence of elastic con- 
stant ratios, etc. This necessitates mixtures to be 
used, frequently involving scores of components. 

Properties such as the birefringence An, the iso- 
tropic to nematic transition temperature T,,, and 
splay elastic constant k,, depend roughly linearly 
with concentration. The dielectric anisotropy Ae, 
and twist and bend elastic constants k,, and k,,, 
are slightly more complex because of the effects 
of dipole correlations and smectic ordering. For 
example, the epoch-making compound pentyl- 
cyano-biphenyl (5CB, compound 1 in Table 6.1) 
has a measured Ae of about 15 close to room tem- 
perature and can be treated as Ae of about 24 when 
adding into mixtures. This is because the antipar- 
allel dipole correlations of the pure compound 
that effectively reduce the parallel dipole moment 
are disrupted in a multicomponent mixture. Care 
still has to be taken when formulating mixtures 
because the breaking of the dipole correlations 
can lead to the unmasking of smectic behavior: it 
is quite common for a mixture of polar and apolar 
nematic compounds to exhibit an unwanted smec- 
tic phase despite neither component having smec- 
tic behavior. Such a phase is called an “injected” 
smectic and arises because the dipole correlations 
of the polar compound were preventing the forma- 
tion of the smectic layers. 

Ignoring the order parameter-related terms in 
Equation 6.14 and considering only the viscosity 
to follow the Arrhenius behavior of conventional 
liquids, the viscosity of an n component mixture 
is given by 


(6.16) 


log(y:)= °C, -log(v1),» 


where i= 1, 2, 3, ...n, and C, is the concentration of 


component i such that bye ; =1. This logarithmic 
concentration dependence means that highly 
viscous additives can be used at low concentra- 
tion without increasing the mixture viscosity sig- 
nificantly. For example, it is common practice to 
increase the clearing point through the addition of 


three- or four-ringed compound, where the linear 
increase in Ty, is accompanied with a dispropor- 
tionately small viscosity increase. 

None of the compounds in Table 6.1 show a 
room temperature liquid crystal phase, rather 
freezing to a crystal form. The liquid crystal phases 
usually supercool below the melting point because 
of the strong first-order nature of crystallization; 
this is particularly true in a thin container such as 
an LCD, where crystallization is suppressed by the 
surfaces. However, LCDs require operation typi- 
cally between —20°C and +70°C and to be stored 
for months down to —40°C. Such temperature 
ranges require the formation of eutectic mixtures 
to suppress melting, at concentrations approxi- 
mated by the Schroeder-Van Laar equation: 


where R is the Rydberg constant; C;, AH,, and T; are 
the molar concentrations, enthalpy of freezing and 
melting point of the ith component, respectively. 
This provides a guide for the material scientist to 
calculate the eutectic composition, given that, at 
the eutectic temperature, the concentrations sum 
to 100%. In practice, more thorough empirical 
methods [34] are required to formulate commer- 
cial mixtures. Table 6.2 includes the composition 
of three typical positive Ae eutectic mixtures, E7, 
ZL12293, and ZLI 4792, together with their impor- 
tant physical properties. 


6.2.4 Functional liquid crystal 
compounds 


As shall be shown, it is not just the nematic phase 
that is used for LCDs. A variety of mesogenic com- 
pounds have been used to impart some new func- 
tionality to the system. Table 6.3 lists examples of 
mesogenic compounds, which are used for impor- 
tant display-related purposes, as summarized 
below. 


1. The inclusion of one or more chiral centers 
(denoted *) in the flexible end chain ofa 
mesogen imparts a tendency for spontane- 
ous twist of the director. The material may 
exhibit inherent cholesteric (N*) or chiral 
SmC* phases (such as CE3 in Table 6.3) or 


Table 6.2 Physical properties of selected nematic LCD mixtures 


Mixture 


Typical use 


Composition 


Nematic temperature range (°C) 
An; n, 

(589 nm, 20°C) 

Ag; €1 

(1kHz, 20°C) 

ky1; kooi keg 

(pN, 20°C) 

Dynamic bulk viscosity n (cP, 20°C) 
Twist viscosity y, 

(mPa-s, 20°C) 

Flexoelectric coefficients 

e, + 3, 

e,-e3 


(pCcm-2, 20°C) 


E7 


ZLI 1132 


Passive matrix TN 


CsHi1—O)XO- C =N 51% 
CHy34O)4O)- C=N 25% 
CaHyO-~OC)O)- C=N 16% 


CsHy, OQHO)- C =N 8% 


S <-30N 58 | 
0.2253; 1.5211 


13.8; 5.2 
11.7; 8.8; 19.5 


465 
180 


+15; 12.2 


CH, \“O)-— C=N 24% 


C3Hi1-¢ D<C)-CEN 36% 


GH, ( = ( 


CsHi1 ( nee 


S<-6N711 
0.1396; 1.4830 


13.1; 4.6 


1.95 


200 
250 


\—C=N 25% 


ZLI-4792 


Active matrix TN 


CeHiis (AOE 7% 


CrHis < )XO)-F 5% 
CH > ( )XO)—OCE, 7% 
C3H,-{_) { )O)-OCE, 11% 
CyHo{_) ()XO)- OCF, 8% 
CsHy-{_) ( XO)-OCE, 8% 

C3H, € XO)HXO)-F 11% 
ie: 
S<-40N 921 


0.0969; 1.4794 


5.2; 3.1 
13.2; 6.5; 18.3 
150 

109 

\10|; -15 


Source: Data collected from References [35-37]. 
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Table 6.3 Examples of functional mesogenic compounds 


# 
1 


Compound 
(+)-4-n-Hexyloxyphenyl-4-(2-methylbutyl) 
biphenyl-4’-carboxylate (CE3) 
O 


ie OOF © oc 


CHs K 65°C Sc* 79°C N* 162°C 1 


K 65°C S-* 79°C N* 162°C | 
2,3-Difluoro-1-ethyloxy-4-[trans-4-propy] 
cyclohexyl] biphenyl 
FOF 


Example anthraquinone dye 


CoH XO)—N O OH 
O NO)— Git 


1,2-Bis(4-hex-5-enyloxyphenyl)diazene 


= C,H,O “Sn-O)-0cyH, = 
RM257: 1,4-Bis[4-(3-acryloxyproyl 
benzoxyl]-2-methylbenzene. 


O 
O 
rf C,H,O O*K 6- 


O03 
K 64°C N 126°C1 O 


DB126: Triphenylene hexa-(2-methyl-4-n- 


decyloxy)benzoate 
RR 


©) es 
R E26) R : PHO oat 
R R 


K 109°C (Col 75°C) Np 164°C 


K 109°C (Col 75°C) Np 164°C | 


may induce chirality through doping into a 
nematic or SmC host material. The important 
characteristic of the chiral compound as a dop- 
ant is its helical twisting power and handed- 
ness. These are related to molecular structure 
through various empirical rules, such as those 
of Reference [38]. 

Smectic phases occur with longer terminal 
chain groups at the mesogenic core. The 


O)-0cH, SS 
6) 


Use Key properties 
N*; 

Chiral Dopant; 
Ferroelectric 


Helical twisting power; 
Induced Ferroelectric 
polarisation 


compound 
VAN Low y, 
Sc High de 
FLC host N phase; 
Wide Sc. 


Pleochroic dye High order parameter; 


High dichroism; 


High miscibility 
Optically induced Solubility; 
reorientation S; 
Absorption efficiency 
Bifunctional reactive Solubility; 
mesogen Photoreactivity 
Discotic nematic. Negative An 


Acrylate version used for 
optical compensator 
films 


formation of tilted phases, such as the SmC, 
occurs with suitable placement of transverse 
dipole moments, though precise control of 
phase transition temperatures and sequences 
remains somewhat an art for the chemist [e.g., 
Reference 39]. 


. Pleochroic and fluorescent dyes can be added 


to liquid crystals to give appropriate opti- 
cal functionality. Pleochroic dyes such as the 
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anthraquinone shown in Table 6.3 have reason- 
ably good miscibility in the cyano-biphenyls, 
leading to anisotropic absorption that is much 
more parallel to the director than perpen- 
dicular to it [40]. This is used in Guest—Host 
displays, where the polarizer is replaced by the 
constituent dye mixed in the liquid crystal, 
allowing switching of the appropriate wave 
bands between absorbing and transmitting 
states. Performance is dictated by the com- 
bined order parameter of the system and the 
direction of the dye transition moment with 
respect to the liquid crystal director. 

. Including a central azo-moiety into the core of 
a mesogen allows photoinduced realignment 
of the director due to anisotropic absorp- 

tion of the dye. Irradiating the molecule with 
polarized light causes reorientation of the 

dye molecule to the direction normal to the 
polarization, usually via transition between 
the trans- and cis-isomers. This may be used 

to effect photoreorientation of the nematic 
director, either reversibly [41] or irreversibly by 
combining the azo-mesogen into polymeriz- 
able groups [42]. 

. Reactive mesogens have become a major tool 
for the displays field [43]. These compounds 
exhibit conventional liquid crystal behavior, 
either individually or when used in mixtures. 
The inclusion of a photoreactive group in one 
or both of the terminal end chains allows the 
material to be polymerized by exposure to 

UV light. Hence, the reactive mesogen may 

be aligned appropriately (through interaction 
with the surface, electric fields or polarized 
light) and the liquid crystal structure stabilized 
by polymerizing the reactive mesogen. The 
solubility of the reactive mesogens is high and 
so a variety of systems are possible, from linear 
elastomers to polymer-stabilized networks and 
gels, and liquid crystal polymers. The photo- 
reactive groups are often acrylates and so may 
also be used in conjunction with standard 
acrylate cross-linkers and photoinitiators. 
Reactive mesogens have been used widely, from 
creating patterned optical retarders, functional 
alignment layers, to creating templates for 
extending the temperature range of narrow 
liquid crystal phases. 

. Discotic liquid crystals have a symmetry axis 
that is parallel to the short molecular axis [44]. 


This leads to distinctive physical properties, 
such as negative birefringence, one-dimen- 
sional conductivity, and semiconductivity, and 
the formation of columnar rather than smectic 
phases. Many of the other anisotropic physi- 
cal properties, such as the order parameter, 
permittivities, and elastic constants are similar 
to those of their calamitic (i.e., rod-like) coun- 
terparts. A particularly important application 
for LCDs is the use of polymerizable discotic 
liquid crystals to form optical compensation 
plates. 


6.3 BASICS OF LIQUID CRYSTAL 
DEVICES 


6.3.1 Basic display construction 


Much of the basic scientific understanding of liquid 
crystals was available in the first half of the 20th 
century. Mauguin [45] had found that, contrary 
to the optical rotation caused by a quarter-wave 
plate, polarized white light could be guided by a 
liquid crystal with a twisted structure, allowing 
light to be transmitted between crossed polarizers 
by samples with a twist of 90°. Fréedericksz had 
shown field-induced switching and the existence of 
a critical threshold for magnetic [46] and electric 
[47] fields applied to a uniform sample. Chatelain 
[48] achieved uniform alignment of the liquid 
crystal director using thinly spaced samples and 
rubbing to define the orientation. Such scientific 
advances prompted the first liquid crystal device to 
be patented by the Marconi Wireless Company in 
the UK in 1936 [49]. However, the first LCD com- 
mercialization had to wait until the early 1970s, 
and the availability of various other components, 
many of which were invented and developed inde- 
pendently for other applications. These associated 
technologies include the following: 


e Glass. Typically 0.7 mm thick, polished glass 
is remarkably optically uniform (n = 1.520 + 
0.005), transparent (91% transmission from 320 
to 700 nm) and flat (£0.1 pm). Together with 
its excellent mechanical, thermal, and eco- 
nomic properties it is hard to imagine a better 
substrate material. Today’s 40” TV displays 
are made on $2B production lines that handle 
Generation 10 glass (2850 mm x 3050mm), 
though most small displays are still made 
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using Generation 4 production lines (550mm 
x 650mm). Two types are common: soda lime 
glass for low-cost displays and borosilicate 
glass for high-end TFT (thin-film transistor) 
displays. The latter has a low alkali ion content 
and so adds far fewer ionic impurities to the 
contacting LC that would otherwise prevent 
the TFT maintaining its charge across the 
pixel. Recently, ion-implanted reinforced glass 
and ultrathin (<100 1m) glass have been used 
for high-end displays, particularly for use in 
portable display applications and curved screen 
HDTV. 

e Indium tin oxide. Following the initial work 
done in the mid-1960s at Philips [50,51], and 
in the Far East at the Japanese Government 
Research Institute [52], indium tin oxide (ITO) 
has become the dominant transparent con- 
ductor for the display industry, as well as for 
many other applications. The layer is sputtered 
onto the glass surface between 50 and 200nm, 
depending on the display requirements. The 
thin layer causes loss of light through inter- 
ference and reflection, but these thicknesses 
usually lead to about 88% and 85% transmis- 
sivity of the glass overall and correspond 
to sheet resistances of 80 Q/O and 15 Q/O, 
respectively. It is important to understand 
how sheet resistance works: a square of ITO 
gives a resistance of 15 Q regardless of its area. 
Calculation of the resistance of any electrode 
requires the number of squares to be multi- 
plied by the sheet resistance. So, for example, 
the resistance of a display electrode that is 
10cm long and 200 um wide is typically about 
15 Q/O X L/W =7.5 kQ. 

e CMOS circuitry. CMOS integrated circuits were 
invented by Wanlass at Fairchild in 1963 [53], 
whereby p-type and n-type transistors are used 
in parallel to provide low-power logic circuitry 
(and hence suitable for use with battery oper- 
ated equipment) and logic output voltages 
of between 3.5 and 5 V, which could be used 
directly to address the liquid crystal. The first 
LCD product was launched by Sharp in April 
1973: the EL-805 electronic calculator. The 
CMOS provided both the logic for the calcu- 
lator operation and the driving circuitry for 
the dynamic scattering mode LCD. This was 
followed in October 1973 by the 06LC digital 
watch from Seiko, which used a TN LCD. 


e Polarizing film. Although prism and wire- 
grid polarizers were established technologies, 
Land’s invention of iodine doped stretched 
polymer film polarizers in 1929 [54] and the 
subsequent use of polymerized dichroic nemat- 
ics by Dreyer in 1946 [55] enabled low-cost 
flat-panel displays to operate by controlling 
the polarization state. Today’s polarizers use 
aligned silver nanoparticles, and comprise 
protective films, adhesives, release liners, and 
other filters required for different display pur- 
poses [56]. 

e Thin-film transistor [57]. It was invented at RCA 
by Paul Weimar in the early 1960s. The original 
demonstration used tellurium as the semicon- 
ductor but other materials researched at that 
time included silicon, germanium, cadmium 
selenide, and cadmium sulfide. Most displays 
today use either amorphous or polycrystalline 
silicon. 


There are various different types of LCD but many 
of the design principles and modes of operation are 
common throughout. Consider the common TN 
LCD [14] shown in Figure 6.3. The display is formed 
from two containing plates, the front one at least 
being transparent and coated with transparent 
conducting electrodes, such as ITO. The plates are 
spaced a few microns apart by glass beads, adhered 
together and the laminate filled with the liquid 
crystal material and sealed. The most common 
substrates in use are 0.5 or 0.7mm glass, though 
optically isotropic plastic substrates may also be 
used for low weight and flexibility. Reflecting metal 
foil layer has been used as the rear substrate where 
flexibility and high temperature thermal processes 
are required for a reflective display. The ITO is 
etched to form the appropriate electrode pattern. 
This is often a series of rows and column electrodes 
on the opposing internal faces of the device. In 
such an arrangement, the electrodes form a par- 
allel plate capacitor, wherein the individual pixels 
are formed in the regions of overlap of the rows 
and columns. Of course, other designs are pos- 
sible, such as polar coordinates (formed from axial 
and radial electrodes), alphanumeric characters 
and icons, or interdigitated electrodes on one of 
the substrates to provide an in-plane electric field. 
The most sophisticated displays, used from mobile 
phones to HDTV, use a TFT on one of the plates, 
addressed using copper bus lines to provide the 
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Figure 6.3 Construction and operation of a twisted nematic LCD: (a) Schematic diagram of a 4 pixel 
TN in the off-state; (b) cross section of a modern chip-on glass LCD; (c) typical electro-optic response 
for a monostable LCD, such as the TN; (d) schematic diagram of the 4 pixel TN with voltage V, applied 
to one row, and data +V, applied to the columns, such that one pixel (bottom left) is switched dark. 


signals to a pixel electrode. In these instances, the 
opposite plate is a single electrode, held at Earth. 
Creating an electro-optic effect requires some 
optical property, such as the reflectance or trans- 
mittance for a backlit display, to be changed with 
an applied field. Today’s commercial LCDs almost 
exclusively use reorientation of the director from 
some initial prealigned state, dictated by alignment 
layers on the inner surfaces of the display. Various 
alignment layers are possible, but most devices use 
a polymer coating to impart either homeotropic 
(ie., normal to the surface), planar homogeneous, 
or tilted homogeneous alignment of the director 
(Figure 6.4). For example, the TN device includes 
polyimide coatings rubbed in a direction to impart 


a homogeneous alignment with a surface pretilt 0, 
of typically 1° to 2°, though an STN would require 
a somewhat higher pretilt of 2° < 0, < 6°. For the 
simple TN mode, a 90° twist from the top to bot- 
tom surfaces is induced in the device by arranging 
the rubbing directions on the opposing surfaces to 
be normal to each other. The prevention of tilt and 
twist disclinations is ensured by including a small 
helical twisting power to the nematic through the 
addition of very low quantities of a chiral nematic 
and by matching the sign of the helicity to the two 
pretilts to minimize bend and splay distortion [58]. 
Ignoring biaxial surface terms, the orientation of 
the director at each surface is governed by the sur- 
face energy Gs: 
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Figure 6.4 Nematic surface alignment leading to uniform director profile: (a) Definition of in-plane 
azimuthal angle B and out-of-plane zenithal tilt angle 0; (b) homeotropic; (c) planar homogeneous; 
and, (d) uniform tilted from antiparallel surfaces with pretilt 6,; (e) basic structure of the dedicated 


(e) 


polyimide AL 1051 from Japan Synthetic Rubber KK. 


We(n-s—0,) +Wa(n-r 
Gs=5] dS, 
s| +W,(P:s)” 


(6.18) 


where s is the surface normal and r is the preferred 
alignment direction in the plane of the surface 
(Figure 6.4a). Changes to the surface tilt are related 
to the zenithal anchoring energy, W,. Typically, 
107 < W,< 103 Jm~ and 10° < W, S$ 10° Jm? [59] 
though for typical commercial devices the anchor- 
ing energies are greater than 10-°-Jm~ and are 
usually considered as fixed boundaries. The polar 
surface term W, is insignificant for nematics but 
is important for ferroelectric liquid crystal (FLC) 
devices, where there is often a preferred orientation 
of the spontaneous polarization with respect to the 
surface normal. 

Each device is constructed in two stages: the 
back-end process is done under strict clean room 
conditions, and the front-end, wherein the devices 
are filled with liquid crystal and the associated 
optical layer and driving electronics added and 
testing is done. The back-end processes are done 
in a clean room and consist of producing empty 
laminates each with the potential to form multiple 
displays. The factory equipment dictates the size of 
the plates used; it is important for high yield and 
efficiency that the glass is large enough to allow at 
least six or eight displays to be made on a single 
laminate. Even 47in. diagonal television displays 
are produced eight per laminate using generation 8 


sized glass (2200 x 2500 mm) and large-scale asso- 
ciated manufacturing equipment. 

The plates are made on a production line pre- 
venting the use of many standard laboratory-based 
practices, such as oven baking and spin coating. 
Each step occurs within the TAKT time, which is 
the time that one plate moves from one step to the 
next; the TAKT time, by definition, must be the 
same for all of the steps on a given line. Following 
a brief inspection, the laminate is then sent to the 
front-end, where it is singulated, filled, and the 
remaining components attached. 

The processes for constructing a passive matrix 
display are shown in Figure 6.5. Many of these 
processes are common to either passive matrix or 
TFT-driven active matrix devices. Common back- 
end processes include 


1. Glass cleaning: Glass is supplied with the 
appropriate thickness of ITO deposited onto 
one side. The plate is washed with deionized 
water, ultrasound, soaps and sometimes sol- 
vents such as IPA (though this is not preferred 
due to flammability). 

2. Electrode patterning: This is done using 
photolithography. A thin layer of positive 
photoresist is printed onto the substrate, soft 
baked (to harden) and a mask is placed in 
contact with the layer. Where exposed to UV, 
the photoresist is washed off by developer but 
the photoresist remains in contact with the 
ITO everywhere else. The exposed ITO is then 
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Figure 6.5 Construction steps for a passive matrix display. 


removed by acid and finally the remaining the laminate. The density of the spacers 
photoresist is stripped from the patterned ITO depends on the cell gap but is typically 50-100 
using an aggressive solvent. It is common for per mm’. 
the electrode patterning to be done separately 5. Glue seal: A thermal epoxy is syringe depos- 
from the standard production process. ited onto the other substrate to form the 
3. Alignment layer: Following another clean step, boundaries for each individual display. The 
the alignment layer is pad-printed onto the pat- seal is designed to be as thin as can be reliable, 
terned glass in the areas that form the viewing with four edges comprising the sealant but 
area of each display. The alignment layer is pat- with one edge having a =5 mm wide hole to 
terned to avoid the part of the electrodes where allow filling. After deposition, the glue solvent 
bonding (electrical access) is required and is removed in a soft bake and the plate brought 
where the glue seal will be deposited (to ensure into contact with its opposing, spacer-coated 
a strong seal). The polyimide is then baked at plate. The laminate is then sealed under 
high temperature (e.g., 180°C) to harden it and pressure using a heated press. Usually, the 
then rubbed using a rayon cloth mounted onto glue seal includes glass rod spacers to ensure 
a roller. The resulting pretilt is not only depen- the most uniform spacing to the edge of the 
dent on the choice of the alignment polymer panel. Often, the glue seal will also include a 
but also on the rubbing strength L: low density of gold beads, each with a diam- 
eter 20% or so higher than the spacers. These 
L=N:1 {» 2nr | (6.19) gold beads are distorted to near cylinders 
60V under the pressure used to seal the device. In 
this fashion, they form multitudinous one- 
where Nis the number of passes of the roller, / is its dimensional conductors randomly distributed 
contact length (mm), n is the roller rotation speed throughout the sealant, linking the top and 
(rpm), ris the roller radius (mm), and v is the speed bottom substrates. These allow connection 
of the plate (mm-s"'). Each of these steps needs to from one plate to the other and thereby allow 
be controlled to minimize display variation. a single driver to be used to supply the row 
and column signals. Careful patterning of 
4. Spacers: Polystyrene spheres of the required the ITO electrodes ensures that no unwanted 
spacing are either deposited in air or sol- shorts occur, within the alignment tolerance 


vent onto one of the glass plates that form of the plates. 
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The completed laminate then leaves the high-end 
clean room for the front-end processes. Common 
front-end processes are as follows: 


6. Singulation: Each display is cut from the lami- 
nate using a diamond scribe. Each display is 
cut to give at least one bonding ledge to allow 
electrical access to the electrodes. The panels 
are inspected for any nonuniform cell gaps and 
tested for shorts. 

7. Filling and sealing: For small area LCDs, each 
device is filled in a vacuum chamber, where the 
cell is initially evacuated, the liquid crystal is 
brought into contact with the sealant’s fill- 
ing hole where it begins to fill by capillary 
action. Although still in contact, the vacuum is 
removed and the air pressure used to force the 
liquid crystal into the remainder of the display. 
Once complete, each display is sealed with 
a low-ion content UV glue while pressure is 
applied to the cell to ensure that the correct cell 
gap is obtained on the sealed cell. Large display 
panels are filled before singulation using a one- 
drop-fill method [60], which is combined with 
sealant deposition and vacuum assembly steps. 
One-drop filling is a major enabling technol- 
ogy that helped LCDs meet the cost require- 
ments for large area applications. 

8. Polarizers: After cleaning, each cell has polar- 
izers and associated optical films mounted 
front and back. The films are supplied with an 
index-matching adhesive to reduce extra reflec- 
tions from the layers. Ifa reflective display is 
required, the rear polarizer also comprises 
a diffusive mirror. Front polarizers too may 
include lightly scattering elements that act to 
remove unwanted reflections and compensa- 
tion films for improving viewing angle. 

9. Driver bonding: Connections are made from 
the exposed ITO on the bonding ledge to the 
driving circuitry by one of four methods: 

a. Surface mount technology: Connectors are 
made directly from the bonding ledge to 
the printed circuit board (PCB); 

b. Chip-on-board: The silicon die is mounted 
onto the PCB, connected to the panel using 
gold wires, and protected by adhesive; 

c. Tape automated bonding-mounted drivers. 
The driver manufacturers often supply 
tape-automated bonded chips. The chip is 
mounted onto a tape with gold I/O lines 


etched into it. The tape is adhered to the 
bonding ledge using a one-dimensional 
conducting adhesive; 

d. Chip-on-glass (COG). COG uses gold- 
bump soldering to attach the silicon chip 
directly to the bonding ledge. The chip 
must be protected from light and handling 
by embedding in a black epoxy adhesive. 

10. Inspection and test: Obviously, the degree 
of testing is minimized to reduce costs, but 
manufacturers will operate a quality system 
to determine and minimize the occurrence of 
optical and electrical defects. 

11. Provision of color: Full-color LCDs are made 
in the same way as described above but the 
glass plates are prepatterned with color 
filters (for the front plate) and TFT (for the 
rear plate). Usually, Red-Green-Blue filters 
are used and aligned over subpixels in the 
column electrodes. A black matrix surrounds 
each filter to maximize contrast, minimize 
color leakage, and optically isolate the TFT 
(thereby preventing problems with photo- 
generated charge). The resulting structure 
has a lower pixel fill factor, typically <50%. 
Given the losses associated with the low fill 
factor and absorption of 67% of the light by 
the color filters, reflective full-color LCDs are 
yet to be successful commercially: the images 
are too dim in all conditions but the bright- 
est sunshine. Instead of using a reflective rear 
polarizer, color panels are combined with a 
backlight unit to provide illumination. 


6.3.2 LCD polarization optics 


Although liquid crystals can be used to emit light 
[61], all LCDs use the liquid crystal medium to 
modulate light incident on the panel, whether from 
ambient light or from a built-in source. The aniso- 
tropic nature of the liquid crystal presents a number 
of means through which contrast can be produced, 
whether by scattering, absorption, selective reflec- 
tion, or changes of optical retardation and hence 
polarization state. The great majority of LCDs 
are sandwiched between film polarizers and use 
changes of optical retardation to give the required 
appearance. The state of polarization of light, as it 
travels through an optically transparent medium, 
can be linear, elliptical, or circular, depending on the 
relative magnitudes of two orthogonal polarization 
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components. As the light goes from one transpar- 
ent medium to another, light of each of the polar- 
ization components will be refracted, reflected, 
and the polarization reoriented. Finding the opti- 
cal state requires solution of Maxwell’s equations at 
each interface for each polarization. However, if we 
ignore the reflections and refractions and just con- 
sider the polarization state, then we can consider 
the light to be given by a 2 x 1 Jones vector: 


where A is the amplitude, x, y orthogonal compo- 
nents for light traversing in the z direction, and 5 
represents the phase related to the wavelength A 
through the refractive index by 


= 2mn;t 


8; x p= xy. (6.21) 


If the light is incident on a birefringent layer, which 
has refractive indices n, and n, and thickness t, 
then the state of the polarization of the transmitted 
light will depend on the orientation of the incident 
light with respect to the optic axis and the phase 
difference between the x and y components. The 
maximum retardation due to the birefringence is 
I} given by: 
2nAnt 


T= : 
ny 


(6.22) 


Transformation of the Jones vector requires the 
operators to be 2 x 2 matrices called a Jones matri- 
ces [62]. 

Consider a simple uniform planar sample with 
parallel rubbing directions and a uniform cell 
spacing d. Ignoring the reflections and absorption 
in this fashion is reasonable for thin (d < 25pm), 
well-aligned samples, and it means that the polar- 
ization state may be calculated using the Jones 
matrix for a birefringent retarder: 


(6.23) 


Placing this retarder at an arbitrary angle 
between crossed polarizers gives the Jones vector 
for light transmitted by the device: 


Ey 1 0)\(cosd —sind e? 0 cosh —sind\(0 0)(0 
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(6.24) 
The transmission T is then given by 
_ E’-E’ E+E,’ 
EE E+E,’ 
= sin’ 2osin” 5 = sin’ (26)sin"( cand } (6.25) 


where E* is the conjugate if E (which may be com- 
plex) and t = d is used for the spacing of the liquid 
crystal cell. 

Equation 6.25 suggests that the transmitted light 
is maximum if the cell gap and birefringence are at 
the quarter-wave plate condition An-d=A/4, with the 
polarizers oriented at +45° to the alignment direc- 
tion. The eye is most sensitive to green wavelengths 
and setting A = 550nm gives broad transmission 
across the wavelength range such that the device will 
appear white; for a typical liquid crystal, this condi- 
tion occurs for cells spaced at about 2-3 1m. 


6.3.3 Basic operation: electrically 
controlled birefringence mode 


Applying an electric field to the aligned liquid 
crystal will tend to align the liquid crystal director 
parallel to the field direction, if the material has a 
positive Ae or is perpendicular to the field if nega- 
tive. Considering the pixel in one dimension only 
and the director at the containing surfaces anchored 
strongly, the total free energy of the liquid crystal F 
is given by combining Equations 6.3 and 6.12 as 


¢ ky(V-n) +ky(n-Vxn)y 


F=5| dz. 
: 4 + ks3(nxV xn) —eoAe(n- EY 


(6.26) 
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For the simple case shown in Figure 6.6a, the elec- 
tric field is applied normal to the surfaces such that 
E= (0, 0, V/d) and initially n = (1, 0, 0). As the field 
is increased, there is a torque on the director acting 
to increase the tilt angle 0: 


F= 


Nile 


¢ 2 

fl sin’ 0+ ks3 cos?) 9 eek” sin 0a ; 
Z 

0 


This integral may be solved using the Euler- 
Lagrange equation: 


2 
(ki cos’ 0+ k33 sin’ (2) +€ AcE’ sin? 0=C, 
Zz 


(6.28) 


where the constant C is found from realizing that the 
maximum tilt 6,, must occur at the cell center z = 
d/2 because of symmetry, at which point d0/dz = 0 


(6.27) and hence: 
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Figure 6.6 (a) Fréedericksz transitions in the planar homogeneous geometry for a positive Ae nem- 
atic (top) and the homeotropic geometry with a negative Ae nematic (bottom). The dominant elastic 
constants at different positions in the device are indicated. (b) Calculated tilt profile versus V/V, for a 
material with e, = 20.25,£, =5.36, k,,= 11 pN, kz; = 17 pN in a cell of gap d= 101m [63]. 
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C=€,A8E’ sin’ 0). (6.29) 


Substituting back into the 


Equation 6.28 gives 


Euler-Lagrange 


(sin? 0. —sin’ 8) 


do_V 
((kss — kn) sin? 0+ ky) 


EAE 
dz d 


(6.30) 


At low-field strengths, the small angle limits to 
the tilt angles and allow Equation 6.30 to be reex- 
pressed as 


2 1 ky 
ved | ety 


a be 


(6.31) 


which tends toward 


1 
V-dz=d Au__ (92-9) 240 (6.32) 
Ae 
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as 8° 0. Integrating Equation 6.24 to find the 
threshold field gives 


d 
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, 2 \ eoAe , 


®m 


d ku -1 9m { 0 
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Hence, there is a critical voltage that is indepen- 
dent of cell gap at which the electric field induced 
distortion begins, a threshold that depends on the 
root of the ratio of the relevant elastic constant and 
the dielectric anisotropy. By analogy, the threshold 
for a negative Ae liquid crystal in the homeotropic 
geometry is 


(6.34) 


Ve =T Kss . 
€o|Ae| 


Figure 6.6b shows the situation where a positive 
Ae material is used but the field is applied in the 
plane of the cell; for example, using interdigitated 
electrodes. Similarly, a planar sample with the pre- 
ferred alignment direction normal to the applied 
field will cause the director to twist at the electrode 
surface. In such instances, the electric field is not 
uniform, though it may be approximated at the 
electrode surface to be V/I, where | is the separa- 
tion between the electrodes of opposing voltage. In 
such cases, the cell gap does not cancel in Equation 
6.33 and the threshold depends on both cell gap 
and electrode spacing 


I k33 1 ky 
Vo = 3 Vo zt 6.35 
5 d \ € Ae : d \ € Ae ( ) 


for the homeotropic and planar cases, respec- 
tively. This latter geometry is utilized with in- 
plane switching (IPS) devices, as will be described 
in Section 4.6. If the field is applied normal to a 
uniform director, the threshold is well defined. 
In practice, the alignment layers induce a small 
pretilt, and the director profile of the quiescent 
state is not uniform, neither is the electric field 
applied normal to the director. Thus, the thresh- 
old is rather second order in nature and begins to 
occur somewhat below the voltage predicted by 
Equations 6.33 through 6.35. 

Above the threshold, the solution to Equation 
6.30 must include the higher angle terms. Again 
considering the planar-aligned case shown in 
Figure 6.6a, dividing equation through by V; gives 


V d Ksin’@+1 
-dz= 


d@, (6.36 
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where « = (k33—k,,)/k,,. Immediately above the 
transition, a reasonable solution is found by inte- 
grating Equation 6.28: 
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using a Fourier analysis to solve the complete ellip- 
tical integral of the third kind, 


Mea (1+ F(c)sin?6, +), (6.38) 
Ve 4 


which has the first-order solution: 
z 1. 4f 8 | 
d = oe {2 6 (On° 0°). 
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where 


6, = sin’ (ealZ=)} (6.40) 


This indicates that, immediately above the thresh- 
old, the tilt of the director increases linearly, with 
a gradient that is inversely related to the elastic 
ratio k,,/k,,. This is indicated in Figure 6.6a, where 
the bend elastic constant becomes increasingly 
pertinent at the center of the device, and splay is 
increasingly pushed to the alignment surfaces. 

Above the electric field-induced Fréedericksz 
transition, the situation is complicated by the 
effect of the nonuniform dielectric properties 
of the distorted director profile on the applied 
electric field. The electric torque is related to the 
electric displacement vector D and above the 
transition the director profile is no longer uni- 
form such that E has a direction that is depen- 
dent on the distortion. That is, the effect of the 
field depends on the elastic energy, but conversely, 
the elastic energy depends on the field. Ignoring 
the effects of charge, V-D = 0 and assuming D is 
a function of z only, solutions to the free energy 
expression require E to be calculated self-consis- 
tently with the director profile: 


(6.41) 
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and thus 
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Solutions to this integral are done numerically. 
Indeed, display manufacturers use commercial 
packages, such as “LCD Master” from Shintech, 
“TechWiz” from Sanayi, or “DIMOS” from Autronic 
Melchers [64], to calculate the director profile as a 
function of applied field for their devices. 

Consider the case of Figure 6.6a where the cell 
retardation is set to the quarter-wave plate condi- 
tion at 550 nm (the peak of the eye’s response, An.d 
= 550nm/4 = 137.5nm) and the alignment direc- 
tion is set at @ = 45°. As the voltage is then applied, 
the overall birefringence of the cell decreases 
as the optic axis of the director at the center 
rotates to toward the field and viewing direction 
and the effective birefringence is reduced to An’. 
Simplistically, we can consider the director to be 
uniformly at some average tilt angle 6, the new 
refractive indices are given by 


1 _ sin’® . cos’ 


m= nh; = (6.43) 
caer ae 
such that the effective birefringence An’ is 
net 
An’ =n-n = aaa ——n,. (6.44) 
Jn’sin’® + n,’cos’0 


From Equation 6.25, the transmission of polarized 
light (i.e., after the first polarizer) becomes 


sin? ( mAn'd } 
a 


With sufficient field, And — 0 and Equation 6.45 
predicts T — 0. Thus, above V¢ the cell changes 
from white toward black. Figure 6.7a shows this 
decrease in transmission for red, green, and blue 
wavelengths, calculated using Equations 6.40 and 
6.45 for a 0.67mm cell filled with a liquid crystal 
material with n, = 1.7, n, = 1.5, and k,,/k,, = 2.0. 


T= (6.45) 
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Figure 6.7 Theoretical transmission characteristic for an electrically controlled birefringence LCD. 
(a) The retardation at the quarter-wave plate condition for green light (d= 0.67 um, An= 0.2); (b) a 
thicker cell with retardation at the full wave condition for A = 500nm (d= 2.5 um, An= 0.2). Other 
fitting parameters used: n, = 1.7, n, = 1.5, and k33/k,, = 2. (c) Experimentally determined transmission 
curve for a cell close to the full plate condition. The results are taken for a white light source imaged 


through an eye-response filter. 


Figure 6.7b shows the behavior for a sample 
with a higher initial retardation; the same material 
with a 2.5 1m cell gap. Each wavelength has trans- 
mission peaks corresponding to odd multiples of 


the quarter-wave plate condition (i.e., And/i = %4 
and '4) because these peaks occur at different volt- 
ages for different wavelengths, the cell appears to 
change color as the voltage is increased. 
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Figure 6.8 Explanation for the poor viewing angle for the electrically controlled birefringence effect. 


There are a number of reasons why such a sim- 
ple device is not used as a display effect, illustrat- 
ing the thinking that the display engineer needs to 
follow. 


e The optimum cell gap is very small, typically 
less than 1 pm, and always much lower than the 
cell gap required for a high production yield 
(typically 4m or higher). 

e The device cannot be passive matrix multi- 
plexed because the saturation voltage is typi- 
cally several times V,. This will be explained in 
the following section. 

e The optical configuration is very sensitive to 
changes in cell gap. For example, a typical LCD 
tolerance is +0.15 ym; such a change would 
cause a 12% reduction in the transmission and 
coloration for the Electrically controlled bire- 
fringence (ECB) device. 

e The viewing properties of the device are very 
poor in both high and low voltage states, as 
illustrated in Figure 6.8. In the 0 V state, the 
effective birefringence decreases off-axis, 
the quarter-wave plate condition is lost and 
the device looks dark instead of bright (in the 
direction orthogonal to this, there is little 
change, and the device still looks white). In 
the high voltage state, the off axis transmis- 
sion is even darker than the full ON state 
when viewed in the direction of the director 
tilt. When viewed in the other quadrant, the 
retardation increases rather than decreases as 
desired on switching causing contrast inver- 
sion, and the device begins to appear colored. 


The response time for even such a simple device 
is complex due to the effects of field-induced flow 
and the need for the anisotropic viscosities to 
be considered. If a flow is ignored, the response 


time depends solely on the twist viscosity y, that 
describes rotation of the director. The Euler- 
Lagrange equation for the time-dependent free 
energy of Equation 6.28 then becomes 
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(6.46) 


For simplicity, the elastic anisotropy is discarded 
and a single elastic constant k is assumed: 
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Applying a small angle approximation [65], the 
characteristic ON and OFF times are 


Ton = nh = vad" 
2 2 2 
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These times are made short by using materials with 
low viscoelastic ratios y,/k but most effectively by 
keeping the cell gap d low. Examples of mixtures 
providing response times as low as 30 ms at —20°C 
and 10 ms at +20°C in a 31m cell have been pro- 
duced [66]. Given the 3—-41m lower limit on cell 
gap set by clean-room quality and device unifor- 
mity, it would seem advantageous for high An 


(6.48) 


6.3 Basics of liquid crystal devices 161 


materials to be used to enable the required optical 
effect with a lower cell gap. In practice, however, y, 
tends to increase with An and so the efficacy of this 
is limited. Ensuring that the voltage is made high 
can quicken the ON time significantly. However, it 
is the sum of the ON and OFF times that is relevant 
for display applications. Moreover, if the display 
requires intermediate grey levels or some degree 
of passive matrix addressing (see Section 3.4), the 
ON voltage is limited and even the ON time can be 
slow. For example, switching to the gray level clos- 
est to V, is inherently slow. 

In the case of a passive matrix addressed device, 
even the OFF pixels have a voltage V, applied and 
the response time is given by [67] 


In practice, the response is complicated by induced 
flow of the liquid crystal and the aligning effect of 
the director in response to such flows. For example, 
the OFF response is often slowed considerably by 
backflow that tends to initially reorient the direc- 
tor at the center of the cell in the opposite direction 
to that which gives the eventual lowest energy state 
[68]. Avoiding this “optical bounce” effect is con- 
sidered further in Section 6.6.2. 


6.3.4 Passive matrix addressing and 
the multiplexing limit 


For low information contents, it is satisfactory to 
form a display where each pixel is driven directly 
from the driver. Usually, one of the electrodes is 
shaped into the desired pattern, such as a pixel for 
a seven segment or alphanumeric image and the 
other electrode is a common electrode. A display of 
N pixels then requires N + 1 electrodes. An exam- 
ple arrangement is shown in Figure 6.9a, where a 
single seven segment number is displayed using just 
eight electrodes. Rather than have separate connec- 
tions to the two plates of the LCD, access to all of 
the electrodes is through a single bonding ledge: the 
opposing common electrode is connected through 
electrodes that are connected through the one- 
dimensional conducting seal. The access electrodes 
to each segment is kept small, to minimize overlap 


with the common electrode. It is essential for such 
a design that the common plate has only a single 
point of overlap with ITO on the electrode plate 
and conducting seal to prevent unwanted short- 
ing of pixels. A typical transmission versus voltage 
characteristic for an LCD is shown in Figure 6.9b. 
With a direct drive scheme, the OFF voltage must 
be below the threshold T,, (usually 0 V) and the ON 
voltage should be above the saturation level T,,. 

A direct drive approach is satisfactory for 
devices with a small number of pixels. More 
complex images require some degree of matrix 
addressing (or multiplexing), where appropriate 
signals are applied to electrodes on both top and 
bottom plates. The electrodes may also be shaped 
into alphanumeric characters and designed for 
a low level of multiplexing or indeed into a more 
complex (8) arrangement for the display of polar 
coordinates. The most common design, however, is 
for the electrodes to comprise N rows and M col- 
umns to form a rectangular M x N display. Time 
division multiplexing is used to apply appropriate 
signals to the rows and columns, when driven from 
only M + N connections. 

Consider the simple 7 X 6 matrix display illus- 
trated in Figure 6.10, where each pixel is addressed 
by sequentially scanning through the six rows, 
although the appropriate data are synchronously 
applied to the columns to discriminate between the 
ON and OFF states. Figure 6.10 shows an example 
instant within the image frame, where the data volt- 
ages are being applied to seven columns and are 
those for the third row electrode. The data signal on 
the columns is +D volts, where it is the sign of the 
signal that discriminates between the ON and OFF 
states of the pixel on the third row. At this instant, +S 
volts is being applied to the third row while all other 
rows are kept at ground OV. The data being applied 
at this instance are the sequence of pulses of differ- 
ing polarity given by (++--+-+)D. The potential 
difference at the pixels is defined as Row-Column, 
such that the third row experiences the voltage: 


Row 3: +S—D;+S—D;+S+D;+S+D; 
+S—D;+S+D;+S—D, 
Each of the other rows experiences the following: 


Row 1,2,4,5,6: —D;-—D;+D;+D;—D;+D;—D 
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Figure 6.9 Design of (a) single element seven-segment LCD, showing front and rear plates face-up, 
and the completed module. (b) A typical transmission voltage characteristic for an LCD. 


Once the third row is addressed, the scan signal discriminate from the low voltages |(S—D)| despite 
moves to the fourth row and the data signals change __ the fact that each row is only addressed for a 1/N 
sign appropriately to supply the signal for that row. fraction of the frame time and for the remainder 
The aim of the multiplexing scheme is to ensure of the frame (N—1)/N each pixel experiences the 
that the high voltages |(S + D)| are sufficient to data signal +D. 
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Figure 6.10 Example of passive matrix multiplexing for a 7 x 6 pixel graphic display. The row and 
column signals are for the 1/6 proportion of the frame that corresponds to the third row being 


addressed. 


Nematic displays respond to the root-mean- 
square (RMS) voltage applied over the whole frame. 
As soon as the signal has been removed, each pixel 
starts to relax to the off state, so constant refreshing 
is done at a frame rate sufficiently fast to prevent 
this relaxation (or frame response) and maintain 
the image. For a simple scheme, the frame time 
will equal the slot time for each row Tt, multiplied 
by the number of rows N. It is independent of the 
number of columns, and hence it is common for 
the rows to be chosen as the lower number in the 
matrix to ensure the faster frame rate. 

The RMS of the voltage applied over N rows for 
the ON and OFF signals is 


(6.50) 


where D and S should be chosen to ensure that Voy 
is above the transmission saturation (i.e., >Ty 9.) 
and Vopp is below the threshold (i-e., <T\9.,). The 
steepness of the electro-optic response is related to 
R= Von! Voge! 
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where b = S/D is called the bias ratio. The maxi- 
mum number of lines that can be driven for a given 
Sand D is found by differentiating 


3 b’+2b+N 
a(R*) _ b?-2b+N a 4(b? +N) _ 
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(6.52) 
This has solutions when either b = - or b = S/D 


= JN. Substituting the latter into Equation 6.52 
gives the Alt-Pleshko [69] relationships: 


Vie N+2VN+N _ |VN41 6.53) 
Vor), NN-2VN+N \YVN-1 


or: 


(6.54) 


This relationship suggests that, as N increases, the 
maximum Von/Vorr must tend toward unity; that 
is, the threshold characteristic must increase in 
steepness to allow more lines to be addressed, as 
shown in Figure 6.11a. The typical TN characteristic 
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Figure 6.11 (a) The Alt-Pleshko multiplexing limit and (b) addressing voltages. 
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six levels of time division multiplexing. 

The signal voltages S and D that are required 
for the N x M display are found by substituting 
the optimum multiplexing ratio S= DVN into the D 2(VN + 1) 
RMS voltages of Equation 6.50: JN 


shown in Figure 6.9c has a response suited to up to 
Von as | 


which leads to the following data D and strobe S 


voltages: 
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OFF — 
N N N 
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S=Vorr 


aN a1 


(6.57) 


Equations 6.56 and 6.57 are plotted in Figure 
6.11b; they indicate that, at high N, D~1/J2V< 
and S=,/N/2Vc so that the maximum voltage 
that needs to be delivered by the driver chip (S 
+ D) increases with the level of multiplexing N, 
while the discriminating voltage D decreases. For 
this reason, STN displays with N = 240 typically 
require drivers capable of delivering S + D = 20V. 

During the addressing frame, the liquid crystal 
director for the ON pixels in the addressed row 
will begin to decay as soon as the strobe voltage 
moves on to the subsequent rows. Thus, it is impor- 
tant that the slot time of the addressing scheme is 
arranged to minimize any flicker while at the same 
time the pixels are ready to display new informa- 
tion in the following frame; this occurs approxi- 
mately where 


pag a SO (6.58) 
2N 

In practice, the scheme shown above is unsuitable 
for long-term use because there is no DC balance 
to the waveform: this leads to eventual electrical 
breakdown of the liquid crystal and, if the image 
does not change, it can result in image sticking 
issues associated with the ionic conductivity. In 
practice, therefore, the waveforms will be inverted 
periodically to maintain a net zero DC voltage. 
Given the data changes from frame to frame, DC 
balancing can only be guaranteed if the voltages 
are inverted twice per frame. That is, ifthe slot time 
is halved, and the frame is divided into a positive 
field (+S, +D) and a negative field with (-S, =D). 
The slot time should not be made too short because 
the power dissipation of charging and discharg- 
ing the capacitances each field P, is approximately 
related to 


(6.59) 


Given that power should be kept low, inversion is 
usually done only once per frame, often randomly 
within the frame time to help ensure DC balance 
regardless of image pattern. 


6.3.5 Thin film transistors and active 
matrix addressing 


Contrary to the approach taken by Sharp who tar- 
geted the new market of portable electronic calcula- 
tors, RCA concentrated on replacing the cathode-ray 
tube for televisual display with an LCD from the 
outset [2]. The potential application of thin-film 
transistors as active elements in displays had been 
recognized since their invention at RCA in the mid- 
1960s [57]. By 1971, the RCA team believed [70] that 
active components were required to overcome the 
slow speed and poor electro-optic properties of the 
dynamic scattering mode device, proposing solutions 
using dual diodes, field effect transistors (FETs), and 
storage capacitors to ensure that the charge across 
the pixel remained constant across the frame. It was 
the team at Westinghouse [71] that were the first to 
implement this in practice, producing a TN 6 x 6 
matrix driven by CdSe TFT. In the UK, the team led 
by Hilsum at Royal Signals and Radar Establishment 
(RSRE) understood the handling, reliability, and life- 
time issues presented by CdSe and Te, and in 1976 
approached Spear and LeComber at the University 
of Dundee, who were working on the use of hydro- 
genated amorphous silicon (a-Si:H) for use in photo- 
voltaic cells. The Dundee team produced a-Si:H with 
electron mobility 1, of 0.4cm?/V-s and proved that 
the material was suited to address LCDs [72]. The 
Dundee and RSRE teams fabricated insulated-gate 
field effect TFTs onto a glass substrate using photoli- 
thography and fabricated the world’s first «-Si active 
matrix LCD in 1980 [73]. This was followed in 1982 
by a 240 x 240 a-Si TFT TN from Canon in Japan 
[74] and the immense ensuing effort and investment 
in Asia to bring TFT LCDs to the dominance of the 
display market that it enjoys today. 

Very high levels of multiplexibility are possible 
using thin-film transistors fabricated onto the rear 
plate of the LCD behind each pixel (or sub pixel in a 
color display). A 4k UHDTV, for example, has 2160 x 
3840 x 3=24.9M TFT per panel and is 2160 ways mul- 
tiplexed. Each TFT supplies charge to a (sub-) pixel 
to switch the liquid crystal to any of 256 gray levels. 
The TFT is an FET formed from thin metal-insula- 
tor-semiconductor layers as shown in Figure 6.12a. 
Most LCDs use a bottom-gate configuration, where 
the amorphous silicon is deposited onto the prepat- 
terned gate electrodes, n+ doped with hydrogen, and 
source and drain electrodes patterned on top of this. 
The electrodes are usually made from aluminium, 
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(a) Basic construction and operation of a TFT. (b) Current-voltage transfer characteris- 


tics for the original Dundee/RSRE «a-Si:H TFT [72], together with contemporary results for a-Si:H [74], 


LTPS [75], and IGZO TFT [76]. 


chromium, or molybdenum. With a negative volt- 
age applied between the gate electrode and the drain- 
Vop electrons are depleted from the semiconductor 
at the interface with the insulator, preventing current 
between source and drain electrodes (Figure 6.12a 
ii). When a positive-bias V,p is applied to the gate, 


however, electrons accumulate at this interface allow- 
ing current flow from source to drain I,, when the 
drain is positive and from drain to source when the 
drain is negative (Figure 6.12a iii). Figure 6.12b shows 
the transfer characteristic originally produced by the 
Dundee/RSRE team in 1981. 
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Figure 6.13 Cross section and plan views of a typical TFT-driven subpixel on a backlit color LCD. 


A schematic of how a TFT is used in a typical 
LCD pixel is shown in Figure 6.13. The LCD has 
an active matrix back-plane and a front plane with 
a single common electrode. For backlit color dis- 
plays, the front plate will also comprise the color 
filters, with a black matrix to ensure sufficiently 
high contrast, prevent color leakage, and shield 
the TFT from incident light and unwanted photo- 
induced charge generation. The TFT plate has row 
and column metal bus lines that allow the TFT to 
be addressed, with the row electrode connected to 
the transistor gate and the columns the transistor 
source. The ITO electrodes that form the pixel are 
connected to the drain electrode. The TFT is also 
protected using a SiOx barrier layer onto which 
the liquid crystal alignment layer is deposited. The 
presence of bus lines, shielded transistors, and stor- 
age capacitor reduces the active area of each pixel; 
aperture ratios as low as 30%-40% are typical. This 


reduces the transmissivity of the panel and neces- 
sitates brighter backlighting to provide an attrac- 
tive appearance. It also negates the use of the color 
TFT LCDs in reflective mode because of the light 
losses associated with the polarizers, color filters, 
and the low aperture ratio. 

Addressing a TFT matrix is done line-by-line in 
a similar fashion to that described previously for 
passive matrix displays, but now the row and col- 
umn signals are applied to the same substrate, as 
shown in Figure 6.14a. From the «-Si:H TFT char- 
acteristic of Figure 6.12b, it is clear that applying 
+15Vop to the gate electrode allows current I, to 
flow between the source and drain due to the sig- 
nal voltage Vj, applied to the column, whereas 5V 
on the gate turns the TFT off and prevents current 
from flowing. With the scheme of Figure 6.14a, the 
transistors in each of the unaddressed rows (rows 
n—1 and n + 1) are turned OFF by the negative 
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Figure 6.14 (a) Active matrix addressing of an LCD panel and (b) equivalent circuit for the pixel 
including the TFT parasitic losses, line losses, and the conductance of the liquid crystal. 


signal applied to the gate bus lines, whereas the 
transistors in the addressed row (row n) are opened 
using +15VGp to the gate electrodes and switched 
ON. Synchronously, data voltages are applied to the 
source bus; there is no current for any of the OFF 
rows, but the pixels on the ON row are charged to 
the appropriate voltage by the current I,,. After suf- 
ficient time for the pixel to charge to the new level t, 
the gate is reclosed (with-5V,)) and the gate pulse 
(+15Vop) scans to the next row and new data on the 


source columns. The frame is completed when all N 
rows have been scanned, after the frame time TN. It 
is important to ensure that the liquid crystal mate- 
rial is exposed to the minimum net DC over sev- 
eral frames. To help ensure this, the polarity of the 
data signal is reversed, using frame inversion, scan- 
line inversion, column inversion, or pixel inversion 
(shown in Figure 6.14a). 

To understand some of the important design 
rules for a TFT LCD, consider the example of a 
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16.3in. QSXGA monitor. It has 2048 x 2560 pixels 
at 202 dpi, with the RGB subpixels on the columns 
(to maximize the time required to address each 
row). Such a panel typically has an aperture ratio 
of 27% and operates at 60 Hz (t = 8.11). That is, 
the subpixel dimensions are about 22m x 65m 
and a pixel capacitance C,, of 0.16 pF (assuming a 
cell gap of d= 4 um and permittivity €,. = 5). 

First, the pixel should not fully discharge within 
the 16.7 ms frame time due to the off current of the 
transistor. Assuming an exponential decay and a 
1% voltage tolerance gives the condition [78]: 


V. 
Torr = Circ: Rore = C—2>2001N. 
Isp 


(6.60) 


The a-Si characteristic from Figure 6.12b shows 
that I, is 20pA when the TFT is OFF and 10 pA 
when ON. Hence, Equation 6.60 suggests that the 
TFT had a sufficiently low OFF current to drive a 
pixel of capacitance 6 pF, easily sufficient for the 
40 pF pixels used in the original work of Reference 
[74], but is far too leaky to drive the 0.16 pF of a 
modern QSXGA monitor. Simply employing a 
storage capacitance C, in parallel with the pixel, 
as shown in Figures 6.13 and 6.14, increases the 
capacitance to the desired level and ensures that 
the pixel remains charged throughout the frame. 
Similarly, I,, in the ON condition for the high- 
est signal level Vs, should be sufficiently high to 
charge the pixel to at least 99% of that voltage level 
within the addressing time t, which occurs when 
Ton is 10% of the addressing time T: 


V. 
Ton =C- Ron =(Cs+Cic) >> < 0-11. (6.61) 


SD 


Equations 6.60 and 6.61 can be combined to 
find the maximum number of lines that can be 
addressed by a TFT, N,,,. with a given ration of 
ON to OFF currents: 


ax? 


1 Isp(ON) 


= 6.62 
2000 Isp (OFF) Kee?) 


max 


The characteristic of Reference [73] shown in 
Figure 6.12b suggests that the original TFT could 
address up to 250 lines (as was claimed) but 
required improvement to address the 2048 rows of 


the monitor display. Some of the improvements to 
TFT performance made to achieve such high levels 
of multiplexing, 256 gray levels and high contrast 
ratios are outlined in the following discussion. 
The equivalent circuit for a more realistic pixel is 
shown in Figure 6.14b. The TFT includes an inherent 
resistance for the semiconductor Rp, and the capaci- 
tance per unit area of the gate C,.. Also important 
is the conductivity of the liquid crystal G,, and the 
resistance of the bus lines Ry. The simplest model 
for FETs predicts that for sufficiently high gate volt- 
ages, well above the transistor threshold V,, and drain 
voltage V,,, the current I,, is given by Reference [78]: 


WwW 1 
Isp = WeCes L (Veo Vin Ve Yeo (6.63) 


where J, is the electron mobility, W is the width of 
the TFT channel, and L is the length, as defined in 
Figure 6.13. The TFT threshold is typically about 
+3 V and is directly related to the charge density of 
free electrons ny: 


ste (6.64) 


where the gate capacitance per unit area Cg: 


oe we 
GS (6.65) 


and d, is the thickness of the SiN, insulator layer 
(Figure 6.13). The a-Si leakage resistance domi- 
nates the OFF current: 


(6.66) 


Substituting Equations 6.63 and 6.66 into Equation 
6.62 gives the following relationship: 


Nomax ~ WeCosRos- (6.67) 


That is, achieving the low OFF current is done 
by reducing the transistor width to length ratio 
WIL, but the high ON to OFF current ratio, and 
hence maximum number of lines that can be 
addressed, is achieved by reducing the thickness of 
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the insulating and semiconducting layers, leading 
to a typical C,. of about 0.1 F/cm?. A low insula- 
tor thickness has the additional benefit of reducing 
the area of the storage capacitor and correspond- 
ingly increases the aperture ratio. Figure 6.12b 
also includes the characteristic of a TFT used for 
modern LCD panels, such as the QSXGA monitor 
described earlier (where W/L = 10ppm/6 pm). 

Decay of the voltage across the pixel is not just 
related to the leakage current of the transistor and 
Equation 6.60: loss of charge across pixel occurs 
if the conductivity of the liquid crystal G,, is too 
high. This is quantified by the voltage holding ratio 
(VHR), which represents the time it takes for the 
pixel voltage to decay to 50% [79]: 


2™N 
VHR = [Bech e Rone } (6.68) 


2TN 


Ionic impurities in the liquid crystal must be 
minimized to maximize VHR and hence main- 
tain a high display contrast. This cannot be done 
with nitrogenated compounds, preventing the 
use of highly polar materials such as the cyano- 
biphenyls. Instead, perfluorinated compounds 
are essential. Although mixtures produced from 
such compounds have a much lower Ag, this is 
compensated by the low viscosity and hence fast 
switching speeds that can be achieved with these 
materials [79]. 

The pixel aperture ratio and hence the trans- 
mission efficiency of the backlight is dictated by 
the target display resolution, the area of the panel 
(due to the losses caused by Rgys) and the fabrica- 
tion tolerances. A typical TFT is fabricated as fol- 
lows [78]: 


1. The gate metal, usually Cr or Mo, is sputtered 
onto clean glass to a thickness of about 200nm. 

2. The first set of electrodes are wet etched: photo- 
resist is printed over the glass area and exposed 
through a large area chrome mask placed with 
very high accuracy using a mask aligner. This 
is then developed to form the gate electrodes 
and bus line, and the bottom electrode of the 
storage capacitor. 

3. Plasma-enhanced chemical vapor deposition 
(PECVD) is used to deposit 400 nm of Si,N,, 
followed by 130 nm of intrinsic a-Si and 50nm 
of n* a-Si:H. 


4. The source, drain, and storage capacitor elec- 
trodes are then formed by sputtering Cr over 
the surface and wet etching the appropriate 
patterns, again using a wet etching process and 
mask aligner. 

5. The chrome electrode then acts as a self- 
aligned mask for plasma etching of the n* 
a-Si:H to complete the source and drain elec- 
trodes (often using an etchant stopper). 

6. The remaining intrinsic a-Si:H is plasma 
etched away using a third mask and mask 
aligner step, thereby forming the TFTs. 

7. The ITO is sputtered and plasma etched to 
form the pixels using a fourth mask and align- 
ment step. 

8. The last step uses PECVD to deposit the 
350 nm of SiN, or SiO, barrier layer and uses 
a fifth mask to provide access to the bonding 
pads. A mask aligner is not usually required for 
this step that is far less critical than the previ- 
ous photolithographic steps. 


Each of the four critical mask alignment steps needs 
very high resolution to prevent panel variability. 
These steps are expensive, and various attempts 
to reduce the number of mask steps have been 
attempted, including the use of back-to-back diodes. 
High tooling costs lower the design flexibility and 
so TFT panels tend to be available only in a range 
of standard sizes and resolutions. If the requirement 
is for nonstandard size in a niche market, the end- 
user may need to choose a passive matrix approach, 
which is why the market for passive matrix displays 
remains strong despite the poorer performance. For 
a given display diagonal, the aperture ratio decreases 
linearly with increasing resolution. Various other 
pixel designs to that shown in Figure 6.13 have been 
suggested, often involving alternative placement of 
the storage capacitor. However, very high display 
resolutions require a different approach, such as the 
use of low-temperature polysilicon (LTPS). 
Polycrystalline silicon (p-Si) was among the 
first semiconductors to be used for LCDs [80] and 
found in the first applications for TFT by Canon 
as the watch used in the 1983 film Octopussy and 
Sharp’s 1991 hang-on-the-wall TV [7]. The mate- 
rial has a high mobility of 200-400 cm?/V‘s, which 
is intermediate between the 1.5cm?/V-s of amor- 
phous silicon and 1400cm?/V-s for crystalline. 
Such high mobilities allow far smaller transistors, 
higher ON currents [particularly important for 
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organic light emitting diode (OLED) displays], and 
potentially integrating the display drivers onto the 
glass itself. This latter advantage potentially leads 
to significant overall cost savings because the driv- 
ers would be produced in the same process steps 
as the pixel TFT. The problem with producing 
p-Si TFTs was the very high processing tempera- 
tures, requiring those early demonstrators to be 
produced on quartz substrates. In the mid 1980s 
[81], LTPS TFTs were fabricated using excimer 
laser annealing of the a-Si to form the polycrys- 
talline structure while keeping the processing 
temperature to 260°C, equivalent to that used for 
a-Si. Today, many smart phones benefit from the 
excellent properties of LTPS, which allows reso- 
lutions above 400 dpi and better battery life due 
to the reduction in backlight power that the high 
aperture ratio allows. However, the cost of LTPS is 
high because the fabrication of the top-gate tran- 
sistors required uses 9-11 critical mask steps: this 
typically adds about 20% cost to the panels over 
equivalent a-Si LCDs. 

Together with other disadvantages such as high 
leakage current, the high production cost of LTPS 
has a driven research into other semiconductors, 
including various metal oxides. ZnO is particularly 


interesting because it retains a high mobility and 
combines a very high ON to OFF state current ratio 
with optical transparency over the visible region. 
Recently, the 1:1:1 combination of indium gallium 
and zinc oxide (IGZO) [77,82] to form active ele- 
ment has been put into production by Sharp. A 
50nm amorphous IGZO layer is nitrogenated to 
form the n+ doped semiconductor with a mobility 
of 1. = 10cm?/V-s. Figure 6.12b includes a compari- 
son of the TFT transfer characteristics for 0-IGZO 
against both LTPS and a-Si. Although IGZO has 
a lower mobility than LTPS, it combines a very 
high ON/OFF current ratio (>10°) with the same 
low-cost fabrication designs of a-Si. It is likely 
that IGZO will play a leading role in future display 
devices, as displays continue to move to ever higher 
pixel contents, resolutions, and power efficiencies. 


6.4 STANDARD LCD MODES 


6.4.1 Overview 


There is a wealth of different liquid crystals modes 
possible, some of the important ones being shown 
in Figure 6.15. Some modes have niche applica- 
tions, such as the cholesteric temperature sensor or 
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Figure 6.15 Dendrogram of the important LC modes. The LCD mainstream described in Section 6.4 is 
indicated by the darker grey coloration. Abbreviations: PDLC: polymer dispersed liquid crystal; ECB: 
electrically controlled birefringence; VAN: vertically aligned nematic; IPS: in-plane switching; MVA: 
multidomain vertically aligned; PVA photoaligned vertical alignment; ASM: axially symmetric mode; 
FFS: Fringe field switching; STN: supertwist nematic; SSFLC: surface stabilized ferroelectric liquid 
crystal; SBE: supertwisted birefringence Effect; F-STN: film/foil compensated STN; ABD: azimuthal 
bistable display; ZBD: zenithal bistable display; OCB: optically compensated bend-mode. 
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the ZBD and these will be described in Section 6.5. 
Modes that have contributed to the mainstream 
LCD markets from the early watches and calcula- 
tors to today’s full color video rate displays are cov- 
ered in this section. 

The evolution of the modern LCDs used in 
mobile phones, computer monitors, and HDTV has 
been undertaken in several stages, exemplified by 
listing the major advances, as in Table 6.4. After the 
basic principles had been evaluated, as described 
in the previous sections, the most important step 
was to increase the complexity of the display to 
allow hundreds of lines of information to operate 
at video frame rate, with gray scale and color. By 
the mid-1990s, the technological steps to achieve 
this had largely been satisfied through the adop- 
tion of the TFT active backplane. At that point, the 
race moved from complexity to appearance and in 
particular improving poor viewing angles. Several 
new modes were developed, each finding market 
success for different manufacturers, first allowing 
LCDs to replace CRT in computer monitors and 
in 2007, surpass CRT sales for TV. However, the 
war shifted to new battle grounds and yet further 
improvement of appearance, including ultrahigh 
resolution and wide-color gamut drive the compe- 
tition in the mobile telecommunications market- 
place. Before describing the individual modes, it is 
apposite to discuss the technological and market 
forces that dictated each shift in stage. 

Following the discovery of the Alt-Pleshko 
multiplexing limit for passive matrix addressing 
[69] maximizing display content was the key driver 
for the LCD industry throughout the 1980s and 
early 1990s. Several approaches were taken: 


1. Increasing the gradient of the LC transmission- 
voltage response, either through material 
improvement (controlling the elastic constant 
ratio k,,/k,,) or new LCD modes (STN). 

2. Designing bistable LCD modes, where pixels 
no longer respond to the RMS signal over the 
frame, but are rapidly latched between the ON 
and OFF states and remains in the desired state 
after the signals applied. 

. Introducing a nonlinear element, such as TFT, 
that isolates and delivers the appropriate signal 
to each pixel. 


Ow 


The first of these approaches was successful for low- 
cost, black and white STN displays but was limited 


by the slow frame time, low number of gray levels, 
and a maximum of 480 multiplexed lines. Many 
novel methods for making an LCD inherently 
bistable were invented at the end of the 1970s and 
beginning of the 1980s [101]. The most promising 
bistable technology was the surface stabilized fer- 
roelectric LCD (SSFLCD), invented in 1980 [88], 
launching an immense worldwide effort for the 
next 15 years in a two-way battle between the FLC 
and TFT approaches. The complexity of TFT fabri- 
cation seemed to promise higher costs over the pas- 
sive matrix bistable approach. However, despite the 
launch of a color Surface Stabilised Ferroelectric 
Liquid Crystal (SSFLC) monitor by Canon in 1992 
(Figure 6.16d), the TFT-driven TN display was 
already beginning to enable the image quality 
required for a new market: the laptop. Early laptop 
screens were black and white only (such as Apples’s 
MacIntosh Powerbook 100, released in October 
1991) but the sensitivity of SSFLC to shock made 
the technology unsuited to portable products. 
Although there was no superior alternative to the 
TFT TN LCD for laptop displays, the competition 
for the SSFLC monitor came from the superior per- 
formance and much lower cost of the CRT, and well 
before the end of the 1990s, the SSFLC development 
effort targeting mainstream displays was minimal. 

Buoyed by early commercial success from lap- 
tops, the TFT TN would attract sufficient invest- 
ment to explore new modes, targeting shifting the 
incumbent CRT from the monitor, and eventually 
TV markets. The use of TFTs brought with it a new 
and very important advantage: the flexibility of 
LC mode design. Modes that had previously been 
impossible to multiplex became potentially use- 
able. In the late 1990s, modes such as IPS [92,93] 
and multidomain vertically aligned (MVA) nem- 
atic [94,95] offered sufficiently wide viewing angles 
to displace the CRT. By the beginning of the new 
millennium, the LCD was dominant from watches 
and calculators to large area monitors [102], as 
shown in Figure 6.16. 

The battle for the television market was com- 
plicated by another new technology: the plasma 
display panel (PDP). As an emissive display, the 
viewing angle and contrast of PDP could not be 
surpassed by LCDs. This gave PDP an early lead in 
the flat-panel television market. However, ongoing 
LCD improvements, such as the invention of the 
fringe field switching (FFS) mode [96,97], and the 
inability to fabricate high-resolution PDP below 


Table 6.4 Significant advances on route to LCD market dominance 


Year 


1967 
1971 
1971 


1971 


1973 
1973 


1973 


1973 


1974 
1979 


1980 


1982 
1984 
1987 
1992 


1997 
1998 
2000 
2001 
2004 
2010 


Invention 


Dynamic scattering mode 
Active matrix addressing 
Twisted nematic mode (TN) 


Vertically aligned nematic mode (VAN) 


Cyano-bipheny! nematics 

Formulation of wide temperature range eutectic 
LCD for TN 

In-plane switching (IPS) proposed 


First commercial LCD products in calculator and 
wristwatch 

Defect-free TN 

First amorphous silicon TFT used to address LCD 


Bistable surface stabilized ferroelectric liquid 
crystal (SSFLC) 

Supertwist mode (STN) 

Supertwist birefringence effect (SBE) 

Foil compensation for STN 

IPS mode 


Multidomain vertically aligned mode (MVA) 
Fringe field switching mode (FFS) 
Patterned vertical alignment mode 

Axially symmetric mode (ASM) 

IGZO TFT invented 

OD enhanced color backlights 


Protagonists 


Heilmeier et al., RCA; US 

Lechner et al., RCA; US 

Schadt and Helfrich, BBC; CH. 

Fergason, U. Kent; US 

Kahn, Bell Labs; US 

Schiekel and Fahrenschon, AEG-Telejunken; W. 
Germany 

Gray et al., Hull, RSRE, BDH; UK 

Raynes, RSRE; UK 


Kobayashi, U. Tokyo; JP 

Soref, Sperry; US 

Wada et al., Sharp; JP 

Seiko; JP 

Raynes and Waters, RSRE; UK 

Spear and Le Coomber, U. Dundee; UK 
Hilsum, Hughes, RSRE; UK 

Clark and Lagerwall, U. Gdteborg; SE 


Raynes, RSRE; UK 

Scheffer and Nehring, BBC; CH 
Katoh, Asahi Glass; JP 

Baur et al., Merck; DE 

Kondo et al., Hitachi; JP 

Koike and Okamoto, Fujitsu; JP 
Lee et al., Hyundai; KR 

Kim et al., Samsung; KR 
Yamada, Ishii et al., Sharp; JP 
Nomura et al., Tokyo Institute of Technology; JP 
Jang et al., Samsung; KR 
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Figure 6.16 State-of-the-art LCD at the turn of the millennium. (a) Watches and calculators using the 
reflective TN, (b) a dual display phone and PDA using foil compensated STN, (c) a laptop computer 
with TFT-driven TN, (d) a 14in. monitor display based on bistable SSFLCD, and (e) one of the first IPS 


mode TFT monitors [100]. 


40 in., led to the eventual triumph of LCDs in this 
most important of markets too. 

Coincident with LCD domination of the TV 
display market, the first i-Phone was launched in 
June 2007, heralding the age of the smart phone. 
The goal of providing resolution at the limit of 
visual perception helped drive LTPS into the mar- 
ketplace. However, LTPS also had the advantage of 
providing higher ON currents than was possible, 
thereby meeting the requirement for another emis- 
sive display technology: OLED. Not only do OLED 
surpass LCD for contrast and viewing angle but 
also for color saturation. Again, the adaptability 
of LCD technology provides a solution: replacing 
the cold cathode backlights with a blue LED and 
adding a film containing red and green quantum 
dots (QD), provides sharper colors than is pos- 
sible using color filters alone [100]. Once more, 
LCD performance increases to meet the market 
challenge. Midway through the 2010s, OLED is 
making grounds for portable displays, where fea- 
tures such as form factor and weight also play an 
important role but LCDs continue to dominate in 
all other markets (Figure 6.17). Will OLED eventu- 
ally replace the LCD altogether? At each stage of 
LCD evolution, new markets have been enabled by 


particular LCD modes. Many of these modes have 
retained their market share despite more advanced 
options become available. Whether it is the simple 
TN in watches and calculators, the STN for instru- 
mentation, or TFT-driven TN for low-cost moni- 
tors, the successful technologies become difficult 
to supplant once established. 


6.4.2 Dynamic scattering mode 


The first operating liquid crystal device was an 
electro-optic shutter, devised by Heilmeier’s prede- 
cessor at RCA, Richard Williams, in 1962 [11,103]. 
Williams showed that strong turbulence could be 
induced in a roughly planar sample of a negative 
Age nematic with a DC or low-frequency AC field. 
This turbulence was induced by ionic flow in the 
liquid crystal disrupting the liquid crystal, thereby 
causing strong optical scattering in the birefringent 
medium. Heilmeier used this switching between 
scattering and nonscattering states in RCA’s first 
display demonstrators in 1968. Unlike most of the 
other modes described, dynamic scattering mode 
devices did not need polarizers, operating in a 
20-100pm spaced cell at typically 30-50V. The 
onset voltage for scattering is approximately [104] 
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Figure 6.17 History of the mainstream of LCD from the calculator to QD 4k FHD TV. 


v,eT oe, 
E 


(6.69) 


where 1 is the bulk flow viscosity, 1 is the ion 
mobility, and T is a constant typically about 100. 
For n = 0.5Pa-s and p = 10-*cm?/V-s then V, is 
about 5-10V. The resistivity of the liquid crystal 
could be made lower than 10 GQ.m through delib- 
erate addition of mobile ionic impurities. The orig- 
inal displays of Heilmeier used DC fields to induce 
the ion flow, presumably because of the desire to 
address the devices using transistors. However, the 
lifetime was too short due to electrostatic break- 
down effects and so the first commercial devices 
used low-frequency AC instead. 


6.4.3 The twisted nematic LCDs 


6.4.3.1 INTRODUCTION 


By 1980, practically all commercial LCDs were 
based on the TN. This device set the benchmark for 


future developments, operating using alignment 
layers to obtain a uniformly aligned director pro- 
file, a cell gap of a few microns sandwiched between 
polarizers either side. The electro-optic effect is the 
basic Fréedericksz transition described in Section 
6.3.3, wherein the director responds to the RMS 
voltage coupling to the dielectric anisotropy affect- 
ing a change to the birefringence profile through 
the device, and hence altering the polarization 
state of the transmitted light. Each of the remain- 
ing devices described in this section uses these 
principles (although the FLC is a field effect device, 
with a polar coupling of the field to the ferroelec- 
tric spontaneous polarization). The TN uses a posi- 
tive Ae material, and usually a 90° twist between 
top and bottom surfaces, with either crossed or 
parallel polarizers, to operate in normally white 
(NW) or normally black (NB) modes, respectively. 
Applying a voltage three or four times greater than 
the threshold (ie., applying typically 5V) causes 
sufficient reorientation of the director in the bulk 
of the cell to cause the NW cell to appear dark, and 
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the NB cell bright. TN may be used in transmis- 
sion, usually driven by TFT and including color 
filters, or may be used as a two-polarizer reflec- 
tive mode. In this latter device, the rear polarizer 
incorporates a diffusive reflector so that light inci- 
dent from the front of the panel is reflected back 
through the device for a second pass. This leads to a 
much higher contrast ratio, since any light leakage 
in the dark state from the first pass is dramatically 
reduced on the second. However, it also means that 
the coloration of the white state is more important. 
Also possible are single-polarizer reflective modes, 
as described at the end of this section. 

A key part of any display design is to ensure 
uniformity of texture for the ON and OFF states. 
For a 90° TN this means ensuring that degeneracy 
of both twist and tilt are removed [58]. For twist 
alone this is simply done using slightly uncrossed 
alignment directions or, preferably, by inducing a 
natural sign of twist with the addition of a small 
amount of cholesteric to give a pitch of a few hun- 
dred micron. Removing tilt degeneracy requires 
that there is pretilt on both alignment surfaces. 
However, it remains essential that the pretilt and 
sign of twist are matched, so that there is minimal 
splay from one surface to the other when under- 
going the correct twist. Otherwise, reverse tilt and 
twist domains may still form as the director relaxes 
to the quiescent state after switching. 


6.4.3.2 THE OFF STATE AND TN DESIGN 


The polarization optics from a uniformly aligned 
birefringent liquid crystal cell was calculated in 
Section 3.2. It was shown that the polarization state 
after transmission through multiple elements is cal- 
culated simply by multiplying the appropriate Jones 
matrices together. The TN has a director twist angle 
that varies linearly from one surface to the other. 
Thus, it can be described as a series of very thin 
birefringent retarders, each with a slightly different 
orientation angle. Slicing the device into N layers, 
the retardation and orientation of each is given by 


T — 


aR 5Ojn= o;+ 


5 N-2, 
N- 


i= 0,1,2,. 
(6.70) 


where ® is the total twist angle. The overall Jones 
matrix M is the multiplicative sum of each of these 
elements M;: 


m=] (206) -R(6)) =] [(a-6)--2(6,)) 
au ‘i 
=R(-o)|| © 2s (2) 
0 e 2N 
Ae Lc be 
eee die 
= R(-®) a : N ; 


(6.71) 


As N tends to infinity, then each retarder becomes 
infinitesimally thin, and [105] 


x_ iL sinX sinX 
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Solving for the transmission 


=| cosxcos(®+4,— )+ <sinXsin(+6,— of 


ie [-(2) Joo (®-6, -») 


(6.73) 


where the input and output polarizer directions 
with respect to the input director are ¢, and ¢,, 
respectively. As expected, Equation 6.73 reduces to 
the transmission of a retardation plate, Equation 
6.25 if the overall twist ® = 0, and crossed polar- 
izers are used ( = #, and ¢, = b, + 90°). 

For the NW mode TN with ® = 90°, the polar- 
izers are crossed (s, = , +90°) and oriented with 
either ¢@, = 0° or 90°. The transmittance from 
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ane Me m=1,2,3... (6.76) 


Equation 6.73 then simplifies to the Gooch-Tarry 
expression [106]: 


2 
a (n) including the first minimum at An-d=1/2V3 
2 and second at An-d=1/2V15 4. At any one of 


nr 
T=1 2An-d\. (6.74 these minima (or maxima for the NW, although 
( nN } the term minima is used conventionally regardless 
of the polarizer orientations), the transmission is 


least sensitive to changes in retardation (including 
and the value of And is chosen to ensure that the viewing angle and cell non-uniformity) and polar- 
right-hand term is zero for 4 ~ 550nm, so that the _ jzer orientations. As the retardation is increased 
TN appears white. If the polarizers are kept paral- (j.e., for high cell gaps) successive minima are 
lel instead, then the transmittance becomes increasingly less sensitive to retardation changes 
until the Mauguin condition is approached at the 
42 2An-d\ high retardation limit [45], where polarized light 
om lay I+ d transmitted by the cell is rotated through 90° 
(6.75) regardless of cell gap, wavelength, temperature, or 
polarization angle. 


T — 2 
2An-d 
: ( iN } The choice of retardation is dictated by fabrica- 
tion limitations on cell gap, the operating speed that 
and the TN 46 WR at the ine eiardation condi. MPeamured, the coloration of the white state and the 
tions. Equations 6.74 and 6.75 are plotted versus @duited contrast. Response times depend on the 
retardation in Figure 6.18. There is minimum with square of the cell gap, through Equation 6.48. This 
NB: maximus with NW. where the twisted struc- usually limits practical operation to either the first 
ture of the birefringent material transmits linearly second minimum condition. The wavelength 
polarized light that is orthogonal to the input dependences for fir ae and second aad 
polarization and perpendicular (for NB, parallel if devices are shown in Figure 6.19. Again, choosing 
NW) to the output polarizer. This occurs at a series the wavelength to occur at the peak eye response 
of ) = 550nm, the first and second Gooch-Tarry 
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Figure 6.18 The Gooch-Tarry curve for the quiescent transmission of a 90° twisted nematic display 
versus retardation, for NB mode (parallel polarizers) and NW mode (crossed polarizers). 
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Figure 6.19 Choices of white state and cell gap for the TN. (a) The Gooch-Tarry curve for the 90° and 
75° twist angle TN. (b) Theoretical wavelength dependences of the transmission for a material with a 
typical birefringence (An =0.15) for 90° TN cells operating at the first minimum, first maximum, and 
second minimum and for a 75° TN with uncrossed polarizers at 85°. The cell gaps are also indicated. 


minima for a TN filled with 5CB are 2.64m and 
5.9 um, respectively. Most TFT LCDs operate at the 
first minimum, using lower birefringence materi- 
als, An = 0.1 and d = 4.5m because this cell spac- 
ing is conducive to a high manufacturing yield 
and the switching speed is satisfactory. If switch- 
ing speed is the priority, rather than use higher 
birefringence materials with the concomitant 
increases of viscosity, an alternative approach is to 
use a 75° twist [67]. The first minimum peaks at a 
lower transmissivity than for a 90° cell (98.4%) but 
this can be corrected by reorienting the polarizers 
through 7.5°, as shown for the results in Figure 
6.19. Typically, the cell gap for the 75° TN mode is 
80% that of the standard 90° TN, potentially lead- 
ing to a 50% speed increase, provided the reduced 
cell gap remains suitable for manufacture. 

For each of the modes, it is important to con- 
sider the coloration of the white state. For a back- 
lit color TN, this is less important than for a black 
and white reflective device because any loss of 
white color balance can be compensated in the 


addressing signals, back light spectrum, and color 
filters. For a dual-polarizer reflective display, the 
second pass of the light through the cell exagger- 
ates any unwanted coloration. 

Figure 6.19b shows the theoretical wavelength 
dependences in transmission for four TN modes. 
Converting these spectra to Commission interna- 
tionale de l’éclairage (CIE) 1931 x and y color coor- 
dinates, as shown in Figure 6.20, gives a locus of 
white hues depending on the retardation and the 
coloration of the polarizers [107]. Two loci are plot- 
ted, corresponding to a standard polarizer centered 
on green and a slightly bluer variety. These polar- 
izers represent the range of colors that can be tar- 
geted while retaining a black OFF state; narrower 
spectrum polarizers can also be used to modify the 
ON state coloration, but with a noticeable color- 
ation of the dark state too. Also shown on the chart 
are the coordinates for the D65 standard, repre- 
senting a target for the ideal white state. Operating 
at the first minimum tends to give a greenish hue 
to the white state, whereas the second minimum 
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Figure 6.20 Central region of the CIE color chart (shown in full on the right) indicating the region 
of white transmission for a dual-polarizer reflective mode TN. Two loci are shown, corresponding to 


polarizers with a slightly green tint (continuous line) 


and a slightly blue tint (dashed). The D65 white 


state reference is shown as a target for pure white. The location on the color chart of the first min, 


second min, and first maximum are indicated. 


gives an improved white due to the increased blue 
transmittance shown in Figure 6.19b. Thus, a sec- 
ond minimum TN is preferred for applications 
where the response time is unimportant (being five 
times slower than the equivalent first minimum 
mode) but the attractive bluish white is preferred. 
Alternatively, an intermediate retardation close to 
the point of the first maximum (ie., at An-d/A = 
2/22) has been used [107] to produce a neutral 
white as close to the D65 standard as possible, 
particularly when combined with the slightly blue 
tinged polarizer. The optimum cell gap for this is 


1 
d= aay Os5VI5 +0.68V3 \; (6.77) 


where the blue light is close to the second mini- 
mum and red wavelengths close to the first mini- 
mum and the green transmittance is decreased 
somewhat (Figure 6.19b). 

The other important consideration for the OFF 
state is the viewing angle. Figure 6.21a shows a 
polar plot of transmissivity for the azimuthal and 
zenithal directions for a NW first minimum TN 
operating in transmission. The OFF-state view- 
ing angle is good: light at any azimuthal angle 
experiences the same retardation profile due to 


the twisted structure and deviations of zenithal 
angle away from the display normal (shown as 
the central point) represent reduction of the effec- 
tive retardation to which devices operating at the 
Gooch-Tarry minima are insensitive. 


6.4.3.3 THE ON STATE AND OPTICAL 
COMPENSATION 


The ON state transmission is very viewing angle 
dependent. This is evident from the contrast ratio 
of OFF/ON transmission shown in Figure 6.21b 
because contrast is dominated by the dark ON 
state transmission. The blackest ON state occurs 
when the light is transmitted parallel to the liquid 
crystal optic axis, the direction where the liquid 
crystal behaves as an isotropic medium and any 
light transmitted is due to leakage of the polariz- 
ers. Contrast is highest for on-axis light because 
this is the direction of the applied field toward, 
which the director reorients, although it remains 
affected by the distorted regions close to the sur- 
face. Off-axis light experiences a retardation witha 
direction dependence that is related to the director 
profile through the cell, which in turn depends on 
the applied voltage. 

The Fréedericksz threshold voltage for a TN is 
given by 
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Figure 6.21 Measured viewing angle characteristics of a first minimum NW TN operating in transmis- 
on: ‘ : 


sion, showing polar plots of (a) OFF state transmissivity and (b) OFF/ON contrast ratio; (c) schematic 
of the TN with low, intermediate, and high voltages; (d) twist and tilt director profiles for the TN 
versus voltage. 


2 7 for a standard ® = 90° TN. Often, k,, ~ k,,/2 in 
Vo= ky + ®' (kos ~ 2k) , (6.78) practice, and so the Fréedericksz threshold is simi- 
Ege lar to that of the ECB mode, and typically around 

which simplifies to 


1V. Figure 6.21c shows schematically what hap- 
pens as the voltage is increased above V,, and 
numerical calculations of the response for typical 

—_ ky, +(k33 — 2k») /4 


elastic constants are given in Figure 6.21d. As the 
(6.79) voltage increases, the initially linear twist from 
€ Ae 


one surface to the other becomes increasingly 
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concentrated into the center of the cell where 
the director approaches the vertical condition, 
whereas regions of splay and bend are increasingly 
pushed toward the surfaces. Thus, the director pro- 
file approaches a vertically aligned central region, 
with twist-free splay-bend regions oriented at 90° 
to each other close to the two surfaces. 

The optical transmission through the cell is 
complex and requires numerical modeling. The 
voltage dependence of the normal incidence trans- 
mittance can be estimated by considering the mid- 
plane tilt of the director 6, [108,109]. Immediately, 


above V, this is 
®)| ks,” 
k tky—-k 
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and for the 90° TN cell it is 
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for the reasonable approximation k,, ~ 2k,,. Given 
that the surface pretilt is small, this expression rep- 
resents the linear part of the transmission-voltage 
characteristic above V, shown in Figure 6.9b. The 
multiplexibility of the TN is related to the steep- 
ness of this characteristic through the Alt-Pleshko 
expression (Equation 6.54). Thus, to ensure the 
highest number of lines that a passive-matrix TN 
can be multiplexed, the denominator of Equation 
6.81 should be kept low. This is done in practice 
using hybrid mixtures of polar and nonpolar com- 
pounds [110], in which k,,/k,, is lowered due to 
short-range order effects, offered a route to multi- 
plexing of up to 20 lines. 

Off-axis calculations of the transmissivity for 
the ON state certainly require numerical model- 
ing. However, the form of the contrast ratio curve 
shown in Figure 6.21b is anticipated by considering 


Ks” 
ky Ae 


+ 
4k, +[ks3 2k] €1 


(6.81) 


the director profile of the high voltage state as 
a vertically aligned nematic (VAN) in the bulk, 
with two orthogonal hybrid aligned states in the 
surface regions. The directors in these two regions 
are oriented parallel to the polarizers, at azimuthal 
directions 45° and 135° in Figure 6.21b, and hence 
appears dark at all angles. This understanding led to 
pronounced improvements of transmissive mode 
TN viewing angles through optical compensation. 
Discotic liquid crystals are similar to the standard 
rod-like calamitic mesogens used in displays but 
exhibit negative birefringence. Combining discotic 
and calamitic layers with equal but opposite retar- 
dations leads to a net optically isotropic medium, 
black at all angles when between crossed polariz- 
ers. Figure 6.22 shows how this works in practice 
[111,112]. Polymer film fabricated from a hybrid 
aligned discotic are laminated either side of the 
LCD with the optic axes parallel to the fully ON 
state director at the adjacent surface, and the splay 
antisymmetric. This leads to a greatly improved 
off-axis contrast ratio, as shown in Figure 6.22b, 
and helped lead to sufficiently good performance 
for TN displays to be used in laptops and monitors. 

Two-polarizer reflective mode TN devices 
are usually used in low-cost applications such 
as watches and calculators where the cost of 
extra optical compensation layers is prohibitive. 
However, the viewing angle characteristic of such 
devices is far more symmetrical than that of the 
transmissive device shown in Figure 6.21b, due toa 
self-compensating effect. Off-axis light that experi- 
ences a lower An from the director tilted toward it 
on the first pass through the device, experiences a 
correspondingly higher An when traversing in the 
other direction on the second pass after reflection. 

With all reflective displays, it is important to 
consider the illumination conditions; in particu- 
lar the color balance of the incident light and its 
degree of diffusivity. Indeed, the viewing angle 
characteristic can be considered both as an opti- 
cal output for diffuse illumination and as a means 
for ensuring the maximum light input. In practice, 
the LCD reflectivity is controlled to some extent by 
the diffusivity of the reflector and/or front polar- 
izer. Indeed, adaptions of the reflective layers can 
be used to deliberate trade-off viewing angle and 
reflectivity. The appearance of the display can be 
remarkably different when viewed by the diffuse 
light of a cloudy day or by the highly directional 
light on a sunny day or ina dark room with a single 
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Figure 6.22 Foil compensated TN. (a) Use of discotic foils to compensate the viewing angle of the 
OFF state for a transmissive TN LCD; (b) experimental measurements for the resulting viewing angle. 


light source. In the former case, the appearance of 
a scattering mode display will easily surpass that 
of the polarized LCD but the situation is reversed 
for directional lighting and viewing closer to the 
specular angle. 


6.4.3.4 SINGLE POLARIZER REFLECTIVE 
MODE TN 


Conventional two-polarizer reflective TN and 
STN LCDs have two further optical limitations. 
Parallax caused by the separation of the rear 
reflecting polarizer from the image plane by the 
thickness of the rear plate, leads to shadowing of 
the image when viewed off-axis. This can be dis- 
tracting for black and white devices but is severely 
detrimental to reflective color devices due to color 
leakage between subpixels. Second, the optical effi- 
ciency of the white state is relatively low because 
the light passes four times through the polarizers. 
The transmission of the highest quality LCD polar- 
izer is 43%. However, a further 5% is absorbed on 
each pass so that the maximum reflectance pos- 
sible with a two-polarizer reflective display is less 
than 28%, typically 23%. This is also a severe limi- 
tation for pixelated reflective color LCDs, where 


the color filters and decreased aperture ratio then 
lead to prohibitively low 7% reflectivity. 

Single polarizer reflective TN reduces these 
optical losses by using a front polarizer only and 
using an internal reflector on the inside surface 
of the rear substrate of the display to remove the 
parallax. The reflector is made slightly diffusive to 
scatter the reflected light in different directions, so 
that the OFF state has an attractive appearance and 
good viewing angle even when lit by a point source. 
The reflector can form the rear electrode as shown 
in Figure 6.23a. 

Single polarizer operation requires that the liq- 
uid crystal profile is designed to rotate the input 
polarized light through 90° after both passes 
through the liquid crystal, thereby being absorbed 
by the single front polarizer and appearing black. 
That is, the polarization is elliptically polarized 
after the first pass, the indicatrix is rotated on reflec- 
tion and the light becomes linearly polarized after 
the second pass orthogonal to the input polarizer: 
the display appears dark. For a positive Ae mate- 
rial, this will always correspond to the OFF state 
because in the ON state the director approaches the 
vertical condition and appears optically isotropic: 
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Figure 6.23 Options for single polarizer TN modes. (a) Schematic of single polarizer TN with internal 
diffusive reflector. (b) Solutions for the NB mode single polarizer TN. Continuous lines show the twist 
angle © for the first three solution sets to Equation 6.85, and the dashed lines show the correspond- 
ing polarizer angles $,. (c) The data of (b) replot as twist and polarizer orientations. The numbers indi- 
cated represent the retardation with 90° twist, whereas the untwisted cells give An-d/A = 0.25, 0.75, 
and 1.25, for each mode, respectively, each with , = 45°. (d) Calculated reflectivity for ® = 64°, 75°, 
and 90° twists, with polarizer angles set to give R=0 for m =1, 2, and 3, respectively. (e) Dark state 
transmission for ® = 64°, 75°, and 90° for a low An materials (0.09). 


no change to the polarization occurs on either pass 
and the light is transmitted. The theoretical curves 
shown in Figure 6.23 ignore losses: in practice the 
reflectivity may reach up to 35% in the single polar- 
izer modes. The image may be inverted and NW 


operation achieved by the addition of a quarter- 
wave plate between the polarizer and the liquid 
crystal, as shown in Figure 6.23a. 

Consider a single polarizer display using the 
simplest LCD geometry, the ECB mode of Section 
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6.3.3. Here, the liquid crystal acts as a switchable 
wave plate. Multiplying the Jones matrices for a 
polarizer, wave plate with optic axis at angle ¢, to 
the polarizer, and the same wave plate and polar- 
izer after reflection gives 


R=1-4sin‘ Tsin’29,, (6.82) 
where the retardation is approximately 
An-d 47 
Fe “ore (6.83) 


and 0 is the average director tilt and 6, is the polar- 
izer orientation measured from the input director. 
The OFF state appears dark (R= 0) if =, = ma/4 
(m= 1, 3, 5...). That is, at the quarter-wave plate con- 
dition light is circularly polarized with one handed- 
ness, which is swapped to the other handedness after 
reflection and no longer transmitted by the retarder. 
This mode suffers from poor dark state leakage and 
coloration, due to wavelength and angular disper- 
sion away from the quarter-wave condition. As for 
transmissive state devices, improved performance 
is achieved using the TN geometry. 

There are several options for single polarizer 
reflective TN operation [67], including twist angles 
ranging from 60° to 90°. Replacing the wave plate 
with the Jones matrix for a TN in the derivation 
above gives [113]: 


2 ‘ 2 
203 2a: 
R=|1- osin?y +(e) 
1+0@ 1+@ 


I(v 1+0° Jeosy -cos2Q, — siny -sin2Q, | , (6.84) 


where « = T/® and y = ®V1+’. This gives dark 
state solutions (R = 0) when the following two con- 
ditions are met simultaneously: 


(6.85) 


These solutions represent the cases where the liquid 
crystal is at the quarter-wave condition, rotating 
the polarization through 90° over the two passes 
and tends to those of Equation 6.82 as ® tends to 
zero (& ce), with d, = +0/4 and An-d/d\ = (2m + 
1)/4 for m= 0, 1, 2.... Figure 6.23b and c shows the 
solutions to Equation 6.85 for the first three orders. 
Consider the solution for , = 0, shown in 
Figure 6.23d and e: Equation 6.85 predicts that 
R =0 occurs when @ = +1, for which ® = x/(2V2) 
= 63.64° and An-d/d = 0.353. This is a rather low 
retardation; even if a commercial liquid crystal 
with the lowest birefringence available (An ~ 0.09) 
is used. This LCD mode requires a 2.2 1m cell gap, 
potentially lowering production yield and increas- 
ing cost. Second- and third-order examples are also 
listed in Table 6.5. Although the cell gap becomes 
more typical, the coloration of the dark state is far 
too high for use in a conventional display. Such 
modes, however, are useful for liquid crystal on sil- 
icon (LCOS) projection systems, where three pan- 
els are tuned to operate at the RGB wavelengths. 
The 63.6° single-polarizer TN [113] was suc- 
cessfully deployed in Nintendo’s Color Game Boy 
games console from 1998 to 2003. The display 
was manufactured by Sharp, who included ach- 
romatic retarders to invert the display to NW 
operation. This minimized dark state leakage, 
which is essential for any color display, whereas 
the resulting wavelength dependence of the white 
state was readily compensated through the color 
filters. Although the design gives the highest 
reflectivity possible with a polarized light mode 
LCDs, the introduction of color filters still leads 


Table 6.5 Examples of normally black solutions for single polarizer TN 


Order of Cell gap d (um) 
minimum Director twist ® (degree) Polarizer angle (degree) An-d/A (An=0.09) 
1st 63.6 0 (or 90) 0.353 2.2 
2nd 45 56.4 (or —33.6) 0.683 4.2 

60 63.6 (or —26.4) 0.606 3.7 
3rd 90 32.7 (or —57.3) 1.175 7.2 
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to a maximum reflectance of less than 10%. The 
approach taken by Sharp was to introduce pris- 
matic elements into the back reflector, thereby 
directing off-axis light into the viewing direction; 
the reduced viewing angle display was considered 
suitable for this single viewer application. The 
adoption of better performing backlights and high 
aperture ratio LTPS back planes eventually led to 
transmissive displays only becoming acceptable 
for color portable applications. Interestingly, full 
color reflective and transflective displays remain 
an area where no technology, liquid crystal, or oth- 
erwise, has yet met the performance required for 
market success. 


6.4.4 Supertwist nematic LCDs 


The TN device usually includes a trace amount 
of cholesteric dopant, simply to impart a natural 
handedness and prevent domains impairing the 
appearance. Increasing the cholesteric content so 
that the natural pitch of the chiral nematic P is 
much lower allows twists of greater than 90° to 
be reached [114]. The range of conditions for such 
“supertwist” states is [89] 


® 1id_.®@ i 1 
<—< + 
2xn 4 P 2n 4 


(6.86) 


or 


(1-2 )<p<(1+5), 


where B = 21-d/P®. The STN range of twist angles 
is 90° <® < 270°, which corresponds to differences 
in the rubbing direction ,-¢, of 


(6.87) 


o,-0, =®-7, (6.88) 
where it is important to ensure that the sign of the 
liquid crystal helix matches the pretilt of the two 
surfaces, in the same fashion as the TN. In prac- 
tice, higher pretilts are needed for STN, typically 
2° <0, < 8° 

Figure 6.24 shows the transmission character- 
istics for a set of devices with twist angles operat- 
ing across the supertwist range [109]. As the twist 
increases, both the Fréedericksz threshold voltage 


and the steepness of the curve increase, the latter 
allowing an increase of the number of lines that 
can be passive matrix addressed [89]. 

The inherent twist of the chiral nematic effec- 
tively reduces the twist elastic constant by the term 
(1-2n/P) and the Fréedericksz threshold becomes 


1 fa +(2) fs 2ky (1 6)| 


€ AE 


Vo = 
(6.89) 


Hence, V, increases with twist angle ®. Similarly, 
the voltage dependence of the midplane tilt angle is 
also affected, with Equation 6.80 now given by [109] 


k53 2 
2) + ky (1-48 + 
ky-(2) Kn n( B B*) 


1 
2 ~ k3(2B-1 
c = ot 33(2B—1) a 
(¥-1] +2) [ kss —2k22(1-B) | 1 


(6.90) 


As for the TN case, it may be assumed that k,, = 
Yk,, such that the denominator becomes 


1(@/ 
ku-H{2} k33(7-8B +B’) 
2\ 1 Ag 
© ) a : (6.91) 
L 
«+{2) k33B 


Assuming that the chiral doping is chosen to be in 
the central range for the given twist (6 = 1), then 
the denominator of Equation 6.82 becomes zero 
and the 6,,(V) gradient infinite when that twist is 
set to @..: 


(6.92) 


Low V, requires highly positive Ae, for which 
typically 0.3 <e,/Ae < 0.5. Ensuring that ®., is 
maintained at an attainable twist then necessitates 
k,3/k,, is as low as possible, contrary to the require- 
ment for the standard TN. For example, a typical 
mixture suitable for STN may have €, /Ae ~ 0.4 and 
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Figure 6.24 (a) Schematic of a typical two-polarizer reflective STN and (b) STN transmission charac- 


teristics for increasing twist angle. 


ky3/k,, = 0.85 [79] such that the gradient is highest 
for ®,, ~ 200°. Equation 6.92 only acts as a guide 
for material design but it shows that keeping both 
k,,/k;; and Ae/e, low is likely to give the highest 
degree of STN multiplexing. Therefore, the mate- 
rial scientist targets a high k,,, Ae and €,, whereas 
keeping k,, low to combine a low threshold voltage 
with a high multiplexibility. Where the gradient 
exceeds ®., the voltage response becomes hyster- 
etic, as shown for the 270° STN in Figure 6.24. This 
is unusable in RMS addressed displays and so typi- 
cally twists of ® = 4/, 2 = 240° are used in practice. 
Even then, switching can be disrupted through the 
formation of stripe domains and electrohydrody- 
namic instabilities that cause the helical axis to 
rotate into the plane of the cell [115]. These require 
a lower d/P to be used than the 6 = 1 condition, 
together with ensuring the pretilt is high. 

For a 240° STN, 5/8 $B <11/8 and the range of 
Fréedericksz thresholds is 


1 10 1 22; 
Tt, |——| ky, t+—k33 |< Ve <0, /——| ky, + — ks |. 
Jal 11 9 »| Cc fel 11 9 «| 


For the typical STN mixture with k,,/k,, = 1.2, Vo 
is 20-70% higher than that for the equivalent TN 


operation, with the higher threshold more practi- 
cal if B is kept low and the stripe voltage is to be 
avoided. Also, the elastic term that appears in the 
square brackets of Equation 6.89 is applicable to 
the response times: inserting into Equations 6.48 
show that the STN response compared is inher- 
ently quicker than the TN. However, this is rarely 
found in practice because the STN is usually highly 
multiplexed and the response much slower due to 
the much smaller changes of RMS voltage. 

The first STN demonstrators used a Guest- 
Host mode, in which anisotropic pleochroic dyes 
give contrast between the ON and OFF states [89]. 
However, optical contrast could also be achieved 
using optical retardation effects, in what is called 
the supertwist birefringence effect (SBE) [90]. 
STN also has a Gooch-Tarry-type dependence of 
the polarization optics, with the first minimum 
mode occurring at increasing An-d/d with twist, 
as shown in Figure 6.25, and the crossed polarizer 
angles set to Q, = 1/2(®- 10/2). The retardation for 
the first minimum is double that of the TN, poten- 
tially decreasing the speed of the device by a factor 
of 4. Even when high birefringence materials are 
chosen, the effect is too slow for animation, with 
>100 ms response times typical. However, the most 
significant drawback from the original technol- 
ogy was the optical properties: the OFF state is a 
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Figure 6.25 Optics of the supertwist birefringence effect mode. 


prominent shade of yellowy green or blue if the 
NB polarizer orientations are chosen, the viewing 
angle is poor and the contrast is much lower than 
that of the TN because the twist remains more 
evenly distributed through the cell in the STN case 
as the field is applied [116]. Various attempts were 
made to improve the appearance, such as the use of 
blue polarizers to give a white on blue display with 
high transmissivity, but the most successful were 
in the late-1980s, when optical compensators were 
added. The first method was to mount an inac- 
tive dummy STN panel onto the front of the LCD, 
equivalent to the active panel but with the opposite 
handedness [91]. This corrected both the coloration 
and the viewing angle and would do so across the 
whole temperature range. However, the additional 
cost and weight prohibited this approach commer- 
cially and polymer film retardation plates are now 
used instead [117]. The best color compensation is 
achieved through the use of two polymer films [7] 
as indicated in Figure 6.24. 

Even when the transmission characteristic is 
made infinitely steep, the effect of nonuniformi- 
ties, including electrode resistive losses and tem- 
perature variations, prevents multiplexing much 
beyond 240 lines. Driving alternate rows from 
opposite sides of the panel doubles the maximum 
number of rows. Such high degrees of multiplex- 
ibility represent a considerable improvement to 
the passive matrix TN and for that reason the STN 
continues to find a market where there is demand 
for a high black and white image content combined 


with low cost. However, the lack of gray scale and 
slow response speed prohibits video applications 
and it was these factors that fed ongoing research 
efforts for highly multiplexed displays. 


6.4.5 Ferroelectric liquid crystal 
displays 


Although making only a small commercial impact, 
FLCs commanded major R&D efforts through the 
1980s and 1990s, immediately after the publication 
by Clark and Lagerwall of the Surface Stabilized 
Bistable FLC mode [88] (SSFLC). Companies 
across Europe and the Far East each produced 
demonstrators to rival the nascent TFT technology 
[118]. However, the only panel to receive notable 
sales in the mainstream displays market at that 
time was the 15in. 1280 x 1024, 16-color monitor 
produced by Canon (Figure 6.16). 

Unlike the other mainstream LCDs described 
in this section, FLC does not use nematic liquid 
crystals but rather a particular type of smectic, the 
tilted smectic C phase, denoted as SmC. Unlike 
the nematic case, reorientation of the director 
for smectics is constrained by the presence of the 
smectic layers. The SmC phase is constrained in 
this fashion too but the director is relatively free 
to reorient about the layer normal in an imaginary 
cone of possible orientations. The component of 
the director in the layer plane is described by the 
unit director c, which acts as a two-dimensional 
nematic. It is the reorientation of the c-director in 
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response to applied electric fields that yields the 
potential for electro-optic effects suited for display 
operation. However, what makes the SmC so inter- 
esting is the inherent ferroelectricity allowed by the 
reduced symmetry of the SmC* phase. Although 
the nematic phase has cylindrical symmetry, the 
SmC is monoclinic: there is a single C, symmetry 
axis corresponding to the direction orthogonal 
to the layer normal a and director n. Such sym- 
metries are inherently biaxial, meaning that they 
have two optic axes and three principal permit- 
tivities. The difference between the two directions 
orthogonal to the n director is called the biaxial- 
ity. The refractive index biaxiality is negligible and 
FLCs are best treated as optically uniaxial mate- 
rials, with a single optical axis along the director. 
Thus, the polarization optics can be treated in the 
same way as a conventional nematic. However, 
the dielectric biaxiality de = €,—e, is significant 
[119,120] and, in essence, acts in a similar fashion 
to the nematic Ae, dictating the RMS response of 
the c-director. Reflections of the SmC molecules in 
the tilt plane do not disrupt the phase symmetry: 
molecular dipoles aligned along the C, axis have 
equal probability of being parallel or antiparallel. 
If the phase is chiral, this mirror symmetry is lost 
and there is a net dipole in the direction of +C,: 
the ferroelectric spontaneous polarization P,. This 
unique feature provides a polar switching torque 
many times greater than that possible in nematic 
liquid crystals, resulting in switching times that 
can be faster than 10 ps. 

For simplicity, a single elastic constant B and 
the flow-free viscosity y, are used, although the 
effect of the elastic anisotropy and the uniaxial 
dielectric anisotropy Ae is important too [121,122]. 
The switching torque of the FLC is then given by 
Ode 


Oz 


y, sin’ 0p = = Boos’ 5c + PE, cos6c sindc 


—£€)0€-E; sindc cosdccos 5c. (6.94) 


The ferroelectric torque differs from the dielec- 
tric having a linear dependence on the applied 
field E, rather than the RMS E?. This means that 
the torque depends on the field polarity as well 
as its strength. A typical FLC may have a P, = 50 
nC cm~ and dielectric biaxiality de = +0.5. For a 
typical +10 V signal applied across a 21m spaced 


cell, the ferroelectric torque is 50 times greater 
than the dielectric, dominating the electro-optic 
behavior. Assuming the material has a positive 
P,, then +E will tend to reorient the c-director on 
one side of the cone (toward = 0) and —-E to the 
other (p = 2). Simplistically, if these two condi- 
tions equate to the n-director being oriented 2/4 
apart and the device has a spacing set to give the 
quarter-wave plate condition and crossed polar- 
izers parallel and perpendicular to the director in 
one of the states, Equation 6.25 predicts that the 
device will switch between minimum and maxi- 
mum transmissivity. 

A second important aspect of the SSFLC is 
bistability. As for any LCD, the starting point is 
to achieve the desired alignment, uniformly over 
the whole sample [123]. There are several steps to 
consider for ferroelectric LCDs (FLCD). The usual 
SSFLC device geometry relies on the N*¥-SmA- 
SmC* sequence, as shown in Figure 6.26a. The 
device requires parallel alignment with the 
required pretilt 0,. The cell gap must be sufficiently 
low to unwind the cholesteric helix and provide an 
almost uniform nematic texture, as shown: devoid 
of twist and with only a slight splay and bend asso- 
ciated with the surface pretilt. Furthermore, the 
helicity of the SmC n-director must also be sup- 
pressed in a similar fashion. This uniform nem- 
atic texture should be retained on cooling into the 
SmA phase, where the layers will align uniformly 
perpendicular to the cell walls in what is termed 
“bookshelf” geometry. On cooling into the SmC* 
phase, the director tilts from the layer normal by 
the cone angle 0,, which grows continuously from 
0° at the second-order SmA to SmC* phase transi- 
tion to typically 22°-25° at ambient temperatures 
(Figure 6.26b). The smectic layer spacing contracts 
as the director tilts from the layer normal, caus- 
ing the layers to tilt by an angle 5, and form a 
symmetric chevron-like structure. The layer tilt 
remains a constant fraction of the cone angle, 
typically 5, = 0.856, so that the director remains 
continuous across the sharp chevron interface at 
the cell center. It is this interface that gives the 
SSFLC its bistability. The c director can be at either 
of two orientations, , and x—@;, in one arm, and 
—; and x + ¢, in the other. These correspond to 
just two orientations of the n director, as shown in 
Figure 6.26b. Application of a DC field couples to 
the ferroelectric polarization, eventually causing 
such a high torque at the chevron interface that 


6.4 Standard LCD modes 189 


(a) SmC(*) SmC(*) 
immediately well 
N(*) SmA below T. below T, 
(C1) (C1) 

a —>r -r -r 7x { 2 wT ( ) 

mww!8, og Se re (i) 

~ 2 FF NR} CH ch 

~ -— & Ky H+ a 

a: = Wp f ) (§) 

» Hoe OC & -y 

wan wa wT CA) 

_ i at \ aS Se SK 

— fF RE ° ; (1) 

ee! {= wa =< ade 

aie aie fe Prk \ « Ye § > 

tages ai 
azimuth 
angle 
B 
(b) = csdapiay n-director 
Biaxial {\ Layer 
permittivities | normal 

\ 3 n(e;) ea Chevrom 


Cell 


Projection of n 
onto cell plane 


Figure 6.26 


\ Projection of a 
\ 8 onto cell plane 


plane 


interface 


n-director 
orientations 


(a) Ferroelectric liquid crystal alignment on cooling through the sequence N*-S,-S_* 


for parallel aligned surfaces. C1 and C2 chevron layer textures with low and high tilt and triangular 
director profiles are shown. (b) Definition of angles for S.* devices and the source of bistability from 


the chevron interface. 


latching from one state to the other occurs. After 
the pulse, the director remains in that state, with 
the director relaxing back to one of the two quies- 
cent states. 

The surface pretilt and anchoring energies play 
fundamental roles in SSFLC devices. Conventional 
polyimides for nematic alignment are used. These 
have azimuthal and zenithal anchoring energies in 
the range of 0.3-1 mJ-cm”; high enough to be con- 
sidered infinite for nematic LCDs, but insufficient 
to cause changes to the layer orientation or SmC* 
cone angle in FLC. Immediately below the SmA- 
SmC* transition, the layers always tend to tilt in 
the direction favored by the surface pretilts. This is 
the Cl state, shown in Figure 6.26a. As the temper- 
ature is cooled further, 0, and 6, increase, forcing 
the surface director to lie further from the rubbing 


direction and pretilt angle. Close to the tempera- 
ture where the difference between the cone and 
layer tilt angles (0.-5,) approaches the surface 
pretilt 0,, the tilt of the layers swaps direction to 
form the C2 layer structure. For low surface tilt 
angles (0.1° < 0, < 2°) samples usually form a mix- 
ture of both Cl and C2 states. The lack of unifor- 
mity and zigzag defects that separate the regions 
of opposite layer tilt severely damages the device 
appearance. However, with intermediate pretilts 
(2° < ®, < 8°), the layers form the C2 state com- 
pletely and uniformly, whereas for higher pretilts 
still (12° < @, < 35°) the layers do not undergo the 
alignment transition at all and the sample is uni- 
formly C1. Both layer geometries have been used in 
SSFLC devices [101,118], though the Canon moni- 
tor used the high pretilt Cl approach. 
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The out-of-plane tilt of the director @ and in- 
plane tilt angle of the projection of the n-director 
into the cell plane are shown in Figure 6.26b. They 
are given by the expressions: 


B= wn" cosdc sinO¢ } 


sinc sindc sin8¢ + cosd¢ cosO¢ 


0=sin” (sindc cos0c sinOc¢ + sindc cos Qc). 
(6.95) 
At the chevron interface, there is no out-of-plane 


tilt and the director has an in-plane twist angle B,, 
given by 


Bn = cos = = } (6.96) 
{3} 


which is typically about 8°. For the C1 layer geom- 
etry, the surface tilt is chosen to be high, typically 
25°. For typical values of 0, 52 25° and 5, = 22°, 
Equation 6.95 gives an in-plane tilt of B, = 27°. Thus, 
the director twists from 27° at one surface to 8° in 
the cell center and back out to 27° at the other sur- 
face. Assuming that the twist is approximately lin- 
ear then the director profile is triangular [124], as 
shown in Figure 6.26a. The transmission of polar- 
ized light through such a cell is similar to that of the 
reflective TN mode given by Equation 6.84, except 
the opposite polarizer is crossed to the input polar- 
izer in the FLC case, rather than being parallel. This 
has the simple solution for the angle to align the 
polarizers with respect to the rubbing direction, B,,.: 


wn. —Bs) 1+—a? | 


tan2 (Bex Bs)= 1 > (6.97) 
1+—o 
4 
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as before. Equation 6.97 suggests that the device 
will appear highly colored in its quiescent state for 
retardations close to the full wave plate condition. 


However, if the cell spacing is reduced to the half 
wave plate condition, the device appears black with 
the polarizers aligned at the angel: 


puna Peebs, (6.99) 
2 

The example above gives B,,, = +18°. Thus, if the 
polarizers are placed at +18° and +108° to the rub- 
bing direction, the cell will appear black for one 
state and transmit most of the light when in the 
other domain, where the optic axis is about 36° 
from the polarizers. Approximating the structure 
to a uniform retarder, and setting the retardation 
to be at the half-wave plate condition for green 
light, then the FLC switches between states with 
the optic axis either parallel or at 2B,,, to the polar- 
izer. Equation 6.25 suggests that the transmittance 
should be 91%. This could be maximized so that 
B.., = 22.5° (and the director reorients through 
45°), for example, by using a material with a higher 
SmC* cone angel 0,. However, this also causes a 
decrease in switching speed and so Canon used 
the lower optical efficiency to help achieve a fast, 
flicker-free frame for their monitor. 

FLCD panels are addressed in a similar fashion 
to most passive matrix displays and in the same 
line-scanning method described in Section 6.2.4. 
However, the response is no longer to the RMS over 
the frame but rather the signal applied to each row 
must be sufficient to latch the pixels into a new state 
within the line time. In this fashion, the informa- 
tion is built up line-by-line. The row waveform has 
one particular sign of operation, allowing only one 
set of states to be selected appropriately. Both states 
are addressed, either using two subframes of appo- 
site polarities (with a cost of increasing overall frame 
time) or by preceding each addressing pulse by a 
blanking pulse that selects the black state regardless 
of the data being applied to the previous lines and 
then latching selectively or not in the addressing line 
(with the cost of reduced brightness for pixels that 
should remain white in consecutive frames). Various 
addressing schemes are possible [125], influencing 
speed, operating window, and appearance. 

A second approach to operating the SSFLC was 
also attempted jointly by RSRE (then DERA) and 
Sharp Corporation. This maximized device speed by 
using the C2 geometry and multiplexibility using a 
lower P, (=10 nCcm”) and higher dielectric biaxial- 
ity de (2+1) [119,121]. Equation 6.94 predicts that the 
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dielectric and ferroelectric terms become equal at 
about 32.5 V for such high biaxiality, low P, materi- 
als. If the field has the correct polarity to reorient the 
director from one side of the cone to the other, the 
dielectric biaxiality suppresses switching and above 
the voltage where the torques balance (=32.5V for 
these values), the director will remain unswitched 
indefinitely. In fact, electrical pulses at about 70% of 
this voltage [120,121] start to slow the response rap- 
idly, creating a minimum in the switching charac- 
teristic (tV\;;,). Operating close to this voltage (Le., 
about 23 V, for the high biaxiality SmC* material 
in this example) gives a highly nonlinear response, 
thereby enabling thousands of lines to be addressed 
with a high degree of insensitivity to temperature 
variations and line losses. Using this tV,,;,, mode, 
Sharp created a prototype color 4 HDTV, operating 
with a 121s line address time, to give 256 gray levels 
and a 60 Hz frame rate [101]. 

By the mid-1990s, the key advantages of the 
SSFLC over TFT TN were its perceived lower cost 
and excellent viewing angle. The good viewing 
angle was inherent to the SSFLC mode due the 
fact that the director remains in the same plane for 
both switched states, as shown in Figure 6.27. Such 
IPS gives excellent viewing properties, surpassing 
that of the foil compensated TN of Figure 6.27. 
However, in the mid-1990s, SSFLC lost the war 
with TFT-driven nematics because of two reasons. 
First, the number of critical mask steps grew with 
FLC complexity. To achieve microsecond pulses 
across a passive matrix required metal bus lines to 
be prepared. Moreover, smectics are fundamentally 
sensitive to shock because any flow in the panel 
disrupts the carefully aligned layers irreparably. 
Second, to prevent mechanical damage to the LC 
alignment, polymer walls were defined photolitho- 
graphically, again introducing a critical mask step. 
Together with the poorer yield associated with 
achieving perfect alignment, any cost advantage 
was severely eroded. The final battle came with the 
introduction of in-plane switched TFT nematics 
that matched, and eventually surpassed, the view- 
ing angle of even the SSFLCD (Figure 6.16). 


6.4.6 In-plane switching LCDs 


6.4.6.1 INTRODUCTION 


Operating with TFT removes steepness of the 
electro-optic transition as a critical design issue. 


This enables LCD modes to be chosen that meet 
the more stringent optical requirements for large 
area monitor and televisual displays. The crucial 
weakness of TN LCDs was the viewing angle, even 
with optical compensation. Viewing angle is par- 
ticularly important for large area displays, where 
images must appear uniform from the center of 
viewing to the corners. The viewing angle tar- 
get is harsher still if the image is to satisfy mul- 
tiple viewers. The television market also requires 
fast response times not just for black to white 
transitions but also between adjacent gray levels. 
Extremely high contrast ratios are needed to com- 
pete with emissive technologies such as CRT, PDP, 
and, most recently (OLED). 

Achieving pixel contrasts in excess of 10,000:1 
and >160° horizontal viewing angles requires cylin- 
drical symmetry of the director profile in both of 
the ON and the OFF states. Two approaches were 
developed during the late 1990s and early 2000s: 
IPS and vertical aligned (VA) modes as shown 
in Figure 6.28. Different manufacturers cham- 
pioned each mode and numerous modifications 
were tried. This section will concentrate on the IPS 
mode, and its derivative FFS, whereas VA modes 
are dealt with in the following section. 


6.4.6.2 BASICS OF IPS OPERATION 


In-plane fields are impractical for passive matrix 
displays but are readily achieved with active 
matrix LCDs by moving the common electrode 
to the active plate (in addition to the Source and 
Gate lines), in-plane electrodes are etched onto the 
active back-plane in either a comb-like structure, 
as shown in Figure 6.29a, or more commonly a zig- 
zag configuration. The field produced by comb-like 
electrodes is complex, varying both in the plane of 
the cell in the direction normal to the electrodes. 
Ignoring the effect of the liquid crystal permittiv- 
ity, and taking the first Fourier component only, 
the field has components [126]: 


Vv. Ty TZ 
E, ~—sin e ; 
a (A) sf =) 
V Ty Mz 
E,~ 5 
F cos( 2 Jeso[ =| (6.100) 


That is, the electric field components have a 
periodic form in the plane of the cell but decay 
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Figure 6.27 Schematic representation of a switchable quarter-wave plate with in-plane switching. 
One direction of viewing self-compensates the change reduction of birefringence with the increased 
optical path. The increase in retardation in the other direction (usually set to be the horizontal direc- 
tion for the white state) is relatively small and easily compensated. A polar plot for the contrast is 


sketched on the right. 


exponentially across the bulk of the cell. Rather 
unsatisfactorily, most treatments ignore the com- 
plex field behavior and assume that, if the cell gap 
and electrode widths are sufficiently low, the field 
is uniform and restricted to the cell plane. If the 
quiescent state of the nematic is aligned with the 
rubbing direction orthogonal to that of the applied 
field, the IPS mode has a Fréedericksz threshold 
field that is approximately given by Equation 6.34, 


although the errors for this can be severe in prac- 
tice [126]. Assuming a uniform in-plane field, the 
Euler-Lagrange equation for the elastic distortion 
above the threshold is [127] 
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which has the simple solution 
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Figure 6.28 Improvement of viewing angle using IPS or VAN modes. 
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Figure 6.29 IPS principles. (a) Schematic of the TFT in-plane electrode arrangement, (b) theoretical 
transmission calculated for low and high switching voltages for one electrode period [124]. (c) Mode 
choices associated with positive Ae and negative Ae materials. 
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o= oosin( ™2) (6.102) 


for fixed boundary conditions. Aligning the polar- 
izers parallel and crossed to the rubbing direction, 
the first-order solution for the optical transmission 
is given by that of a wave plate (Equation 6.25). 
Approximating the sinusoidal dependence in 
Equation 6.102 to the triangular form of Equation 
6.97 [124] and setting the device thickness at the 
half-wave retardation gives an effective in-plane tilt 
of the director of (o) =Qm/2 and the transmission 
approaches 50% with increasing voltage. At fields 
higher than that which gives o,, = 90°, the twist 
deformation becomes increasingly concentrated 
toward the surfaces. Thus, the effective twist angle 
<i> exceeds the optimum 45°, thereby causing the 
transmission to decrease, as shown in Figure 6.29b. 

The IPS mode can operate with either positive 
or negative Ae materials, Figure 6.29c, depend- 
ing on the orientation of the rubbing direction 
with respect to the electrodes ,. In this diagram, 
it is assumed that the anchoring on the surfaces 
is strong and director reorientation occurs in the 
bulk of the cell, although somewhat closer to the 
electrode plate due to the transverse decay of the 
field. Clearly, the director in the positive mode will 
tend to orient in the field direction. For the nega- 
tive mode, the director reorients to be orthogonal 
to the applied field, either remaining in the cell 
plane or tilting out of that plane. IPS is always 
favored in this case because the elastic energy asso- 
ciated with the twist elastic constant is lower than 
the splay-bend that would be induced by the tilt 
(i.e., ky. < ky, <k,3). Although the early demonstra- 
tors used negative Ae, positive mode is now more 
common because the materials combine higher Ae 
with lower viscosity and hence give lower operat- 
ing voltages and faster response times. 

With the TN, STN, and SSFLC modes, ana- 
lytical calculations for the on-axis optical and 
electro-optic behavior are reasonably accurate. 
Optimization of viewing angle required numeri- 
cal solutions due to the importance of off-axis 
refractive and reflection effects. However, with 
the adoption of the IPS mode and the variety of 
multidomain VAN modes covered in the follow- 
ing section, both the electric field and the director 
profile vary in two or even three dimensions. This 
necessitates numerical simulation to optimize the 


LCD [31,64]. For example, Figure 6.29b shows the 
calculated transmission for a 2D simulation of the 
IPS mode [128]. Above the electrodes, the field is 
almost vertical and the director either reorients 
vertically for positive Ae or remains unaffected 
by the field if negative Ae; in either instance the 
electrodes appear as unwanted dark bands in the 
pixel. This necessitates that the gap should be much 
larger than the width of the electrodes / > w. If 
too high, the field is reduced and the switching 
voltages become higher. Typically, the electrodes 
are w = 31m wide and have a gap / = 6-8yum. 
Therefore, the banding represents a major reduc- 
tion in the optical efficiency of IPS mode. 

Ideally, the transmission-voltage character- 
istic should approach linearity across the voltage 
range, readily giving gray levels. The response of 
the IPS mode is strongly dependent on the angle 
of the rubbing direction with respect to the elec- 
trodes, :, [128]. For positive mode IPS, , = 30° 
is used typically [129], not only leading to suitable 
transmission characteristic and near linearity of 
the gray scale response times, but also significantly 
improving the response time, which decreases lin- 
early with increasing ¢. 

The viewing angle characteristics of the IPS 
mode exceed even that of the SSFLCD, shown in 
Figure 6.27. The first improvement was to orient the 
electrodes in a small-angled zigzag, to help widen 
the viewing cone. However, the stringent require- 
ment to maintain very high contrast well off-axis 
still necessitates the use of optical compensators. 
For example, the crossed polarizers themselves 
leak at the high angles in the four quadrants cen- 
tered at 45° and 135°. A typical IPS mode LCD 
used in television achieves pixel contrast ratios in 
excess of 2000:1 for direct viewing, and contrast 
in excess of 1000:1 over 175° horizontal and ver- 
tical viewing. The lowest pixel contrasts still sur- 
pass 200:1 at 140° viewing in the 45° quadrants. 
Such impressive viewing angle figures are achieved 
using a front uniaxial wave-plate with its axis ori- 
ented crossed to the rubbing direction (and input 
polarizer) combined with a negative uniaxial wave 
plate formed from a homeotropic discotic liquid 
crystal polymer [130]. 


6.4.6.3 FFS MODE 


Despite the impressive performance of IPS, the 
mode suffers from poor optical efficiency due to 
the banding structure caused by lack of director 
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Figure 6.30 Fringe field switching mode. (a) Basic structure and operation and (b) typical transmission 
distribution with varying voltage within a period of the data electrodes, shown in blue. 


switching above the electrodes. For high-reso- 
lution portable display applications, this is very 
important because increasing the output from the 
backlight to compensate for optical inefficiency 
is too costly for battery life. Many mobile phones 
and tablets deploy FFS mode LCD [96,97]. This is a 
modification to IPS mode, where a uniform coun- 
terelectrode is placed beneath a dielectric layer 
with the interdigitated electrodes on top. The pitch 
of the electrodes is much smaller than the cell gap 
(Figure 6.30). There is little electric field directly 
above the central line of the data electrodes, but 
immediately either side of this there is an in-plane 
field component that causes the director to reorient. 
This arrangement greatly reduces the transmission 
loss caused by the banding and can be operated 
with electrodes that are closer together, therefore, 
allowing lower operating voltages to be used. The 
fringing fields break the symmetry sufficiently to 
allow operation without alignment pretilt, again 
helping to improve viewing angle toward the limit. 
This is difficult with conventional rubbed poly- 
mers but can be achieved using photoalignment. 
Typically, the dielectric layer is 500nm thick, 
providing much higher fields close to the elec- 
trode surface than typical for the IPS mode [131]. 
Together with the reduced electrode spacing, this 
lowers operating voltages, and hence power. The 
dielectric layer acts as the storage capacitor for the 
pixel, thereby increasing the aperture ratio and 
concomitantly increasing optical efficiency still 
further. Typically, FFS mode has double the optical 
efficiency of the IPS mode, with the only disadvan- 
tage being that two transparent electrodes need to 


be deposited and etched onto the rear plate dur- 
ing fabrication process. The success of FFS mode 
can be measured by its adoption for many portable 
products, notably the Apple iPad from 2011. 


6.4.7 Vertically aligned nematic 
modes 


As for IPS, the invention of VAN occurred early 
in the history of LCDs [85] but was reinvigorated 
with the widespread adoption of TFT and the need 
for wide-viewing angle technologies in the mid- 
1990s. Homeotropic alignment gives a near per- 
fect black state at normal incidence and a viewing 
angle easily compensated using a negative uniaxial 
retardation plate. Being dominated by the dark 
state, the contrast of VAN mode devices is gener- 
ally exceptional at all angles and is independent of 
temperature or cell gap variations. The director is 
switched into the plane of the cell when a negative 
Ae liquid crystal is used. The direction of tilt will 
be degenerate and form scattering domains unless 
some preferred orientation is imparted to the cell. 
For example, if one of the homeotropic surfaces is 
rubbed, it gives a pretilt of typically 89.8°, and the 
director will tilt uniformly in this direction with 
increasing field. However, this will give a poorer 
viewing angle for the white state, as indicated in 
Figure 6.8. To overcome this limitation, Fujitsu 
invented the multidomain vertically aligned mode 
in 1997 [94,95], termed MVA mode. The aim of the 
invention was to maintain the vertical cylindrical 
symmetry as the director reorients with applied 
voltage, using domains of opposing tilt. Typically, 
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each pixel is subdivided into two or four areas with 
orthogonal tilt directions for each. Different area 
ratios may be used for the horizontal and verti- 
cal directions, provided that the area for opposing 
pairs is equivalent. 

Initial attempts to produce multidomain align- 
ment used the rather impractical approach of dual 
rubbing, where a rubbed surface was protected 
during a second antiparallel rubbing by a pho- 
tolithographically defined mask that was subse- 
quently removed. The approach that Fujitsu took 
was to arrange dielectric protrusions onto the 
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electrodes and underneath the homeotropic align- 
ment layer, as shown in Figure 6.31a. Each protru- 
sion has a convex shape with side walls angled to 
the surface but is sufficiently small and rarefied 
to have negligible effect on the overall pixel align- 
ment. When the voltage is applied, the slight field 
fringing around the protrusion causes tilting in 
opposite directions on either side of the protru- 
sion, thereby breaking the symmetry and automat- 
ically causing domains of the opposite tilt sense. 
Early modules included protrusions on both inner 
surfaces, as shown in Figure 6.31, but including 
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Figure 6.31 Vertical aligned nematic modes. (a) Multidomain vertical aligned (MVA) mode, (b) pat- 
terned vertical aligned (PSA) mode, and (c) axially symmetric multicell (ASM) mode. 
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the protrusion on a single surface only saved addi- 
tional photolithographic steps and was found to 
give satisfactory performance. 

Samsung made a further advance for the VAN 
mode in 2000. Similar to the MVA mode, the pat- 
terned vertically aligned mode (PSA) replaced the 
protrusions with slits in the electrodes to create 
the fringing fields (Figure 6.31b) [98]. Sharp [99] 
designed the axially symmetric microcell (ASM) 
mode LCD, where polymer walls surround the 
nematic, forming a microcell container for the liq- 
uid crystal (Figure 6.31c). The walls are formed by 
photoinduced phase separation of a monomer dur- 
ing the polymerization that occurs on UV expo- 
sure of a grid like pattern. A chiral dopant is added 
to the nematic with the correct pitch to ensure a 
twisted orientation of 90° for the given cell gap. 
Unlike the MVA and PVA modes, disclinations 
are avoided altogether because the director forms a 
monodomain while retaining the axial symmetry 
that results in the exceptional viewing angle. 

These VAN modes share the properties of 
wide viewing angle, very high contrast, and fast 
operation. High switching speeds are common to 
each of the vertically aligned modes because the 
field-induced distortion is dominated by the bend 
elastic constant k,,, which is usually significantly 
higher than both k,, and k,, (Equation 6.48). The 
mode also has the advantage of not requiring 
the rubbing step during fabrication, which can 
give improved yield and costs. However, the mode 
is slower than modern IPS mode panels because 
full switching is needed to obtain the bright state, 
whereas IPS switches the director through a lower 
angle if pb, >> 0. 


6.5 LCDS FOR NONMAINSTREAM 
AND NICHE MARKETS 


6.5.1 Introduction 


The wealth and diversity of different device modes 
that exploit liquid crystals is a measure of both the 
variety of phases and behaviors exhibited by these 
organic materials and by the ingenuity of the scien- 
tists, engineers, inventors, and innovators working 
in this field over the past half century. The path to 
providing flat screen monitors and full high defini- 
tion television displays is littered with unsuccess- 
ful attempts. However, many of those technologies 
found, or indeed created, niche markets, offering 


unique selling points in other applications. A selec- 
tion of some of the more important LCDs used 
outside the conventional direct view displays are 
summarized in this section. 


6.5.2 Cholesteric LCDs 


6.5.2.1 TEMPERATURE, STRAIN, AND GAS 
SENSORS 


From the first discovery of liquid crystals by 
Reinitzer in 1888 [18], the bright colors of choles- 
teric liquid crystal textures have fascinated observ- 
ers. Indeed, the first liquid crystal application was 
not an electronically addressed display but rather 
as sensors that deployed this coloration. In the early 
1960s, prolific inventor James Fergason observed 
that the peak wavelength of cholesteric selective 
reflection depended strongly on both temperature 
and trace amounts of chemical vapor [132], envis- 
aging device applications. His work inspired many 
applications from detecting minute temperature 
changes on human skin due to the influence of 
nicotine or underlying tumor to visualization of 
faulty electronic circuitry. Photographer Henry 
Groskinsky, inspired by the vivid colors of the 
cholesteric, recorded these applications for Life 
Magazine in 1968 [133]. The article also mentioned 
that the liquid crystal could be encapsulated into 
a polymer, another of Fergason’s inventions that 
will be described in Section 6.5.5, that was used 
by National Cash Registers to detect the mechani- 
cal strain within a loaded spring, an application 
that was later extended to aeronautical testing of 
planes, rockets, and turbine blades [134]. Indeed, it 
was encapsulation that eventually led to the mass 
production of liquid crystal thermometers that 
remain popular today. 

The cholesteric has a natural helical structure 
shown in Figure 6.1. Light traveling along the 
helical axis undergoes Bragg reflection due to the 
repeating nature of the optic axis. This occurs for 
circular polarized light with the same handedness 
as the cholesteric pitch P and at a band of wave- 
lengths centered on A, given by 

hy = n-P-cos0, (6.103) 
where 0 is the angle of incidence with respect to 
the helical axis and n is the average refractive 
index =/, n, + '/,n,). Strong coloration occurs 
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when A, matches a visible wavelength, from 400 
to 700nm. The width of the reflection band Ad is 
approximately 

AX = An-P. (6.104) 
The color is, therefore, made more vivid by using 
weakly birefringent cholesteric materials. Light of 
wavelengths outside the band of selective reflec- 
tion is transmitted through the sample, as is all 
light of the opposite handedness. Optical con- 
trast then requires the liquid crystal layer to be 
mounted onto a dark backing material, to absorb 
the transmitted light. Tuning of the temperature 
range over which the pitch varies in the visible 
regime usually requires two or more compo- 
nents with different pitches and temperature 
dependences. The pitch diverges as the second- 
order cholesteric to smectic phase transition is 
approached, leading to rapidly changing color 
from red to blue with increasing temperature. 
Varying the concentration of components with 
strong smectic local ordering in the cholesteric 
mixture provides control over the operating tem- 
perature range for the thermometer. 


6.5.2.2 BISTABLE CHOLESTERICS 


Selective reflection of colored light from cholesteric 
liquid crystals can be used for electronic displays 
too. Moreover, the cholesteric electro-optic char- 
acteristic can be arranged to be bistable, allowing 
unlimited multiplexibilty using low-cost passive 
matrix addressing. Among the first optical switch- 
ing modes to be studied at RCA was the bistable 
cholesteric [135], the switching mechanism for 
which was elucidated during the 1970s [136,137]. 
However, the success of the bistable cholesteric is 
largely due to the team at Kent State University 
headed by Doane [138] and the many innovations 
made by the engineers at the spin-out company 
Kent Displays Inc. (KDI) [139,140]. 

The basic operation of the display is to switch the 
cholesteric liquid crystal between the Grandjean 
texture, where the axis of the helix is largely nor- 
mal to the display that, therefore, appears reflec- 
tive, and the focal conic texture, where the helix 
lies in the plane of the cell and light is forward scat- 
tered to be absorbed by the rear (black) substrate. 
Latching between the states is done via an interme- 
diate homeotropic state, where the field is unwound 
by a high electric field coupling to the positive Ae 


of the liquid crystal, as shown in Figure 6.32a. 
If the field is switched off immediately, the helix 
forms with its axis vertical to the pixel, thereby 
appearing colored due to selective reflection. If the 
field is reduced more gradually through an inter- 
mediate level, the helix forms in the plane of the 
cell, in the focal conic state, and the pixel appears 
dark. The degree of hysteresis and the sensitivity 
of the device to mechanically induced damage is 
controlled using polymer stabilization. 

Most displays are designed to be monochrome, 
using materials with high birefringence to give the 
broadest reflectivity. The highest practical levels of 
An give yellowish-green coloration against black, 
although some customers prefer the inverted 
optics of white and blue when a blue background is 
used. Together with the slow response speed, high 
voltage and sensitivity to shock, the poor appear- 
ance means that the devices are not commer- 
cially successful and have been largely superseded 
by other choices. However, bistable cholesterics 
remained of interest for full color bistable reflec- 
tive displays. Lower birefringent materials are used 
to give individual layers with sharp red, green, 
and blue reflection bands. Stacking three layers in 
series then allows full reflective color. This type of 
technology was successfully applied to large area 
signage by the company Magink, providing bright 
billboards for advertising purposes [141]. As a 
reflective display, the panels were ideal for bright 
sunlit conditions, where they could outperform 
LED electronic signage. KDI also used a triple 
stack to produce full color reflective displays for 
portable products. Parallax between the layers was 
minimized through polymer stabilization of the 
individual active layers mounted directly onto a 
backing foil [140], as shown in Figure 6.32b. Not 
only did this enable good optical performance, but 
also the resulting display was very flexible indeed, 
as is apparent from the demonstrator shown in 
Figure 6.32c. Although not successful commer- 
cially, this also remains true of all other reflective 
color display modes and remains an important gap 
for future developments. 

The technical advances made by KDI for flexible 
color plastic displays promised new applications 
outside the display field, such as electronic skins, 
Figure 6.32d, and electronic writing tablets (Figure 
6.32e). The Boogie board isan electronic writing pad 
that uses mechanical pressure to induce the reflec- 
tive Grandjean texture, on a black background. 
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Figure 6.32 Bistable cholesterics. (a) Operating principle, (b) trilayer stack for full color, (c) proto- 
type of the film backed triple N* stack, (d) switchable electronic skin using a single layer choles- 
teric, (e) the successful Boogie Board product from KDI, using bistable cholesteric to form a writing 
tablet. 


When the image needs to be refreshed, an electri- 65,3 Bistable nematic LCDs and ZBD 
cal blanking pulse is applied to erase the page. This. |= _ 
product continues to be successful in a niche mar- _Bistable operation of an LCD allows many lines 
ket that the technology has created. of information to be passive matrix addressed, 
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where each frame is written line by line and each 
line retains its information until it receives the next 
addressing signal. Before the large-scale adoption of 
TFT back planes and active matrix addressing, var- 
ious bistable nematic modes were invented, com- 
plementing the contemporary efforts to develop 
bistable cholesteric and FLCD. Few of these modes 
made it even to demonstrator stage. However, in 
the late 1990s, there was a resurgence of interest in 
bistable nematics [126], notably the work at Seiko- 
Epson on the bistable twisted nematic (BTN) mode, 
the 180° BTN or Binem developed by Nemoptic and 
the ZBD by ZBD Displays Ltd. (now Displaydata). 
Several factors drove this renaissance: 


e Simple passive matrix displays based on TN 
and STN did not lose market share with the 
introduction of TFT; rather, the markets that 
they had created, such as watches, calculators, 
instrumentation, etc., continued to need low 
cost, usually reflective displays. Whereas new, 
high investment production plants in Japan, 
Korea, and Taiwan developed TFT LCDs for 
high end applications, a plethora of manu- 
facturers in China continued to serve what 
remained a $1B passive matrix market well into 
the 2000s. 

e Bistability promised to create new niche LCD 
markets, particularly where ultralow power 
was required. Markets included electronic 
shelf-edge labels with high information 
content, electronic book readers, smart card 
displays, secondary displays for mobile phones, 
and an enormous range of indicator displays, 
from car key fobs to razor blades. Indeed, the 
image storage offered by bistability continues 
to be a unique selling point for many applica- 
tions associated with the Internet-of-things, 
where displays can be updated automatically 
using radio frequency (RF) signals, rather than 
rely on costly batteries and associated circuitry. 

e Bistable displays often have good potential for 
plastic displays. TFT on plastic remained a 
challenge to fabricate through the 2000s, due 
to the difficulties of registering high-resolution 
patterns on a flexible backplane, and the low 
fabrication temperatures required for plas- 
tic substrates. Bistable nematics offered the 
possibility of high image content based on 
simple, low temperature, low cost fabrication 
on plastic. 


However, these drivers were also attractive to 
other non-LCD modes, such as Janus colloids 
from Gyricon, interference mode micro-elec- 
tro-mechanical-systems (MEMS) from Iridigm/ 
Qualcomm, electrowetting from Liqua-vista/ 
Amazon and Gammadynamics, and electrophoret- 
ics from Bridgestone and E-ink. In 2005, Amazon 
selected E-ink displays for its Kindle e-Book read- 
ers, based largely on the appearance of its white 
state and despite the significantly higher cost due 
to the active matrix. This created demand for scat- 
tering mode devices and many new markets chose 
electrophoretics over LCDs. One bistable LCD 
technology that survived was the ZBD, marketed by 
ZBD Displays Ltd., now Displaydata. This spinout 
from R.S.R.E (then DERA) in the UK targeted the 
retail signage market, and in particular, electronic 
shelf-edge labeling. For a retailer to replace tens of 
thousands of electronic labels per store, cost was 
paramount, and the bistable LCD has a significant 
advantage over electrophoretics, whereas offering 
superior performance and higher image content 
than the incumbent directly addressed segmented 
TN LCD. The company’s success was not only 
due to the bistable LCD but also by development 
of a novel RF communications protocol [125] that 
allowed small to midsector retailers to take advan- 
tage of the labeling, whereas only hyperstores had 
been able to afford the infrastructure required for 
previous labeling systems. The company has sold 
several million labels worldwide and now offers a 
combination of the ZBD LCDs alongside E-ink for 
higher-end application. 

Bistability results where a device has two stable 
states with similar free energies that are separated 
by an energy barrier, wherein transitions from one 
state to the other are discontinuous, or first order. 
An early approach was the BTN mode [142], which 
followed similar principles to that of the STN, but 
set the d/P ratio of the chiral nematic to lie half- 
way between states of low and high twist angles ®. 
For example, setting d/P = 0.5 with parallel surface 
alignment should give a x twist state. However, if 
the pretilt on both surfaces is sufficiently high, the 
cost of the induced splay energy becomes greater 
than that for twist. Thus, the chiral nematic may 
either unwind to a uniform Oz state to match the 
surface condition or may wind further to form the 
® = 2n twist state (Figure 6.33a). Switching from 
one state to the other then relies on whether or not 
flow is induced immediately after a high electrical 
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Figure 6.33 Operation of bistable twisted nematic. 


pulse coupling to a positive Ae. If the pulse returns 
to OV via an intermediate voltage, there is little 
induced flow and the Om state is formed, whereas 
a direct transition to 0 V induces flow that encour- 
ages director twist at the cell center and the ® = 21 
state is formed. The two states are metastable, so 
the texture relaxes back to the intermediate 1-state 
after a second or two on removal of power. This 
means that the device was not suited to zero power 
applications. Rather, Seiko Epson used it as the dis- 
play for Hi-Fi Graphic Equalizer displays due to its 
very fast optical response [143]. 

A similar approach was taken by Nemoptic, 
who also used a BTN configuration, but with d/P = 
0.25 to give either 0 or x twist states (Figure 6.33b). 
In this instance, the director cannot change its 
twist from one state to the other without break- 
ing the anchoring at one of the surfaces, requir- 
ing that one of the surfaces be deliberately weakly 
anchored [144]. Switching of the device again uti- 
lized back flow depending on the trailing shape of 
the addressing pulses. Marketing the device under 
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the trade name Binem, Nemoptic produced various 
demonstrators [145], including full color reflective 
displays, TFT-driven panels, and, intriguingly, a 
pixelated switchable quarter-wave plate mounted 
onto the front of an OLED display to switch 
between high-power emissive video frame rate dis- 
play and ultralow power black and white E-reader 
mode [146]. Despite the excellent optical appear- 
ance, 0-2 BTN required cell gaps below 21m and 
suffered from manufacturing tolerances that were 
difficult to achieve. Perhaps the biggest cause of 
the company’s eventual demise in 2010 was that it 
failed to find the correct niche for its product. 

The ZBD is rather unusual in several aspects 
[147,148]. It uses a grating as a surface alignment 
layer, designed to impart bistable pretilts of the 
contacting nematic regardless of the overall geom- 
etry chosen (Figure 6.34). This allows many differ- 
ent LCD designs to be used, including VAN-HAN 
mode [147], HAN-TN mode [147,149], multistable 
VAN-HAN-IN modes [150], gray-scale displays 
[151], single polarizer mixed TN modes [152], 
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Figure 6.34 The zenithal bistable display mode, ZBD. (a) Latching from the C to D state occurs with 

a bipolar pulse with the trailing part positive with respect to the grating. The applied field nucleates 
defects at the bifurcation point on the grating sidewall, and the positive polarity favors the D state 
with —'/, disclinations at the convex surface and +'/, defects at the concave surface. The resulting low 
surface tilt creates a 90° TN state when placed opposite a conventional monostable rubbed polymer 
surface as shown. Operating in the NW mode, with the grating aligned parallel to the front polar- 
izer and with n matched to the liquid crystal ordinary refractive index gives an excellent white state. 
(b) Applying pulses of the opposite polarity causes the defects to retrace their path along the grating 
surface until they annihilate. The resulting C state is continuous and homeotropic, thereby creating 
HAN alignment. This appears black when between the crossed polarizers as shown, and the viewing 


angle self-compensates when in reflective mode. 


scattering modes [153], and ultrafast Pi cell modes 
[150]. The device is an early example, and the only 
LCD currently on sale, that utilizes the flexoelec- 
tric effect for latching between the two states. 
Moreover, the device deliberately uses disclina- 
tions at the deep, homeotropic-grating surface to 
stabilize the low-tilt or D state. The potential bar- 
rier between this state and the high-tilt defect free 
or continuous state (C state) is mediated by the cre- 
ation and annihilation of these surface defects. The 
voltage for latching from one state to the other Vop 
is related to the pulse width t by Reference [154]: 


Vop 2 Vs $ 2Wo 
d (e:+e3)T (e: +e; ) + JeoAeKs; 
I, 
Jo ia, (6.105) 
(e, +e3)T 


where W, is the zenithal anchoring energy of the 
homeotropic grating surface, I, is the coefficient of 
slip for defects moving across that surface, e, + e; 
is the sum of the splay and bend flexoelectric coef- 
ficients for the liquid crystal material, and the 
dielectric effect of the grating has been ignored. 
Controlling the anchoring energy in the range 0.2 
mJ-cm~ to 1 mJ-cm~ allows the threshold voltage 
V,, to be adjusted to typically about 1 V pm", typi- 
cally 7 V. 

The device is usually configured with the grat- 
ing opposite a standard parallel aligned rubbed 
polymer surface to create a 90° TN when in the 
low-tilt D state. This state is always formed first 
on cooling from the isotropic to nematic phase 
because the defects are stabilized at the surface 
when the S order parameter is low. Thus, the inter- 
pixel gaps remain in the TN state and good display 
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reflectivity results when operating in the NW TN 
mode. Typical cell gaps are 71m, operating at 
15-20V using standard STN driver electronics, 
LC mixtures, and polarizers. Operating tempera- 
tures from —25°C to 40°C and-—5°C to +60°C were 
achieved using low and high anchoring energies, 
respectively. 

The main technical challenge for ZBD Displays 
Ltd. was to introduce a low-cost and reproducible 
manufacturing method for a 0.8p1m pitch, 1m 
high blazed sinusoidal grating into a standard pas- 
sive matrix LCD production line using Gen 2 glass. 
This was done by copying a photolithographically 
defined master grating into a lacquer on polyethyl- 
ene terephthalate (PET) film via a nickel sputtering 
and electroforming replication technique [148,155]. 
The film was shipped from the UK to manufactur- 
ers in China, where it is used to emboss the grating 
into a homeotropic photopolymer deposited on the 
glass surface, thereby replacing the conventional 
rubbing step for that plate. This method allows the 
technology to meet the same price point as con- 
ventional STN, where the costs of the compensa- 
tion foils required for the STN offset that of the 
ZBD grating film. Having achieved such low price 
points for its chosen niche market, it remains to be 
seen if ZBD can replace conventional TN and STN 
displays in other markets. 


6.5.4 Polymer dispersed liquid 
crystals 


A weakness of LCDs that is particularly evident in 
reflective mode devices is the constraint for polar- 
izers, which absorbs more than half of the avail- 
able light. This inefficiency was avoided in the first 
dynamic scattering displays but the contrast and 
lifetime of those devices were far inferior to retar- 
dation-based LCDs such as the TN that their period 
of success was very short lived. Hilsum [156] pro- 
duced a scattering device by mixing glass micro- 
spheres into a nematic, creating a scattering texture 
that could be switched to a nonscattering state by 
an electric field. A more practical device was the 
polymer-dispersed liquid crystal (PDLC). This is 
another example of an LCD that initially aimed to 
produce bright displays but which found success 
when the technology was applied to a novel prod- 
uct with a niche market; for PDLC this was privacy 
glass. 


Following his successes with cholesteric sensors 
and the invention of the TN LCD soon after Schadt 
and Helfrich (claiming precedence in the United 
States), Fergason invented a method of encapsulat- 
ing liquid crystal droplets into a polymer matrix 
in the early 1980s [157]. Originally called NCAP 
by its inventor, PDLC usually takes the form of a 
plastic layer that can switch between scattering 
and nonscattering states with an applied electric 
field [158,159]. An ITO-coated glass or transpar- 
ent plastic cell is filled with a nematic liquid crystal 
mixed with a monomer. Curing of the monomer 
into a solid polymer form is done by evaporat- 
ing the monomer solvent, by applying a thermal 
treatment, or, most commonly, through the ini- 
tiated polymerization of constituent photoreac- 
tive groups. Phase separation of the liquid crystal 
occurs as the polymer forms from the monomer, 
creating droplets within the polymer matrix. 
Often, a surfactant is included both to help control 
the dispersion of droplet size and align the nematic 
into a radially symmetric or bipolar state, as shown 
in Figure 6.35. Droplets in the 0.5-5y1m range scat- 
ter incident light intensely due to the refractive 
index mismatch between the polymer and the liq- 
uid crystal. Application of an electric field across 
the sample then causes reorientation of the liquid 
crystal director, leaving the cross-linked poly- 
mer undisturbed. Usually, the liquid crystal has 
a positive Ae and an ordinary refractive index n, 
matched to that of the polymer. Thus, the scatter- 
ing is reduced as the liquid crystal aligns parallel to 
the field and the light incident close to the normal 
direction is transmitted unchanged. 

Droplets with radial alignment have a single 
defect at the droplet centroid (called a “Boojum”) 
leading to spherical symmetry, whereas bipolar 
droplets have point defects on opposing surfaces 
and cylindrical symmetry of the director field. 
Typically, droplets are between 1 and 10m and 
form the bipolar structure with randomly oriented 
symmetry axes. The electrical field behavior is com- 
plex. The field required to align the director paral- 
lel to the field, and hence approach the minimum 
deflection of incident light, is inversely proportional 
to the droplet radius, a [160]. For a radial droplet, 
the critical field is approximately related to [161] 


put |k (6.106) 
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Figure 6.35 Polymer dispersed liquid crystal for a positive Ae operating in the scattering to nonscat- 
tering mode. (a) Droplets of different sizes with random orientations of the director profile, form- 

ing either a radial or bipolar configuration. Light is strongly scattered, including back scattering. 

(b) Application of the field initially causes little distortion of the director profile within each droplet, 
but the profile adjusts to align the disclinations within the plane of the film and the director with a 
net direction parallel to the field. The scattering reduces somewhat. (c) Increasing the field causes 
distortion of the director field within each droplet toward the vertical orientation, where the refrac- 
tive index of the droplet and polymer matrix approaches the matched condition, and little scattering 
occurs. (d) Conventional back-scattering mode of PDLC; (e) novel, low-voltage mode using specu- 
lar reflection as the black state; and (f) common mode for privacy windows, where the plastic film 


switches between translucent and transparent. 


where k is a mean elastic constant. The symme- 
try of a perfectly spherical bipolar droplet prevents 
analytical solutions being made and needs the 
symmetry to be broken. A practical approach is to 


assume that each droplet is slightly elongated with 
different semimajor axis a and semiminor axis b. 
For liquid crystals with positive Ae, weak applied 
fields reorient the symmetry axis parallel to the 
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direction of the field. The critical field is then given 
by [161] 


k3(a? —b? 
Ec = 1 Otc 42 33( ) , 
3ab\ Op € Ae 


where 6, and 6, are the low frequency conductivi- 
ties of the liquid crystal and polymer, respectively. 
The response times are given by 


(6.107) 


\ 
e)AeE? + Kes (a’-b’) 
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Typically, droplets of a= 1m, b = 0.9m, k,; = 10 
pN, y, = 0.05kg m's~ and Ae = 15, and o,/6, = 28 
give E,=1.5V pm! and Topp = 20 ms. Substantial 
scattering needs 20 or so droplets in the direction 
of incidence, so film thicknesses of 20 um are typi- 
cally used. Thus, the voltage of such PDLC films 
can be high, typically much higher than 30V to 
achieve saturation of the transparent, nonscatter- 
ing state. 

If the droplets are sufficiently small, and the 
device spacing high, then the OFF state can lead 
to significant back scattering. This can be used in 
a reflective mode display by mounting the PDLC 
film onto a black background to provide optical 
contrast (Figure 6.35d). However, the switching 
voltage inherently increases both with smaller 
droplets and higher device spacing. Moreover, 
the shallow switching response necessitates the 
use of TFT to matrix address the pixels, limiting 
the switching voltage to 15 V and preventing any 
useful degree of optical effect. Recently, Sharp has 
produced low-voltage TFT PDLC displays where 
the layer is mounted instead onto a specular mir- 
ror [162]. Relatively large droplets keep the voltages 
sufficiently low for TFT addressing (=6 V) and the 
forward scattered light is scattered more strongly 
after being reflected back toward the observer a 
second time by the mirror. In the ON state, the 
forward scattering is removed and the viewer sees 
the specular reflected light. In applications, where 
the display is illuminated by a point source, this 
will appear black in all directions except where the 


display is oriented directly between the viewer and 
light source (Figure 6.35e). 

Although never successful for mainstream dis- 
plays, one market where the PDLC has founda niche 
is for privacy windows. The PDLC can be made as a 
laminated plastic roll, which can be applied adhe- 
sively to existing infrastructure, and trimmed to 
the appropriate size and shape. The window can 
then switch between a translucent, “milky white” 
appearance in the OFF state, to being clear and 
fully transparent when ON. Although this adds 
costs to the window, it is increasingly popular in 
high-end architectural projects for both interior 
and exterior applications. Uses include privacy 
control of conference rooms, intensive care areas, 
or bathroom doors, for example, to providing 
architectural design features. Moreover, the mate- 
rial can also be adjusted to act as an environmental 
smart window, switching between heat retention 
and loss as required [163]. Other applications such 
as temporary projection screens, active camouflage 
and switchable diffusers are all possible [164]. 

Another potentially profitable application for 
PDLC is switchable holographic media [165]. If a 
coherent image is used to cure the polymer dur- 
ing the fabrication process, switchable diffractive 
optical elements can be fabricated with drop- 
lets ranging from 100nm to 1pm forming holo- 
graphic gratings from the interference pattern of 
the illumination. Rather than cause scattering, the 
droplets locally modulate the refractive index to 
generate a diffraction grating. Holographic optical 
elements (HOE) are used for a variety of optoelec- 
tronic applications, including focusing, beam- 
steering, filtering, and optical multiplexing. The 
use of holographic PDLC provides the means for 
switchable HOE, allowing dynamic beam steering, 
tunable filtering, and optical signal processing. For 
holographic PDLC, the droplet size a is typically 
submicron and the OFF time is very fast: typi- 
cally 50-500 1s. However, E, also increases with 
decreasing droplet size and so typically 100-200 V 
is required. This high field dominates the ON time, 
which again is fast, typically 50 us. 


6.5.5 Liquid crystal on silicon 


Rather than apply a semiconductor onto glass to 
provide transistors, LCOS places the liquid crystal 
element directly onto a CMOS integrated circuit 
[166,167]. Of course, as the silicon is opaque, the 
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devices must operate in reflection; usually, silver 
electrodes patterned directly onto a CMOS inte- 
grated circuit to provide both the pixelated electric 
field pattern and the highly reflective rear substrate. 
The chip not only provides the drive signals for the 
contacting liquid crystal but can also deliver extra 
functionality, such as gray-scale gamma correc- 
tion, temperature compensation, edge detection. 
The devices are typically only 1-2cm in diagonal 
and are used for projectors and camera viewfind- 
ers. Keeping such low dimensions enables the 
ultrahigh resolution of the silicon circuitry indus- 
try to be applied, with typical pixel pitches for an 
extended graphics array, 1024 x 768 projection dis- 
play being below 3 um. 

LCOS spatial light modulators are the active ele- 
ment in light projectors for conferences and home 
cinema systems. This is one of the largest niche 
markets for LCDs, despite strong competition from 
MEMS projection displays (digital light projectors). 
Recently, there has been great interest in near-eye dis- 
plays and pico-projectors, such as those used in vir- 
tual and augmented reality headsets, and for Google 
glass. Such displays tend to use a small polysilicon 
transmissive TN LCD, rather than reflective LCOS. 

The projector systems either use a single LCOS 
panel or use three panels tuned for red, green, and 
blue wavelengths and the final image combined 
using a prism system. A 45° single polarize TN can 
be used but VAN mode provides the high contrast 
needed for home cinema, as utilized by the top 
of the range projectors from Sony and JVC [167]. 
Considerations such as viewing angle and color 
balance become inconsequential in such projec- 
tion systems, where speed and contrast are the 
dominant factors and performance can be opti- 
mized for individual color bands. 

A different approach to color uses a ferroelec- 
tric LCOS. Rather than using three separate mod- 
ulators, the fast response time of theFLC is used 
to provide operation at 360 Hz, thereby allowing 
frame sequential color when illuminated by 120 Hz 
alternating color band illumination [168,169]. This 
is used for helmet mounted virtual reality (VR), 
binocular displays, and viewfinders by Forth 
Dimension Displays and Cinoptics. 

LCOS has also proven successful for nondisplay 
applications, particularly for spatial light modula- 
tors in optoelectronic systems such as wavelength 
selective switching, structured illumination, and 
optical pulse shaping [170]. 


A related technology to LCOS is the optically 
addressed spatial light modulator [171]; rather 
than electrically addressing each pixel, the device 
is addressed using incident light. The LCD includes 
unpixelated ITO electrodes sandwiching a pho- 
toconducting semiconductor and liquid crystal 
layers. In the dark state, the dielectric permittiv- 
ity of the photoconductor is low, and most of the 
electric field applied across the ITO electrodes is 
dropped across this layer, leaving the liquid crys- 
tal unswitched. However, charge is transferred to 
the liquid crystal interface where light is incident 
on the photoconductor, which switches the liquid 
crystal accordingly. Early devices used GaAs pho- 
toconductor and a nematic. Recently [172,173], an 
a-Si Optically addressed spatial light modulator 
with an FLC modulation layer has been used to 
produce dynamic computer generated hologram 
with over 10° pixels. The speed and the bistability 
of the FLC allowed the image to be built success- 
fully in a projection system using a series of electri- 
cally addressed nematic SLM. The resulting system 
remains arguably the most complex display to be 
produced, and the nearest to providing full 3D 
holographic dynamic images, which remains the 
ultimate display goal. 


6.6 LCD MODES FOR POTENTIAL 
FUTURE APPLICATIONS 


6.6.1 Introduction 


The variety of different LCDs is huge and only a 
handful has made it to commercial success, whether 
mainstream or niche. Some of these modes are 
important to describe because they include prin- 
ciples yet to be utilized (e.g., the Pi-cell, V-shaped 
switching FLC, and antiferroelectric liquid crys- 
tals (AFLCs); others are recent modes that are yet 
to find a market (such as the blue phase or flexo- 
electric cholesteric modes). It was suggested earlier 
that the principal motivation for LCD develop- 
ments has evolved from increasing image com- 
plexity to achieving the widest viewing angle, high 
resolution, and recently color depth. Alongside 
these developments has been a constant need for 
increasingly fast LCD switching. Further speed 
improvement continues to be important for gam- 
ing, VR, augmented reality (AR), and ultimately 
for glasses-free 3D displays with eye tracking for 
multiple viewers and frame sequential color. 
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Moreover, nondisplay applications of liquid 
crystals continue to grow in importance and vari- 
ety. A good example of this is the advent of liquid 
crystal lasers that have more far-reaching possibili- 
ties for optoelectronics. Brief appraisals of each of 
these modes are included in this section. 


6.6.2 Pi-cell 


Nematic liquid crystals are simple to align, usu- 
ally maintain alignment quality after receiving 
mechanical or thermal shock, and are well under- 
stood and characterized. The down side is often a 
slow response speed, particularly at low operating 
temperatures where the Arrhenius form of the vis- 
cosity dominates (Equation 6.14). An early exam- 
ple of a fast nematic mode is the pi-cell [174], where 
ON and OFF times approaching 1 ms are achieved. 

The pi-cell is the most well-known example 
of a surface mode liquid crystal [175]. It relies on 
the parallel alignment of high pretilt surfaces, 
between which is a positive Ae nematic. For low 
surface pretilts, the lowest energy quiescent state 
is predominantly splayed from one surface to the 
other. For high pretilts, a bend state of the director 
becomes favored energetically. This occurs above 
the pretilt given by [176] 


40,-—1 _ k33-—ky, 


ee (6.109) 
sin 20, k33+ky, 


which predicts that the splay and bend states are 
energetically equivalent states for 40° < 0, < 50° 
for typical calamitic nematics (0.5 < k,,/k,, < 2). 
Pretilts of this magnitude have been hard to 
achieve historically, with 0, ~ 25° typically being 
the maximum. With pretilts of this magnitude, 
the splay state is energetically favored but the bend 
state is metastable and can exist alongside the 
splayed state separated by a x disclination. With 
an applied voltage coupling to the positive Ae, 
the central director of the splayed state can tilt in 
either a clockwise or anticlockwise direction, con- 
centrating the elastic distortion close to the top or 
bottom surface (Figure 6.36b). If a sufficiently high 
field is applied, the elastic distortion may become 
sufficiently high to break the anchoring at one of 
the surfaces and allow a first-order alignment tran- 
sition to the bend or x state [177]. On removal of 
the field, the director relaxes back to the metastable 


state where the director remains vertical in the cell 
center. Whereas in this state, the director field 
close to the surfaces can be affected by the appli- 
cation of a lower field in a continuous fashion, 
thereby modulating the retardation and causing 
optical contrast between ON and OFF states, with 
a fast response. 

To understand why the pi-cell gives this fast 
response, the effect of viscous back-flow needs to 
be considered. Figure 6.36a shows the situation 
for a uniform director produced by antiparal- 
lel surface alignment. When the field is removed, 
the director is subject to both elastic and viscous 
restoring forces. Lateral flow of the material is set 
up in opposite directions in either half of the cell, 
acting to kick the director in one-half in the oppo- 
site direction to that of the other, thereby slowing 
the relaxation to the final state. This backflow, or 
“optical bounce” effect slows the response for ECB 
and TN devices significantly [68]. However, for 
the z-state, the final tilt remains vertical in the 
cell center and the flow acts in both halves in the 
same direction as the relaxing director, thereby 
hastening the relaxation process. Thus, viscous 
flow supports switching so that the device is very 
fast—typically switching in about 2-5 ms for both 
ON and OFF switching. 

The device is oriented with the alignment direc- 
tions at 45° to crossed polarizers, and the cell 
gap set to give the half-wave plate condition. This 
occurs for a higher cell spacing for the bend state 
due to the higher overall tilt of the director. The 
viewing angle of the bend state is better than the 
splayed state because of the symmetry of the direc- 
tor arrangement. That is, the change in retardation 
for off-axis light is increased in one-half of the cell 
and decreased in the other half, so that there is 
little retardation change. This self-compensation 
effect leads to the alternative term for the Pi-cell 
as optically compensated bend (OCB) mode [178]. 
Further improvements to the viewing properties 
can be achieved using film compensators [179]. 

The device is yet to be used commercially. It 
cannot achieve the high contrast ratio enjoyed 
by the IPS and VAN modes, due to the remnant 
surface retardation; even with optical compensa- 
tion, dark state leakage is too high to meet the high 
contrast ratio requirement of modern TV displays. 
Moreover, the bend state is metastable, and usually 
decays back to the splayed state after removal of the 
field. Unless a constant bias voltage is maintained, 
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Figure 6.36 The effect of backflow and the Pi cell. (a) With antiparallel alignment, there is flow align- 
ment in opposing directions on removal of the field, hindering the relaxation back to the OV uniform 
state. (b) Parallel alignment initially gives a splayed state, but application of sufficiently high field to 
break the surface anchoring allows a bend, or p-state to be formed. When the field is removed, relax- 
ation to the unswitched bend state is supported by the flow throughout the cell. However, the bend 
state is metastable, so the splay state reforms after sufficient time, usually nucleated from the pixel 


edges or LCD spacers. 


domains of the unwanted splay state spread from 
unswitched areas such as the interpixel gaps and 
close to spacers. However, the principles deployed 
in this mode remain of interest. For example, a 
transflective display has recently been suggested 
that uses subpixellation for the bend state operat- 
ing in reflection and the splay state operating with 
in-plane electrodes in transmissive mode [180]. 


6.6.3 Analog ferroelectric liquid 
crystal modes 


The SSFLC mode described in Section 6.4.5 is 
bistable: either the black or white states is retained 
after the addressing pulse. Grey scale is provided 
using spatial or temporal dither [125]. Bistability 
was integral to the original interest in FLCDs 
because it provided an alternative to TFT. However, 
once TFT had become sufficiently low cost, interest 
was retained in utilizing the fast optical speed of 
FLCs but using an analog response combined with 
the active matrix. 

Various analog FLC modes were studied as out- 
lined below. Each of the modes described suffers 


from the need to achieve and maintain uniform 
smectic layer structures, which has prevented dis- 
play applications to date. However, the high speeds 
that are possible mean that these modes may yet 
find utility in nondisplay and optoelectronic 
applications. 


6.6.3.1 N*-SMC* 


Conventional SSFLC uses an N*-SmA-SmC* to 
give a chevron layer arrangement and the implicit 
bistability that brings. If the FLC has a phase 
sequence where the SmC* cools directly from the 
unwound chiral nematic, the director remains in 
the rubbing direction at the two surfaces and the 
layers form in a uniform layer bookshelf geometry 
but with the layers angled at the cone angle 0, to 
the rubbing [181]. Cooling with an applied DC field 
ensures that a single sign of orientation is achieved. 
Thus, the device is monostable, with an analog 
response as the director switches about the cone in 
the bulk of the cell with one polarity of field, and 
no-response with the opposite polarity. This “half 
V-shaped” switching can be addressed by TFT to 
give a fast, analog response. 
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A modification is the twisted FLCD [182], which 
combines a material with a first-order N*-SmC* 
transition with rubbing directions crossed to each 
other. The unwound N* forms a conventional 
TN but, as the twist elastic constant diverges on 
approaching the SmC phase, the smectic layers 
form uniformly at 45° to the rubbing directions. 
The director still forms a 90° twist, but moves 
about the SmC* cone from one surface to the other 
with the applied DC field, leading to a “V-shaped” 
switching response. This mode works best with a 
strongly first-order transition, wherein the SmC* 
cone angle is independent of temperature and is 
typically close to the optimum 45°; this allows 
the director to lie parallel to the crossed rubbing 
directions. 


6.6.3.2 ANTIFERROELECTRIC LIQUID 
CRYSTALS 


Certain compounds that form chiral tilted smectic 
phases exhibit higher ordered phases, where there 
is correlation of the director orientation between 
adjacent layers. With AFLC, the SmC* c-director 
and polarization directions alternate by z from one 
layer to the next. Application of a DC field switches 
each alternate layer parallel to the field, thereby 
forming a V-shaped switched response [183]. 


6.6.3.3 DEFORMED HELIX MODE 
FERROELECTRIC LIQUID CRYSTAL 


Although formed from chiral liquid crystals, most 
FLC modes use sufficiently low spaced devices 
to unwind the pitch. This is not the case for the 
deformed helix mode, where the pitch is made 
sufficiently low that even devices at the quarter- 
wave plate condition wind continuously in a heli- 
cal manner. Application of the field then causes 
distortion of the sinusoidal director variation and 
a shifting of the optic axis from that of the helix 
toward +0, [184]. 


6.6.3.4 THE ELECTROCLINIC EFFECT 


SmA phases formed from chiral molecules 
undergo a field-induced director tilt with an 
applied DC field, forming a structure similar to 
the SmC* phase [185]. The strength of this electro- 
clinic effect is greatest immediately above an SmA 
to SmC* transition, where the induced tilt is linear 
with the applied field. As with the AFLC and DH 
FLC modes, the switching is within the cell plane, 
and hence it is not only fast but has good viewing 


angle. However, the limited temperature range and 
temperature dependence have so far discounted 
electroclinics from application. 


6.6.4 Blue phase TFT displays 


Attendees at the 2008 exhibition that occurs each 
year alongside the Society of Information Display 
annual conference were surprised to see an unher- 
alded novel LCD mode based on the blue phase at 
the Samsung stand. The company demonstrated a 
15in. TFT monitor operating at a ground break- 
ing 240-Hz frame rate. Liquid crystals were known 
for their slow response and most attendees were 
unaware of the blue phase. Those that were most 
likely believed that the blue phase would never 
receive such serious interest from a manufacturer 
due to the notoriously narrow temperature ranges 
exhibited by these phases. 

Blue phases are a subset of cholesteric liquid 
crystals, which occur when the cholesteric natu- 
ral helicity is strong and the pitch is very short, 
close to the transition to the isotropic phase [186]. 
Rather than spontaneously twist along a single 
axis perpendicular to the local director, the direc- 
tor twists along two mutually orthogonal axes to 
form a double helix cylindrical structure, as shown 
in Figure 6.37a. The director at the center of each 
cylinder lies parallel to the cylinder axis and the 
diameter of the cylinder is P/4 so that the director 
twists through 45° from one side to the other. This 
means that the director remains continuous across 
adjacent cylinders oriented with their axes orthog- 
onal to each other, as shown in Figure 6.37b. These 
double twist cylinders pack into a cubic array as 
shown in Figure 6.37c, mediated by disclination 
lines throughout the structure that occur at the 
interstices of the cylinders. Of the three possible 
blue phases, the simple cubic blue phase I, Figure 
6.37f, and body-centered cubic blue phase I, 
Figure 6.37d have the widest temperature ranges. 
The occurrence of the disclinations means that 
the cubic structure is only stable where the pitch 
and order parameter are very low, typically for 
about 1°C-2°C below the isotropic to cholesteric 
phase transition. Optically, the structure appears 
an iridescent color due to Bragg reflection from 
the defects arranged on the regular cubic lattice, 
Figure 6.37e; hence, the phase became known as 
the “blue” phase, although other colors are possi- 
ble depending on the lattice. Otherwise, the phase 
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Figure 6.37 Blue phase liquid crystal. (a) The double helix structure; (b) the intersection of three 
double twist cylinders and the direction of the —1/2 disclination that occurs close to the cylinder inter- 
sections; (c) cubic packing of the double twist cylinders; (d) packing of the cubic blue phase II, with 
associated defect lattice; (e) photomicrograph of the transition between blue phase | and II; (f) body 
centered packing of the cubic blue phase | and the associated defect lattice. 


is optically isotropic due to the cubic symmetry 
of the phase, and it appears dark between crossed 
polarizers. Blue phases exhibit a Kerr effect with an 
applied field due to deformation of the local cho- 
lesteric structure coupling to the dielectric anisot- 
ropy of the liquid crystal [187,188]. This induces 
a net birefringence An, with the optic axis in the 
direction of the applied field [189]: 

An; =.KE’ ~—n°E’, (6.110) 
where n is the isotropic refractive index of the blue 
phase and the Kerr constant K is also wavelength 
dependent. 

Although the Kerr effect provided means for 
electro-optic modulation, the extremely lim- 
ited temperature range of the blue phase seemed 
severely prohibitive to application. This was par- 
ticularly true for displays, where operation from 
—20°C to +70°C is usually the minimum specifica- 
tion. In 2002, Kikuchi et al. succeeded in greatly 
extending the temperature range by photopoly- 
merizing a monomer added to the liquid crystal 


at temperatures where the material formed the 
blue phase [188]. The resulting polymer formed 
at the defect sites stabilized the lattice structure 
over a wide temperature range, without affecting 
the electro-optic properties of the liquid crystal 
significantly. 

The Samsung display used in-plane electrodes 
to switch the polymer stabilized blue phase from 
an optically isotropic structure to a birefringent 
medium with the optic axis approximately in the 
plane of the cell (Figure 6.38). The device does not 
require alignment layers and gives excellent high 
contrast and wide viewing angle due to the isotro- 
pic nature of the dark state and IPS effect. The most 
important issue to resolve was achieving suitably 
low operating voltages. This was done using shaped 
protrusions as the electrodes [190], with 2 um wide 
electrodes and 2-4t1m electrode gaps. Modeling 
showed that using angled electrode walls produced 
higher transmissivity by reducing the angle of the 
optical axis at the electrode edges. Using these 
structures, operating voltages below the 15 V target 
for TFT addressing were achieved. Indeed, since 
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Figure 6.38 Principal of operation for in-plane switching mode blue phase LCD. The applied field 
induces optical birefringence (shown by the ellipsoid shape of the refractive index indicatrix). Good 
performance is achieved using shaped protrusions allowing 10V operation to be achieved [186]. 


that original work, a host of different device lay- 
outs and liquid crystal materials have been studied 
[191], indicative of the ongoing interest in the blue 
phase mode for applications. 


6.6.5 Cholesteric flexoelectricity and 
the uniform lying helix mode 


Recently, there has been much renewed interest 
in the flexoelectro-optic effect in chiral nematic 
liquid crystals, originally proposed by Patel and 
Meyer [192] in 1987. This interest stems from the 
very fast IPS effect, typically 100s. If an electric 
field is applied perpendicular to the helical axis 
of a cholesteric liquid crystal, coupling between 
the field and the flexoelectric polarization causes 
splay and bend of the local director field, which 
in turn causes the local director to tilt away from 
the helical axis in the direction normal to the field 
(Figure 6.39a). Without the field, the uniaxial optic 
axis of the cholesteric lies parallel to the helical 
axis. The field-induced local tilt of the director 
causes the optic axis to tilt through angle B in the 
plane orthogonal to the applied field. 


If the cholesteric can be uniformly aligned with 
the helical axis parallel to the plane of a standard 
LCD with transverse electrodes (uniform lying 
helix, ULH), then applying the electric field causes 
rotation of the optic axis in the cell plane, giving 
contrast when observed through polarizers par- 
allel and crossed to the helical axis. This in-plane 
change of retardation axis provides a switchable 
half-wave plate. 

Assuming negligible dielectric anisotropy, 
the twist angle is approximately linear with the 
applied field: 


wage ne)? 


a lay (6.111) 
2n( ki, + ks) 


and the response time T is 


VP? 


= —._ —_ 6.112 
2n? (ky, + Kss) ( ) 


Flexoelectric switching is polar, so the ULH can 
be driven in either direction by swapping the field 
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Figure 6.39 The cholesteric flexoelectric effect. (a) Rotation of the optic axis and (b) alignment of N* 
into the ULH texture using cusped grating surfaces [192]. 


direction. Thus, Equation 6.112 is appropriate for 
both ON and OFF switching because the optic axis 
can be driven back to the OFF state using signals of 
the opposite polarity to those for switching. Both 
expressions show that the strength of the effect 
is optimized for small pitch lengths P, promising 
speeds below 1001s and greater than 45° switch- 
ing angles for strongly flexoelectric materials 
[193], and IPS for good viewing angle. The pitch 
is unwound by the RMS effect of the field cou- 
pling to Ae, so liquid crystal materials with strong 
flexoelectric effect but low Ae are required. This 
has been achieved using nematic dimers with odd 
alkyl spacer groups [194]. 

Fabrication of devices based on this effect 
requires a method for obtaining the required ULH 


alignment uniformly and stably over the panel. 
Strongly planar anchoring at the cell walls leads 
to the Grandjean texture being formed, where 
the helical axis lies parallel to the cell normal 
(sometimes called the uniform standing helix). 
Homeotropic alignment gives degenerate focal 
conic domains, and a scattering texture. Various 
methods have been investigated, including using 
periodic planar and homeotropic surfaces with the 
periodicity matched to P/2 [195], surface relief grat- 
ings [195,196], and polymer walls [197]. The direc- 
tor profile for a cholesteric liquid crystal aligned 
on a homeotropic grating with a pitch of P/2 and 
a cusped shape to give the lowest elastic distortion 
of the ULH state [196], as shown in Figure 6.39b. 
Although yet to be proven over large areas, these 
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methods show considerable promise for future 
applications of the ULH mode LCDs. 


6.6.6 Liquid crystal lasers 


With the successful commercialization of high- 
resolution, high-frame rate LCDs for TV, interest 
in nondisplay applications for liquid crystals has 
continued to grow at an amplified rate. One such 
application particularly relevant to optoelectronics 
is the invention [198] and demonstration [199,200] 
of the liquid crystal laser. This allows straightfor- 
ward tunability of laser light across the optical 
wavelength range combined with simple low-cost 
fabrication and offers the potential for electric field 
tuning of the lasing wavelength. 

Selective reflection from the cholesteric liq- 
uid crystal is an example of a one-dimensional 
photonic band-gap, wherein circularly polarized 
light of the correct wavelength range and hand- 
edness cannot propagate through the material 
in the direction parallel (or antiparallel) to the 
helical axis. This means that spontaneous emis- 
sion from a fluorophore within the cholesteric is 
inhibited within the photonic band-gap, leading 
to photons being emitted at the band edge [201]. 
That is, optical pumping of a cholesteric with either 
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a fluorescent dye dissolved into it or as constitu- 
ent part of the mesogenic molecule causes emis- 
sion of lower energy photons that overlap with 
the forbidden band of wavelengths corresponding 
to the range of selective reflection. Where emis- 
sion occurs across the band-gap lasing will occur 
at both edges. In the example system shown in 
Figure 6.40, a tetraaryl-pyrene derivative dye [202] 
pumped at 410nm gives emission peaking in the 
range 450-500nm. Once dissolved into a choles- 
teric with a photonic band from 500 to 520 nm las- 
ing occurs on the lower band edge. Either lowering 
the cholesteric pitch or changing to a dye with a 
longer wavelength emission swaps the lasing wave- 
length to the upper photonic band edge. 

The stimulated emission occurs in a single 
direction, as shown in Figure 6.40. Such devices 
offer potential for forming laser arrays, competing 
against III-V semiconductor vertical cavity sur- 
face emitting lasers but without the need for the 
multiple fabrication steps. Indeed, the simplicity 
of fabrication allows ink-jet printing of the liquid 
crystal to form an array of laser dots, each with a 
signature emission [203]. The low cost and ability 
to print onto a plastic backing layer has already 
earned the technology commercial application for 
anticounterfeiting. 
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Figure 6.40 Lasing in dyed cholesteric. (a) Example of the optical properties for the fluorescent dye 
and cholesteric liquid crystal, leading to lasing on the lower band edge of the selective reflection and 


(b) schematic representation of cholesteric lasing. 
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Lasing is possible in other liquid crystal phases 
that include Bragg type structures with optical 
length scales, including the ferroelectric chiral 
SmC* and blue phases [204]. The former offers the 
potential for fast electric field modulation, whereas 
the latter produces lasing in multiple directions 
due to the cubic structure of the Bragg lattice. 


6.7 CONCLUSION 


The market success of LCDs is predicated by scien- 
tific and technological achievements from across 
the globe, advances that have needed collabora- 
tive efforts of mathematicians, chemists, physi- 
cists, engineers, and entrepreneurs. The breadth 
of the subject is unusually broad and involving. 
A major part of the ongoing success of LCDs is 
the continual evolution and adaptation that these 
technological advances allow. Throughout the his- 
tory of the LCDs, the naysayers have maintained a 
mantra that LCDs cannot achieve the complexity, 
cannot achieve the viewing angle, cannot achieve 
the resolution, cannot achieve the speed, and can- 
not achieve the color balance of the latest com- 
peting technology. In each case, they have proven 
incorrect. Often, this has been through the inven- 
tion of new modes, sometimes associated with 
different arrangements, sometime with different 
materials and phases. Often, the improvements 
have come from the use of partnering technolo- 
gies; the active backplanes, polarizer, and optical 
films, back-light units, light guides, or manufac- 
turing advances. 

As it has always been, the competition is strong. 
OLED displays are already making inroads into 
small, high-resolution displays for mobile phones. 
If performance, cost and lifetimes can all exceed 
those of the LCD, then they will gain an ever- 
increasing share of the market, from low to high 
diagonal displays. Low-cost reflective passive 
matrix LCDs retain a large market presence too. 
However, as the cost of the TFT backplane con- 
tinues to reduce, the competition from electro- 
phoretic displays is strong and growing. Winning 
these battles will take yet more ingenuity from the 
LCD scientists and engineers. 

There remain several display markets where 
no display technology has yet delivered, whether 
LCD or alternative. Although reflective color 
and transflective displays have been marketed, 
solutions to date have not been adequate to gain 


market acceptance. LCDs have a natural advantage 
for transflective mode operation, being based on 
transparent media that modulates ambient light. 
Electrophoretics absorb light preventing trans- 
missive operation and OLEDs emit and so cannot 
modulate reflected light. Solutions to achieving 
the required performance at a suitably low cost 
have yet to be commercialized; perhaps the devel- 
opments of new nematic modes [e.g., 180] or the 
application of a new LC phase, such as the blue 
phase HI [205], will prove successful. 

An area of enormous growth is the use of liquid 
crystals for nondisplay applications [206]. In addi- 
tion to the conventional applications described in 
the previous sections, liquid crystal main-chain 
polymers such as Kevlar continue to prove one of 
the biggest LC applications, liquid crystal elasto- 
mers are used as the active element in nanoma- 
chines and molecular motors [207], liquid crystal 
semiconductors are attracting interest for photo- 
voltaic and OLED [208], and the great interest in 
liquid crystals in chemical and biological sensors 
continues with unabated enthusiasm [209]. One 
of the largest areas of growth, however, is in the 
field of optoelectronics. Whether this is for smart 
windows, beam steering and light guiding, print- 
able lasers, switchable lenses, optical computing, 
adaptable photonic structures, or Terahertz modu- 
lators, successful solutions to such applications in 
the future will require both the ingenuity and the 
multidisciplinary approach that made our prede- 
cessors so successful. 
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7.6.1.1. Dynamic range/speed 255 
7.6.1.2 Resolution-speed 
trade-off 256 
7.6.1.3 Dynamic range- 
resolution 256 
7.1 INTRODUCTION 


Spatial light modulator (SLM) technology has 
made tremendous progress in the past few years. 
Thus, device technologies such as active matrix- 
driven liquid crystal devices (LCDs) and ferro- 
electric LC (FLC) devices as well as micro-mirror 
SLMs, regarded as exotic novelties less than two 
decades ago, are now commercially available and 
constitute a significant share of the display market. 

Regarded as “dream-devices” only a decade ago, 
the multiple quantum well (MQW) SLMs are now 
a reality with array sizes of 256 x 256 and stagger- 
ing frame rates of over 100kHz. 

It is interesting to note that the trend in appli- 
cations continues to be dominated by the dis- 
play applications. Optical interconnects (OIs), an 
almost unknown application at the time, have 
moved strongly forward in the last few years 
with the explosive development of communica- 
tions, while the exotic optical data processing 
application is still awaiting the opportunity for a 
break-through. 

This chapter, aimed at describing the main 
SLM technologies and applications is organized as 
follows. 

The scope and definition of the main SLM 
types is detailed in Section 7.2. The bulk part of 
this review contained in Section 7.3 describes the 
main types of SLM currently available or those 
in an advanced development stage. Particular 
emphasis is placed on the recent emerging tech- 
nologies including LC, micro-mirror or micro- 
electro—mechanical-systems-(MEMS-) based 
arrays as well as MQW devices. Short summa- 
ries of the “older” SLM technologies, including 
solid state electro-optic, magneto-optic as well 
as acousto-optic and photorefractive devices are 
also presented in Section 7.3. Novel SLM concepts 
including electro-holograms and photonic band- 
gap (PBG) materials are presented in Section 7.4. 
Section 7.5 presents the main SLM applications 
including displays, optical communication, optical 
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data processing, programmable diffractive optical 
elements, adaptive optics and wavelength image 
converters. Finally, Section 7.6 presents some of the 
fundamental limits as well as the future trends of 
this technology. 

There have been numerous reviews in the form 
of books, book-chapters and conference proceed- 
ings covering SLM technology, since the mid- 
1970s. Some of the more recent reviews (since 
1990) are given in [1-11]. 


7.2 SLM DEFINITION AND 
GENERAL DESCRIPTION 


SLMsare devices that spatially modulate the ampli- 
tude, phase or the polarization state of an optical 
beam. By “spatial modulation”, we mean either 
a one-dimensional (1D; linear array) or a two- 
dimensional (2D) spatial modulation. Although 
not yet practically implemented, a three-dimen- 
sional (3D) spatial modulation of an optical beam 
(e.g., a dynamic 3D hologram) is also possible. A 
combination of the modulation parameters such 
as phase-amplitude or polarization-amplitude 
modulators has also been demonstrated. Since 
the most important modulation parameter is 
the amplitude or intensity of the optical beam, the 
modulation of either the phase or the polarization 
state of the beam is often used to affect amplitude 
(intensity) modulation through the conversion 
of the spatial phase or polarization modulation 
into intensity modulation using interferometric 
arrangement or crossed polarizer-analyser con- 
figurations, respectively. The control or address- 
ing of the SLM is another important parameter. 
The main choices here are electrical addressing 
(e.g., electro-optical or electro-absorption SLM) 
and optical addressing (e.g., optically-addressed 
SLM or OASLM); since electronic addressing is the 
most practical choice, this form is sometimes used 
to excite an intermediary field that in turn affects 
the optical property of the beam. Examples for this 
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Figure 7.1. A reflective-mode, Si-Schottky- 
diode, LC SLM. (After Sayyah, K. and Efron, U., 
Opt. Lett., 21, 1384-1386, 1996.) 


intermediary addressing are (electro)-magneto- 
optic SLM (MO-SLM) and (electro)-acousto- optic 
SLM. The next important parameter is the read- 
out beam configuration. We normally distinguish 
between transmission-mode and reflective-mode 
devices. Next, we specify the readout operational 
wavelength of the device. Thus, in addition to 
the commonly used visible regime, SLM devices 
exist and operate in the IR and UV regimes of the 
electro-magnetic spectrum. 

Figure 7.1 shows a cross-section of a liquid 
crystal SLM. According to the categories defined 
above, the device is a reflective-mode, photo-acti- 
vated (PA) SLM, based on a crystalline Si photo- 
substrate (in a Schottky-diode configuration) anda 


Table 7.1 Main characterization parameters of SLMs 


SLM parameter 


Spatial dimension 1D, 2D, 3D 
Modulation type 
Addressing mode 
SLM configuration Transmissive, reflective 
Operational wavelength 
Optical modulator type 
optic, magneto-optic 
Driver type 


Parameter range 


Visible, IR, UV, (mm-wave) 
Liquid crystal, MEMS, MOW, photo-refractive, acousto- 


Active matrix (Si), passive matrix (Si), CCD (Si, GaAs), 


nematic LC layer as the electro-optical modulator. 
The device accepts visible-IR input imagery (400- 
1100 nm) and can modulate the readout beam to 
form dynamic imagery from UV to long IR, with 
frame rates of up to 1kHz. This particular device 
has been operated in various LC configurations 
allowing both phase and amplitude modulations 
of the readout beam. 

Table 7.1 summarizes the main characteriza- 
tion parameters of SLM devices. The main physi- 
cal mechanisms utilized in SLMs are listed in 
Table 7.2. 

Table 7.3 summarizes the main specification 
and performance parameters of SLMs. 


7.3 MAIN SLM TYPES AND THEIR 
PERFORMANCES 


7.3.1 Liquid crystal devices 


7.3.1.1 LC-SLMS: SCOPE 


As pointed out in the introduction, the main appli- 
cation of LCDs and, for that matter, of most SLM 
devices in general, is still that of displays (see Section 
7.5 later). LCs are particularly suitable for display 
applications featuring a combination of extremely 
high electro-optic coefficients, broad-band visible 
operation and response times, which will match 
human vision. Although developed mostly for dis- 
plays, LC-SLMs were and are still being studied for 
other applications as well. These include optical 
data processing, Ols, adaptive optics and program- 
mable Kinoform (binary-optical) elements, such 


Example 


Amplitude, phase, polarization conversion 
Electrical, optical (PA); (electro)-acousto-optic 


(Electro-) 
magneto-optic 


LCD, Bragg cell 


Si-backplane 


photo-conductor (Si, GaAs, BSO, ...) 
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Table 7.2 Physical mechanisms employed in SLMs 


Physical mechanism 


Electro-optic (linear) 


Examples 


PLZT [13], (KD*P) [14,15], MSLM (LiNbO3) [16,17], ferro-electric 


liquid crystal (FLC) [18] 


Electro-optic (quadratic) 
Electro—mechanical (MEMS) 


Nematic-LCD [19], cholesteric liquid crystals [20] 
Deformable mirror (DMD-TI) [21,22], grating LV [23,24], 


deformable membrane devices [25-27] 


Electro-absorption 
Electrostatic phase distort 
Electro-capillarity 
Photo-refractive 
(Electro)-acousto-optic 
(Electro)-magneto-optic 
Thermal phase changes 


GaAs (CCD-SLM) [28,29], (MQW-SLM) [30] 

Thin oil film LV [31], gel-SLM [32] 

Bistable mercury-mirror devices [33,34] 

PROM/PRIZ/PICOC [35-40] photo-MQW-SLM [41,42] 

Bragg cells (1D) [43], acousto-MOW [44] 

Magneto-optic (MO-SLM) [45] 

VO, modulator (IR) [46], LC-smectic/cholesteric transition [47], 


bubble cross-connect switch [48] 


[49] 
[50] 


Plasmon-electro-optic enhancement 
PBG modulation 
Active light emission 


OLED-array [51,52], VCSEL array [53,54] 


Table 7.3 Main SLM specifications and performance parameters 


Performance parameter Description Units 
Array size No. of horizontal/vertical pixels N/A 
Resolution LP/mm @ MTF 
Temporal response Rise/decay time, frame rate ms, Hz 
Spectral bandwidth Input/output beam nm 


Input sensitivity (PA Device) 

Contrast ratio/phase dynamic range 

Diffraction/output modulation 
efficiency 

Output uniformity 

Physical array size 

Power consumption 

Operating temperature range 

Relative humidity range 

Shock resistance 


Min. photo-activating power 


cm? mW-1, cm? J~" 
dB; bits; radians 
% 


as variable focal length lenses and beam-steering 
devices (see later in Section 7.5, SLM-applications). 

Historically, the nematic materials were the 
first to be used for SLMs and other display applica- 
tions (such as watch displays). As described earlier, 
a nematic LC layer operating as the electro-optic 
modulator is used in either a twisted-nematic (TN) 
[55] or in a controlled-birefringence configuration 
[56]. The main advantage of the TN configuration 


is in its optically broad-band operation and the 
relatively lower sensitivity to the LC thickness 
variations. 

Nematic LCDs are mainly used as either small 
(1-3in.) panels (or “light valves”) for projection 
displays or as larger (10-15in.) direct view panels 
for personal digital assistants (PDAs), lap-top and 
hand-held PCs (HPCs), or mobile phone displays. 
The LCDs for both applications are manufactured 
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in both transmissive and reflective configurations. 
Earlier versions of LC projection displays were 
based on PA configurations, consisting of a photo- 
conductive layer used to convert the incoming, 
low-light-level input image (e.g., from a small-size 
CRT) into a spatial voltage distribution across the 
nematic LC layer. A separate beam is then used to 
read out the LC modulation and project the image 
onto a large screen. Due to the rapid progress in 
large CMOS array technology, most projection 
LCDs of today are based on electrically driven or 
electrically addressed (EA) LCDs. The older gen- 
eration, PA LCDs, also called “liquid crystal light 
valves” (LCLVs) [57], are still in use for some of the 
high-end projection display applications requiring 
a combination of high resolution and high bright- 
ness such as very large screen displays and elec- 
tronic cinema. 

Electrically addressed LCDs were first based on 
the use of a passive X-Y addressing scheme [58]. 
See Volume II, Chapter 6 by J. Cliff Jones. Although 
this concept relies on the threshold voltage prop- 
erty of LC materials, the MULTIPLEXING level 
(e.g., the number of pixels in a line) is quite limited. 
With the advance in VLSI technology allowing 
large transistor driver arrays to be manufactured 
at high yield, these passively addressed LCDs are 
now replaced by actively addressed LCDs (based on 
active transistors in each pixel) in most applications. 

Two other important developments in LCD 
technology took part in the 1980s and 1990s. These 
are the discovery and development of FLC [59] and 
the development of a scattering mode, polymer- 
dispersed LC (PDLC) [60,61]. The FLC, as the name 
suggests, is based on a LC material class (smectic 
C*) having a permanent electrical dipole moment 
and thus the molecules can physically be flipped 
with the reversal of the electric field. This is in con- 
trast to the nematic LC, which responds only to the 
magnitude of the field (E) and not to its polarity, 
with subsequent longer response time. The PDLC 
is based on LC “bubbles” or “droplets” immersed 
in a polymer matrix such that upon the applica- 
tion of an electric field, their average director ori- 
entation changes from a random distribution to a 
fully-field-aligned one. By properly matching the 
refractive index of the LC to that of the polymer, 
the result is the formation of a scattering mode in 
the off-state and a transparent mode, with applica- 
tion of the field. The most important aspects of this 
new LC form are (1) the polarization-independent 


optical modulation capability and (2) the simple 
packaging allowing pre-processed PDLC sheets to 
be pre-manufactured and assembled into the LCD 
without the need for the cumbersome vacuum-fill- 
ing of cells with liquid crystal material. 

The last decade (1990-2000) has witnessed 
additional important developments in LCD tech- 
nology among which are the development of an 
efficient poly-silicon-based driver circuitry [62], 
as well as the in-plane switching mode (IPS) [63] 
and more recently a novel photo-polymerization 
alignment technique [64]. The poly-silicon driver 
technology allows an efficient electronic driver to 
be fabricated on the top of a glass-based substrate, 
thus enabling the fabrication of high performance, 
large panel LCDs for lap-top computers. The IPS 
mode significantly increases the viewing angle 
for an LCD display. Finally, the novel contact-less, 
photo-alignment (PAL) method allows the forma- 
tion of multi-LC domains to reduce polarization 
dependence. 


7.3.1.2 THE PRINCIPAL LC TYPES AND 
MODES OF OPERATION 


As the name implies, LCs represent an intermedi- 
ate phase combining the physical properties of both 
solids and liquids. Specifically, this class of materi- 
als behave as anisotropic liquids, as they have no 
translational or long range lattice order and in this 
regard they are fluids or liquids. However, both 
their electrical and optical properties are direc- 
tionally dependent or anisotropic. It is important 
to keep in mind that the LC phase is an interme- 
diate phase, which exists only within a narrow 
temperature range (a few tens of degrees). The LC 
material solidifies below this range of temperature, 
while undergoing a liquid-liquid transition above 
this temperature range, where it becomes a normal 
isotropic liquid. The LC anisotropy is tied to the 
directionality in the (average) physical orientation 
of the molecules. 

The degree of anisotropy can be described by 
introducing the LC director and its orientation L, 
which indicates the average, macroscopic orienta- 
tion of the LC molecules. 

There exist three main types of LC class, namely, 
nematic, cholesteric and smectic mixtures. A brief 
description of each class and its mode of operation 
is given in the following. For an extensive review of 
LC classes and their associated physical and elec- 
tro-optic properties, see references [65-70]. 
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7.3.1.2.1. Nematic liquid crystals 


In this class (or phase) the aggregate of the LC mol- 
ecules show directional anisotropy in both their 
dielectric and optical properties. We thus have lon- 
gitudinal and transverse polarizabilities Q and a, 
dielectric constants €) and €1, as well as refractive 
indices n) and nx. All these parameters are defined 
with respect to the long molecular axis, which prac- 
tically coincides with the average orientation of the 
LC director. Positive-dielectric anisotropy NLCs 
are characterized by having: Ae = €) — €1 > 0, and 
similarly for Ao and An. 

It should be noted that both positive and nega- 
tive dielectric and refractive index anisotropies 
can be found in the LCs. Of particular interest are 
NLCs having a negative Ae but positive An. 

These are sometimes referred to as “negative- 
anisotropy NLCs”. 

The director orientation coincides with the 
optical axis of the refractive index ellipsoid. Thus, 
one usually substitutes the ordinary (m)) and the 
extraordinary (n,) refractive indices for n. and Np 
respectively. 

NLCs do not have a permanent dipole moment. 
Thus, the application of an electric field E beyond a 
threshold level E, induces a dipole moment p, pro- 
portional to E. The field then exerts a torque pro- 
portional to |p x E| ~ |E|?, which re-orients the LC 
molecules or, macroscopically, the LC director L. 
This re-orientation results in an effective tilting 
of the refractive index ellipsoid, thus varying the 
effective birefringence, as experienced by the inci- 
dent polarized optical beam. 

The threshold field, E, is given by 


E, = (1/d),/[k /eoAe] . Here, d is the thickness of 
the LC cell and k is the appropriate elastic constant. 
One can define a thickness-independent threshold 
(or critical) voltage, V,, = E.d (rather than a thresh- 
old field), beyond which the applied voltage starts re- 
orienting the LC director. The turn-on and turn-off 
response times, Toy and Topp are given by ton Toy 
t* A/[V/V,,) — 1], where t* = y/t?kAT opp. 

The quadratic dependence of the torque exerted 
on the LC director by the applied voltage results in 
two important characteristics of NLCs: 


1. The NLC’s orientation responds to the magni- 
tude or RMS value of the applied voltage and 
is therefore insensitive to the polarity of the 
applied voltage or field. 


2. The turn-on response time is inversely pro- 
portional to V’, whereas the off-state or the 
turn-off time is independent of the applied 
voltage. This sets the lower limit for the total 
response time of NLCs in practical devices to 
milliseconds. Such response is ideal for display 
devices, but is too slow for optical process- 
ing, as well as most telecommunication switch 
(cross-connects) applications. 


There exist three basic configurations of NLC cells: 


1. Planar configuration in which the NLC mol- 
ecules are homogeneously aligned parallel to 
both electrode planes in a certain alignment 
direction, which can be physically imposed by 
a particular treatment of the LC cell electrodes. 

2. Homeotropic configuration in which the NLC 
molecules are uniformly aligned perpendicular 
to the cell electrodes. 

3. A twist configuration in which the parallel 
alignment orientations of the NLC molecules 
at the entrance and exist cell electrodes or win- 
dows differ by an angle $.. 


This configuration is formed by fabricating align- 
ment layers with different orientations on the two 
cell electrodes. The NLC molecules then align in 
planes parallel to the cell electrodes, where in each 
plane the director orientation is rotated (or twisted) 
relative to the preceding plane. This results in the 
director re-orientating or twisting continuously 
from the alignment orientation at the entrance 
electrode, to the alignment direction at the exit 
electrode. Typical twist angles are 45° (hybrid-field 
effect); 90°, 180° (a-cell) and 270° or above (super- 
twisted NLC = STN configurations) [71]. 


7.3.1.2.2 LC alignment 


As is obvious from the previous discussion, one 
must define the LC alignment within the cell in 
order to make use of its electro-optical proper- 
ties. This is accomplished by generating alignment 
“marks” at the boundary cell surfaces, which then 
“anchor” the adjacent LC molecules in those pre- 
defined directions, thereby affecting the alignment 
of the bulk LC material within the cell. Obviously, 
such alignment via anchoring to the surface cannot 
be sustained over large distances. Thus, practical 
LCDs are physically constrained to a thickness of a 
few tens of micrometres (seldom exceeding 20 um). 

There exist several methods of LC alignment. 
The most important ones are 
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Table 7.4 Summary of LC alignment methods 


Alignment method 


Advantages 


Disadvantages 


SAD Excellent tilt control Complex process, difficult to 
mass-produce 
BUF Simple, demonstrated production Requires mechanical contact, 
difficult tilt control 
PAL Non-contact, amenable to mass- Low anchoring energy, tilt 
production, remotely programmed control, long-term stability 
orientation, multi-domain 
LBL Natural method for homeotropic Difficult to mass-produce, 
alignment complex process, tilt control 
IBE Good tilt control, demonstrated 
production 
GRS Lithographic process Complex process, difficult to 
mass-produce, optical losses 
(scattering) 
1. Shallow angle (SiO,) deposition (SAD) [72] surface, and the difficulty of a precise control of the 
2. Mechanical rubbing or buffing (BUF) [73] pre-tilt angle (around 88°-89°), which is crucial for 
3. Photo-alignment (PAL) [74,75] a proper operation of LCDs’. 
4. Langmuir-Blodgett (LBL) process [76] This form of alignment, as well as some of its 
5. Ion-beam etching (IBE) of polymer films [77] derivatives (such as the hybrid alignment), are 
6. Formation of a grating relief structure attractive for LCD display technology, as they can 


(GRS) [78] 


The relative advantages/disadvantages of each are 
given in Table 7.4. 

Parallel or homogeneous alignment. This is the 
simplest form of alignment in which the surfaces of 
both electrodes are aligned in the same direction. 
This alignment usually employs positive-dielectric 
anisotropy LC material, whose director (normally 
parallel to the surface and in the direction of the 
alignment marks) would tend to tilt towards the 
electrodes, upon the application of an electric field 
or voltage. 

In order to avoid the formation of opposite tilt- 
angle domains, a small (1°-2°) pre-tilt angle with 
respect to the electrode surface is usually defined 
to be at a 180° difference between the two elec- 
trodes. Since in this alignment the LC molecules 
are anchored parallel to the electrode surfaces, the 
alignment mechanism makes use of the largest foot- 
print of the elongated LC molecules and is therefore 
the most powerful (highest anchoring energy). 

Homeotropic LC alignment. The homeotropic 
or tilted-perpendicular alignment (TPA) of LCs is 
a particularly difficult alignment, both due to the 
small foot-print of the molecules on the aligning 


lead to a very high contrast and reduced angular 
sensitivity. Several methods of implementing the 
homeotropic alignment are given in [79-88]. 


7.3.1.2.3 The electro-optic effect in 
NLC cells 


The nematic LC molecular orientation becomes 
distorted under the application of an applied volt- 
age (the so-called Fredericksz transition), with the 
LC director tilting at an angle 8(V). Although the 
tilt-angle varies across the LC cell thickness, one 
can approximate the variation of the effective bire- 
fringence An,,(V) of the cell by 


Ange [0(V)]= nen, / {n, cos*[0(V)] 


+ nz sin’[0(V)]}"? — n, 


“For homogeneously aligned (or planar-aligned) LC 
configuration, the pre-tilt angle guards against the for- 
mation of multiple domains in the LC layers, which is 
cosmetically unacceptable for displays. However, for 
the perpendicular TPA alignment, the pre-tilt control 
is much more critical, in that it determines the final in- 
plane orientation of the LC director. This orientation 
must be predetermined and controlled in order for a 
display device to function properly. 
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assuming a spatially uniform tilt-angle, 8(V), 
throughout the cell. 

In the case of a parallel-aligned configura- 
tion, assuming a positive dielectric anisotropy 
LC, the director will be re-oriented from a par- 
allel alignment (0(V) = 0°, for: V < Vj) to an 
almost perpendicular alignment with respect to 
the cell electrodes (6(V) = 180°), at sufficiently 
high field or voltages (typically: V > 10 Vani). 
This change which results in An,,[6(V)], chang- 
ing from: An, = An,,,, = 1. — 1, to An, = 0, can 
be used to effectively rotate the polarization plane 
of the incoming optical beam. Thus, the inten- 
sity of an incoming beam polarized at 45° to the 
alignment direction of the (parallel-aligned) LC 
will be proportional to I = I, sin?(A@>/2), follow- 
ing a crossed-analyser behind the LC cell. The 
retardation angle, Ag, can be approximated by 
Ag> = 2nd{An,,{6(V)] — n,}/A, using the uniformly 
aligned LC cell approximation, with a tilt-angle 
given by 0(V). 

By aligning the input polarization along with 
the LC alignment direction, this configuration can 
be used for a pure phase modulation, where the 
voltage-modulated phase, Aq, is given by the same 
expression as above. 

The same basic applications for intensity 
(polarization-rotation) modulation or a pure phase 
modulation can be accomplished using the homeo- 
tropically aligned NLC arrangement discussed 
earlier. In this case, one needs to use a negative- 
anisotropy LC in order to accomplish a director 
re-orientation by the application of an electric field. 

The effect of continuously controlling the effec- 
tive birefringence in a homogeneously aligned cell 
(whether parallel or, perpendicularly aligned) is 
termed the “controlled birefringence” or CB-effect. 

The TN configuration is formed by adding a 
chiral (helically structured) chemical agent into a 
nematic mixture. In this configuration, the orien- 
tation of the polarization plane of an optical beam, 
polarized either parallel or perpendicular to the 
alignment at the entrance window, will follow the 
director orientation rotation caused by the twisted 
structure across the LC cell, exiting with the plane 
of polarization aligned with the LC alignment 
direction at that exit window. 

This optical rotation effect, which occurs in the 
absence of an electric field, is subject to the con- 
dition: U= ndAn/)0., > 1 (limit), where 0, is the 
total twist angle in the cell. 


Defining the rotatory power, P,, as the fraction 
of the optical beam intensity whose polarization 
gets rotated along with the twist angle as previously 
described, it can be shown that this rotatory power 
can be approximated by P, » U?/(1 + U’), for both 
regimes of: U>> 1 and U « 1 [68, p. 372,89,90]. In 
particular, for U > 1, the rotatory power level is 
almost unity, showing that the polarization plane of 
the incoming beam is almost perfectly rotated with 
the twist. The effect of applying an electric field is to 
tilt the director perpendicular to the (almost plane- 
parallel) layers, towards the direction normal to the 
cell electrodes. This results in the twist arrangement 
becoming increasingly distorted and eventually 
breaking up at sufficiently high fields. The destruc- 
tion of the twist structure at high field destroys the 
polarization property and thus allows the use of the 
locally applied voltage to modulate the polarization 
rotation or the intensity with the use of a proper 
polarizer—analyser arrangement. 

It should be pointed out, however, that in this 
arrangement one cannot simply use the effect for 
pure phase modulation except for very low applied 
fields [91]. 

Another important difference between the con- 
trolled birefringence and the TN effect is that the 
latter can be used (subject to the above condition) 
to rotate the plane of polarization in a wide spectral 
window (e.g., throughout the entire visible spec- 
trum of 450-650 nm). The CB-effect, on the other 
hand, is by its very nature a wavelength-dependent 
effect as can be seen by the A-dependence of the 
retardation A@ on the previous page. 


7.3.1.2.4 Cholesteric LC 


A cholesteric LC (CLC) mixture is formed by the 
introduction of a relatively large concentration of 
a chiral agent, resulting in a typical enlargement 
(numerous 22 cycles) of the twist angle. In the CLC 
configurations, the degree of the rotational twist is 
characterized by the period of the twist rotation, 
namely, the pitch P,, measured in micrometres, 
and related to the twist angle 0, by 0, = 2nd/P,. It 
has been shown [92,93] that 


1. The pitch, P,, of the helix formed is inversely 
proportional to the concentration of the chiral 
component in the nematic mixture. Typical 
values of the pitch are between 0.5 and 50 pm 
and can be attained by varying the chiral com- 
ponent concentration between 20% and 0.5%. 
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2. The transmission of the planar cholesteric 
configuration (where the helical axis is perpen- 
dicular to the cell surfaces) is high, except in 
the spectral regions where A © P, and where a 
high reflectivity of a circularly polarized light 
is observed. 

3. The twist or helix structure, of the cholesteric 
phase can be completely suppressed by a suf- 
ficiently large electric field. In this case, the LC 
director (for a positive-anisotropy nematic, 

Ae > 0) lines up with the field (perpendicular 
to the cell electrodes), with an effective refrac- 
tive index of n,, relative to the incoming optical 
beam. 


Upon application of a low electric field, the planar 
structure turns into a group of randomly oriented 
helical clusters that strongly scatter light. This scat- 
tering phase is termed “focal conic” texture. The 
capability of forming both a planar texture (at 
zero field), which with sufficiently long pitch can 
be used as a transparent state, and the use of the 
field-excited focal conic is the basis for the recent 
interest in using cholesteric materials as a bistable 
optical modulator. In order to obtain stable, zero- 
field states for both focal-conic and planar tex- 
tures, polymer stabilization of the LC structure is 
used [94,95]. 


7.3.1.2.5 Smectic materials 


Smectic materials possess an ordered layered struc- 
ture. While there exist several subclasses of smec- 
tic phases, the two important ones are smectic A 
and C (SmA, SmC). These differ by the angle in 
which the molecules tilt with respect to the normal 
layers. A crystalline order, namely, a well-defined 
layer period, exists in both the smectic phases. It 
should be pointed out that the smectic and nem- 
atic phases may be based on the same material, 
of which SmA and SmC may constitute different 
phases, at different temperature ranges. The degree 
of disorder in the phases, is increased with increas- 
ing temperature, SmC being at the lowest tempera- 
ture end and, hence, the most ordered phase; SmA 
follows as a more disordered smectic phase, ending 
at the highest temperature range with the nematic 
as the lowest ordered, anisotropic LC phase. 

As for the optical properties, SmA is a uniaxial 
material with its extraordinary refractive index 
perpendicular to the layers’ planes, while the 
refractive indices in both in-plane axes are equal 


to the ordinary index. This symmetry, however, is 
lost in the SmC phase due to the molecular tilt in 
the smectic planes. Of particular importance is the 
chiral SmC or SmC* which is formed by adding a 
chiral (helically structured) chemical agent to the 
SmC mixture. In this case, the mirror symmetry of 
the SmC is lost and the formation of a spontaneous 
dipole moment is possible without the presence of 
an external electric field. The existence of a perma- 
nent dipole moment 1 (which is reversible by the 
application of an electric field) results in a direc- 
tional (polarity) dependence of the LC director on 
the externally applied electric field, through the 
generated torque T, where T=! XE. This then leads 
to on- and off-response times both being inversely 
proportional to the field: t(E) = y/p|E|. This case, 
in contrast to the nematic LC behavior, where the 
off-time is independent of the applied field, having 
an effectively lower (field-induced) dipole moment, 
leads to significantly faster response times of ~10- 
100 ps. 

The first concept and demonstration of a usable 
FLC configuration was published in 1980 by Clark 
and Lagerwall [96]. In this configuration, the 
researchers constructed a thin FLC cell in which 
the helical structure was suppressed, resulting in a 
uniform distribution of the director across the LC 
cell. This arrangement, termed “surface stabilized 
FLC” (SSFLC), allows two field-dependent, LC 
director orientations separated by 20, where 0, is 
the tilt-or cone-angle of the FLC. The LC director 
can switch between both states by the application 
or reversal of the externally applied electric field. 
The response times for switching are of the order 
of 100 ps for typical applied fields (5-10V pm). 
Despite being limited to binary operation, the 
SSFLC configuration has been the principal mode 
of operation for FLC-SLMs for a variety of applica- 
tions, in particular for LCDs. In this last applica- 
tion [97], pulse width modulation (PWM) methods 
making use of the fast response of the FLC relative 
to the required frame time of ~20 ms are used to 
enable the required grey scale operation. 

Another mode of operation is based on using 
the SmA very near to its SmC transition point. In 
this configuration (electro-clinic effect) [98], the 
LC director orientation, which is still uniform 
throughout the cell, tilts proportionately to the 
applied electric thereby allowing continuous grey 
scale operation. At the same time, one can attain 
response times comparable to those of the FLC 
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devices since the LC molecules already possess a 
permanent dipole moment. 

A mode of operation employing SmC* or FLC 
materials was developed by Funfschilling and 
Schadt [99] in Switzerland, in 1989. This mode, 
termed “deformable helix FLC” (DHF), does not 
constrain the natural helix formation in a SmC 
cell, but makes use of the fact that the helix will 
tend to get distorted upon the application of an 
electric field. Thus, in the short-pitched helix cell 
formed (such that p =A), this field-induced distor- 
tion results in a change in the orientation of the LC 
director, averaged over the wavelength size LC sec- 
tion observed. This orientation change can be used, 
similar to the controlled-birefringence effect in a 
nematic LC cell, for polarization rotation operation. 

Very fast speed of response (a few microseconds), 
as well as grey scale operation, can be attained in 
this DHF mode, at the expense of a relatively low 
dynamic range. The latter is a consequence of the 
averaging effect over the LC director. 


7.3.1.2.6 Polymer-based LC 
configurations 


Recent advances in both polymerization and 
lithographic techniques resulted in the introduc- 
tion of polymers into LC structures. In general, 
the advantage of introducing polymers into LC 
mixtures is the formation of mechanically self- 
supported structures, as opposed to regular LC 
fluids requiring external mechanical support in 
the form of cell window glasses or plastics. The 
first concept, introduced over a decade ago, was 
that of a PDLC [100-104]. This concept is based 
on the phase separation which occurs between the 
LC and polymer materials and which results in the 
formation of micrometre or, wavelength-size LC 
droplets within the surrounding polymer matrix. 
The original concept was based on the use of nem- 
atic LC materials. However, in later PDLC versions 
other LC phases such as FLCs and CLCs are also 
utilized. In the NLC-PDLC version the electro- 
optic modulation effect is based on matching the 
ordinary NLC droplet index of refraction to that 
of the surrounding polymer matrix. Since, in the 
absence of an external electric field, the LC direc- 
tor orientation in the droplets is randomly distrib- 
uted, the droplet index will differ (on average) from 
that of the polymer matrix. Thus, the PDLC will 
strongly scatter in this state. Upon application of 


an external electric field, the LC directors in the 
NLC droplets become oriented with the field (e.g., 
parallel to the field direction, for a positive-anisot- 
ropy LC). In this state the droplets now appear to 
have an ordinary index matching to that of the 
surrounding polymer matrix, for optical beam 
incident perpendicular to the cell window. This 
arrangement therefore allows a two-state, polar- 
ization-independent, transparent/scattering opti- 
cal modulation. 

Other types of polymer-stabilized LC structure, 
in particular, a reflective, polymer-stabilized cho- 
lesteric texture (PSCT) for a scattering/narrow- 
band reflective, dual-mode bistable LC modulator, 
have recently been intensively studied for low- 
power-consumption displays [105]. One potential 
application is that of electronic paper. 


7.3.1.3 LIQUID CRYSTAL SLMS 


In the following, we will present a brief description 
of the main LC-SLM structures. For an extensive 
review of this technology, the reader is referred to 
the latest symposia of the Society for Information 
Displays over the last few years [106] where numer- 
ous papers on these structures were published. 


7.3.1.3.1 Electrically addressed LCDs 


A schematic of a reflective-mode, electrically 
addressed, LCD is shown in Figure 7.2. 

A polarized readout beam enters the device from 
the top and is reflected from one of the electrode/ 
mirrors placed at the bottom of the device. These 
electrodes addressed with different voltages affect 
the LC whose director tilts as shown in proportion 
to the increasing voltage (from left to right). The 
alignment layers control the orientation of the LC 
director at the electrode surfaces. 

The LC driving is accomplished using mostly 
“active matrix” schemes which are based on the use 
of thin-film circuitry with one or more MOS-based 
transistors in each pixel node. The circuitry typi- 
cally employs a row/column drive timing scheme 
in which each row is sequentially addressed such 
that within each of the addressed rows the column 
pixel transistors are sequentially addressed. The 
MOS transistors in each pixel allow the sequential 
gating of the signals into each of the pixel’s storage 
capacitors. The addressing signal, which must be 
alternating in its polarity (AC) to avoid LC deterio- 
ration, is typically 1-10 V RMS. 
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Readout beam 


Electrodes/mirror 


Figure 7.2 Schematic of a electrically addressed, reflective-mode LCD. 


The silicon substrate in the early direct view 
LCD panels was amorphous silicon which, due 
to its limited performance (carrier mobility) has 
recently been replaced by poly-crystalline Si, 
grown on a high-temperature glass substrate. 

An important consideration for a Si-based 
driving circuitry is that it must be well shielded 
from the readout beam. In a transmissive-mode 
device this implies a mandatory loss in the area- 
or aperture-factor of the pixels, and therefore in 
the optical efficiency, as the circuitry region of the 
pixel cannot be used for light transmission. 

This results in an increasingly high loss of 
optical efficiency for smaller pixel-size arrays, 
as the (optically shielded) circuitry occupies an 
increasingly larger portion of the total pixel area. 
Reflective-mode LCDs, on the other hand, can be 
made optically efficient regardless of their pixel 
size. 

The LC employed in the direct view displays 
is typically in a TN configuration, which allows a 
cell-thickness-independent, broad-band (450-650 
nm) operation. Pixel-level-RGB filters depos- 
ited on the front glass allow color operation to be 
accomplished in conjunction with spatially aligned 
activation of the RGB sub-pixel drivers. Finally, a 
crossed analyser at the exit plane, in conjunction 
with the input polarizer, converts the polarization- 
rotation at the pixel-level into spatially modulated 
intensity (with full color). 

As pointed out above, there is a significant 
structural difference between direct view LCDs 


= 


LC alignment 
layers 


ar 


(e.g., a lap-top or note-book screen) and projec- 
tion panel LCDs. While lap-top displays are large- 
aperture (25-40 cm), transmissive type, projection 
display panels are typically of smaller size (2-8 cm 
in linear dimension), usually of a reflective type. 
These relatively smaller size panels are currently 
based on a single crystal silicon substrate (the 
so-called LC on silicon technology (LCOS)). The 
driver electrodes are usually utilized as the reflec- 
tion layer with an appropriate metallic coating. 
The LC mode can either be a hybrid field effect 
mode [107] first used in the early LCLVs, vertically 
aligned nematic exhibiting a relatively wide field of 
view (FOV), or more recently, in-plane-switching 
configuration [63]. FLC as well as PDLC devices 
were also demonstrated in small LCD panels for 
projection displays. FLCs are also currently pro- 
duced for video camera direct view applications 
[108]. An ultra-thin-crystalline Si-substrate tech- 
nology recently developed [109] allows very small 
transmissive panels to be used for projection and 
head-mounted display applications. 

The above class of electrically addressed LCDs 
encompasses the use of the different LC configu- 
rations discussed earlier: in particular, TN cells, 
controlled-birefringence operation, as well as FLC 
[110] and PDLC [111] configurations. Finally, one 
should mention that in addition to the traditional 
use of MOS- or CMOS-based active matrix driving 
schemes, the use of a CCD-based circuitry matrix 
to address an LCD has also been demonstrated 
with an array size of 256 x 256 [112]. 
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Table 7.5 Typical performance of electrically addressed LCD projection panel 


Performance parameter Description Units 
Input/output Electrically addressed, output intensity N/A 
modulation 
Device configuration Reflective mode N/A 
Array size Up to 2000 x 2000 Pixels 
Optical modulator Nematic liquid crystal (TN configuration) 
Driver array Single-crystal CMOS array 
Resolution 25 LP mm! @ 20% MTF 
Temporal response Rise/decay time: 0.5-10 ms, frame-rate: ms, Hz 
30-300 Hz 
Spectral region Visible, 450-650 nm 
Input sensitivity (PA device) N/A N/A 
Contrast ratio 100:1 (40 dB) dB 
Output modulation efficiency 10-30 (unpolarized, collimated beam) % 
Output uniformity 1-3 % 
Aperture size 2 x 2 (for a 1024 x 1024 array @ 20 um cm x cm (pixel-size) 
Power consumption ~500 mW 
Operating temperature range 10-40 2G 
Shock resistance 1-5 G 


Typical performance ofan electrically addressed ee Dielectric 

LCD panel for projection displays is shown in insulating mirror 

Table 7.5. layers Transparent 
conductive 

7.3.1.3.2 Photo-activated LCD SLMs — Dae 

As pointed out earlier, the first type of LC-SLM projection Writing 

developed was actually the PA SLM configu- light light 

ration developed at Hughes Aircraft Research 3_ > =— 

Laboratories during the late 1970s [57]. These = > = 

devices were the first LC-SLM products for both = >= = 

commercial and military uses. A typical reflective- is = 

mode, PA SLM structure is shown in Figure 7.3.  <«— —» <_ 

The structure of such an SLM commonly known =~ > 2s 

as an LCLV consists of four layers: a photocon- 3 > = 

ductor (PC); a light blocking layer (LBL); a mirror == = 

(typically dielectric) (DM) and the LC layer. This 

structure is sandwiched between two glass elec- ~<j— Glass 

trodes providing both the mechanical support 

and the necessary sheet-conductivity or surface- a ieaaal Photoconductor 

electrode function. The operation of the device is a eas 

as follows: under low illumination level at the input Transparent 

(PC port) most of the bias voltage drops on the conductive Light blocking layer 

(high-resistivity) PC layer, and as a result the LC ae ra 

ays . iquid 

is biased below threshold level and is consequently exystal 


not activated. Upon illumination of a region or 
spot on the photoconductor, this section of the PC Figure 7.3 The Hughes PA LCLV. 
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Table 7.6 Performance parameters of the PA, a-Si LCLV 


Performance parameter Description/value Units 

Device configuration Reflective-mode N/A 

Optical modulator Nematic liquid crystal N/A 

Driver array Amorphous Si-photo-substrate N/A 

Array size 2000 x 2000—equivalent Pixels 

Resolution 25 LP mm-' @ 20% MTF 

Temporal response Rise/decay time: 0.5-10 ms, frame-rate: ms, Hz 
30-300 Hz 

Spectral bandwidth Input beam: 550-700, output beam: 200 nm 
450-650 

Input sensitivity (PA device) 0.1-0.3 m W cm-? 

Contrast ratio 200:1 (40 dB) dB 

Output modulation efficiency 10-30 (unpolarized, collimated beam) % 

Output uniformity 1-3 

Physical array size Diameter ~50 mm 

Power consumption ~100 mW 

Operating temperature range 10-40 °C 

Shock resistance 1-5 G 


becomes conductive and the voltage now shifts to 
the LC and the LBL/DM layers. The latter, being 
thin, takes only a relatively small fraction of the AC 
voltage, which now drops mostly on the LC layer 
activating those areas aligned with the illuminated 
sections of the photoconductor. The activation of 
the LC layer results in a change in the polarization 
rotation of the LC leading, with the use of a polar- 
izing beam splitter, to an intensity modulation of 
the out-going optical beam. 

In the particular case of the Hughes LCLV device 
shown in Figure 7.3, the two LC configurations com- 
monly used are the hybrid-field-effect mode using 
a 450 TN configuration [57] or, alternatively, the 
tilted-perpendicular mode based on the controlled- 
birefringence effect in a vertically aligned, negative- 
anisotropy LC, which has been in use more recently 
[113]. These devices initially employed CdS as the 
photoconductive layer which has been replaced by 
an amorphous-silicon layer. A fast-response version 
of the device for color-sequential mode operation 
based on single-crystalline silicon photo-substrate 
was also demonstrated [114]. Due to the simplicity 
of an electrically addressed system on the one hand, 
and the rapid improvement in the yield of large 
CMOSarrays on the other, the trend today is towards 
electrically addressed LCD systems for projection 


displays. However, high-end applications requiring 
exceedingly large array size (>2000 x 2000) and high 
brightness (>5000 Im) are still a hard reach for elec- 
trically addressed LCDs. The “old-time” PA LCLVs 
are therefore still potential candidates for such 
demanding projection display applications such as 
electronic cinema [113]. In addition to reflective- 
mode devices, transmissive-mode PA SLMs have 
also been demonstrated [115-117], using a variety of 
semi-transparent photoconductors including amor- 
phous silicon and BSO. The latter was developed at 
the Thales Research Laboratories, under Professor 
J-P Huignard. These devices were constructed using 
nematic, FLC and PDLC LC-modulators. 

The typical performance of a PA a-Si LCLV is 
given in Table 7.6. 


7.3.2 MEMS-based SLMs 


7.3.2.1 GENERAL 


The origin of these devices occurred after various 
attempts in the late 1970s to use continuous mem- 
brane mirrors, locally deformed under the applica- 
tion of local electric fields, as a means to modulate 
the reflectivity or the phase of the reflected opti- 
cal beam [118]. This concept was expanded in the 
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early 1980s to include the first MEMS-type struc- 
tures in silicon [119,120]. This MEMS structure 
is essentially an array of silicon-based pixel-level 
metallized cantilevers which tilt in response to an 
electric potential applied to the pixel electrode. 
This MEMS structure which started with analogue 
tilt response design was later refined by the Texas 
instruments developers as a binary (digital) device 
with only two states. This recent design, combined 
with a complex driving circuitry to allow grey scale 
operation based on a PWM scheme, is the basis for 
today’s high-brightness, high-resolution projectors 
aimed at such ambitious goals as electronic cin- 
ema [121]. The other principal MEMS-type SLM is 
based on the pixels constructed in a shape of inter- 
digitated fingers. In this configuration, the applica- 
tion of an electric field across this digitated finger 
structure results in the formation of a diffraction 
grating with a variable field-dependent depth. Thus 
diffraction of the incoming optical readout beam, 
rather than its deflection, constitutes the novelty 
of this MEMS structure. This “grating light valve” 
(GLV) device [122,123], conceived and developed 
by Bloom, is now under development. 

Finally, another recent related development has 
been reported, which is developing a similar struc- 
ture of a MEMS-silicon-based micro-mirror SLM. 
However, rather than using an electrostatic field to 
drive the micro-mirrors, the device uses a piezo- 
electric micro-transducer fabricated in each pixel 
[124]. Finally, for completeness, we should also 
mention a recent effort by researchers to develop 
mechanical continuous membrane implementa- 
tions of SLM similar in concepts to the earlier 
attempts mentioned [125] above. 


7.3.2.2 THE MICRO-MIRROR SLM (DIGITAL 
MICRO-MIRROR DEVICE) [120,121] 


The digital micro-mirror device (DMD) SLM 
(Figure 7.4) is based on a MEMS structure that is 
fabricated using a CMOS-compatible processes 
over a driver array which is based on a CMOS 
memory. 

Each pixel modulator consists of a 16 x 16 pm? 
aluminium mirror, which can reflect light in one 
of the two directions depending on the state of the 
underlying driver circuit. In the on-state, the mir- 
ror is rotated to $, = 0° (into the system’s FOV). In 
the off-state, the mirror swings to $, = —10° (ie., 
out of the system’s FOV cone). Thus, by using the 
DMD in conjunction with a light source and a 


Mirror 


Torsion 
hinge 


Yoke Landing tip 
Figure 7.4 The pixel structure of the digital 


micro-mirror device. (After Van Kessel, P.F. and 
Hornbeck, L.J., Proc. IEEE, 86, 1686-1704, 1998.) 


suitable projection optics (Figure 7.5), the mirror 
reflects incident light either into or out of the pupil 
of the projection lens. 

Thus, with the above arrangement, the on-state 
(0, = 0°) of the mirror appears bright, while the off- 
state of the mirror (0, =—10°) appears dark. 

The DMD pixel mirror is constructed over a 
CMOS driver array similar to an SRAM cell. An 
air-gap between the driver circuitry and the mir- 
ror is formed using an organic sacrificial layer. 
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Figure 7.5 The optical system arrangement for 
the DMD modulator. (After Van Kessel, P.F. and 
Hornbeck, L.J., Proc. IEEE, 86, 1686-1704, 1998.) 
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Table 7.7 Summary of the DMD-SLM performance 


Performance parameter Description Units 
Input/output Electrically addressed, readout beam, intensity N/A 
modulated 
Device configuration Reflective mode N/A 
Optical modulator Micro-electro-mechanical mirror N/A 
Driver array CMOS driver array N/A 
Array size Up to 1920 x 1080 Pixels 
Resolution 25 LP mm! @ 50% MTF LP mm! @ MTF 
Temporal response Opt. switching time: 20 ps; frame-rate: 30-300 Hz ms, Hz 
(color, 8-bit); 10kHz binary (with custom 
addressing) 
Spectral bandwidth Output beam: 450-650 200 nm 
Input sensitivity N/A N/A 
Contrast ratio 100:1 (20 dB) dB 
Output diffraction efficiency 60 % 
Output uniformity 1 % 
Physical array size ~40 x 20 mm 
Power consumption ~1 WwW 


The air-gap allows the mirrors to rotate about 
two flexible torsional hinges. The mirror is rigidly 
attached to an underlying yoke, which, in turn, is 
connected by two flexible torsional hinges to support 
posts that are formed on the underlying substrate. 

An electrostatic field generated by the underly- 
ing pixel results in a mechanical torque applied on 
the micro-mirror. This torque, applied against the 
restoring torque of the hinges, produces a mirror 
rotation in the positive or negative direction. The 
mirror and yoke rotate up to the point that the yoke 
comes to rest against the mechanical stops. 

The DMD mirrors are 16 ppm? made of alumin- 
ium for high reflectivity. They are arrayed to form a 
matrix having a high fill factor (approximately 90%) 
to attain high optical throughputs. 

Grey scale is achieved by using a PWM technique. 
This is an addressing method in which the effective 
frame rate is M times (e.g., M = 256) faster than the 
actual, visual frame rate (e.g., 30 Hz). The binary-mod- 
ulating pixels are activated in duty cycle ratios between 
1/M and 1.0 during the modulation of each visible 
frame. This, combined with the integrating response 
of the eye, creates an effective grey scale perception. 
Color operation is achieved by using color filters. 

The system can also operate in a color- 
sequential mode using sequentially flipped color 


filters, in conjunction witha single device. Both the 
PWM grey scale method and the color-sequential 
operation are possible due to the relatively fast 
electro-mechanical response of the micro-mirror 
which features typical switching times of ~20 ps. 
Thus, assuming a 3 X 8-bit operation for a 256- 
level RGB, color-sequential video projection, the 
pixel switching requirements are T,,,:,., ~ (16 ms)/ 
[(256 levels/color) x (3 colors)] ~ 21 ps, which can 
still be met by the ~20 ps switching time of the 
micro-mirror. 

As to the reliability issues related to the mechan- 
ical failure of the micro-mirror, due to the continu- 
ous flipping, TI reports that testing of hinge fatigue 
resulted in over 1 X 10” (1 trillion) cycles without 
mechanical failure. This 20 year equivalent opera- 
tion, performed in an accelerated cycling test, seems 
to indicate that hinge fatigue is not a reliability con- 
cern for the life of an ordinary DMD product. 

In the brighter, color-parallel mode, two or three 
DMD arrays are used with stationary color filters to 
produce a full color image table (see Table 7.7). 


7.3.2.3 THE GRATING LIGHT VALVE [122,123] 


The GLV technology, originally developed by 
Bloom, similar to the DMD, is based on the MEMS 
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Figure 7.6 The pixel structure of a GLV device. 


techniques, to form pixels in a silicon chip. Each 
of these pixels (typically, 25 ym size) is made up of 
multiple ribbon-like structures (Figure 7.6), which 
can actually be moved up or down over a very small 
distance (only a fraction of the wavelength of light) 
by controlling electrostatic forces. The main advan- 
tage of the GLV-MEMS technology over the DMD- 
MEMS technology is in the reduced displacement 
required of the strips (1/4A or around 0.15 pm), rela- 
tive to the DMD pixel displacement (around +1.5 
pm for the pixel tip). This leads to a substantially 
faster response speed of the order of 1 MHz or better. 

The ribbons are arranged in such a way that 
each pixel is capable of either reflecting or diffract- 
ing light. An output image is formed by collecting 
the reflected or diffracted light with an appropri- 
ate lens system, either projected onto a front- or 
rear-screen system, or to be viewed directly by 
the eye. A Schlieren optical system is used to dis- 
criminate between the two optical states. By block- 
ing reflected light and collecting diffracted light, 
contrast ratios of a few hundreds to one can be 
achieved. In an ideal square-well diffraction grat- 
ing, 81% of the diffracted light energy is directed 
into the 41st orders. By adding multiple Schlieren 
stops and collecting more orders, practical systems 
can achieve greater than 90% diffraction efficiency 
(DE). The gaps between GLV ribbons (defined by 
the minimum lithographic feature) control the 
optical diffraction efficiency. Thus, the theoreti- 
cal DE varies between 82 and 98% for ribbon gaps 
between 1.2 and 0.35 pm, respectively. 

The GLV technology can be employed in either 
digital or analogue modes. In the digital address- 
ing mode, the switching is based on the PWM 
described earlier for the DMD device operation. 


In the analogue mode, the depth to which ribbons 
are deflected is controlled by the driving circuitry. 
When the ribbons are not activated (deflected), 
the pixel is in its off-state. When the ribbons are 
deflected to (1/4)A, the pixel is fully on. Any par- 
ticular grey level can be generated by deflecting the 
ribbons to positions between these two limits. 

For display applications, the GLV is operated 
in the scanned mode, where a linear array of GLV 
pixels is used to project a single vertical column of 
the image data. This column is optically scanned 
at a high rate to produce a complete 2D image. As 
the scan moves horizontally, GLV pixels change 
states to represent successive columns of video 
data, forming one complete 2D image per scan. 
The relatively fast switching speed of GLV devices 
(of the order of 1 MHz) allows full HDTV scanning 
(1920 x 1080 image) at video rates of up to 96 Hz. 

The main performance parameters of the device 
are summarized in Table 7.8. 


7.3.2.4 THE THIN-FILM MICRO-MIRROR 
ARRAY [124] 


A thin-film micro-mirror array (TMA) display 
is currently under development by researchers at 
Daewoo Electronics in South Korea. 

The modulator technology of the TMA device 
is similar to that of the TI DMD discussed earlier. 
However, rather than using electrically activated 
micro-cantilevers, it uses micromachined thin- 
film piezoelectric actuators to control the micro- 
mirror tilt mechanism in an analogue fashion, thus 
enabling grey scale operation with over 256 levels. 
Each pixel consists of a mirror and an actuator. 
In previous designs, the two had been co-planar, 
but the improved design has the actuator situated 
below the mirror, increasing the fill factor up to 
94% and the contrast ratio to 200:1. 

The TMA uses thin-film piezoelectric actuators 
in the form of micro-cantilevers, which consist of 
a supporting layer, bottom electrode, piezoelectric 
layer and a top electrode. 


7.3.2.5 THE MICRO-MECHANICAL SLIT- 
POSITIONING SLM [126] 


This is a recent concept based on the technology 
of MEMS described earlier, where an actuator 
can programmably shift an optical slit of width 
between 8 and 100 um, thus enabling the position 
of the transmitted light regions to be changed as 
well as modulate the incoming light beam. 
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Table 7.8 Performance parameters of the grating light valve 


Performance parameter Description/value Units 
Input/output Electrically addressed, readout beam, N/A 
intensity modulated 
Device configuration Reflective/diffraction mode (1D) N/A 
Optical modulator Micro-electro-mechanical grating N/A 
Driver array CMOS driver array N/A 
Array size 1D: 1920 x 1 (2D also possible) Pixels 
Resolution 25 LP mm-' @ 50% MTF 
Temporal response Opt. switching time: down to 20 ns; ns, Hz 
line-rate: 30-300 Hz (color, 8-bit); 
10kHz binary (with custom addressing) 
Spectral bandwidth Output beam: 450-650 200 nm 
Input sensitivity N/A N/A 
Contrast ratio >200:1 (40 dB) dB 
Output diffraction efficiency Up to 80 % 
Output uniformity 1 % 
Physical array size ~50 x 0.1 (for 25 x 100 um pixels) mm 
Power consumption <100 mW 


The proposed SLM architecture can be 1D or 
2D (for imaging). The applications include micro- 
spectrometer systems as well as the next generation 
space telescope with multi- object spectrometer under 
development by the European and US space agencies. 

When an electric field is applied to the two elec- 
trodes, the mechanical strain in the piezoelectric 
layer causes a vertical deflection of the mirror. The 
response time of each pixel is 25 1s, making it fast 
enough for field-sequential color display applica- 
tions. The display, which is used to make a high- 
brightness XGA-format projector, has an optical 
efficiency of 20% at a panel size of 2.54 in. 

In order to modulate the light intensity of the 
individual mirror pixels projected on the screen, a 
field-stop is used as a light valve. When a mirror 
does not tilt, all the light reflected by the mirror is 
blocked by the field stop and the pixel is at its off- 
state (dark). 

When the mirror is fully tilted, all the light goes 
out through the projection stop and the pixel is at 
its brightest state (white). 


7.3.3 MOW modulators 


7.3.3.1 INTRODUCTION 


While this is the most recently developed SLM 
technology, it is also the most promising one for 


demanding, ultra-fast-frame-rate (>MHz) appli- 
cations such as optical data processing. It was 
pioneered in the early 1980s by Miller, who demon- 
strated, for the first time, the potential of artificially 
made stacks of quantum-size layers of alternating 
GaAs (quantum well material) and GaAlAs (quan- 
tum barrier materials) for a highly efficient, ultra- 
fast electro-absorption (EA) effect [127,128] with 
potential use for very fast SLMs. The effect was 
named the “quantum confined stark effect” (QCSE) 
by Miller. The interest at that time was that of fast 
switch or switch arrays for communications. While 
the EA effect can be observed in direct-gap semi- 
conductors such as GaAs and InP (the so-called 
Franz-Keldysh effect), it is relatively inefficient in 
requiring high-voltage switching to accomplish 
a relatively poor contrast modulation. In fact, an 
attempt to use this effect in constructing a CCD- 
driver-based, GaAs-SLM was attempted in the 
early 1980s [129]. The low contrast accomplished 
(~1.2:1) convinced the technical community that 
this is not the right path to a fast, efficient SLM 
technology. The subsequent development of MQW- 
based SLM technology [130] and the, additional 
use of Fabry-Perot structure subsequent, combin- 
ing the EA effect with the related electro-refraction 
(ER) effect, resulted in an SLM technology capable 
of gigahertz response with contrast ratio of 100:1 
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and higher, at moderate addressing voltage levels 
of ~5V. This SLM technology constitutes the only 
potential solution known today for the demanding 
optical data processing applications. 


7.3.3.2 MOW MODULATORS: PHYSICAL 
BACKGROUND 


The MQW structure, as its name implies, con- 
sists of a stack of quantum wells (QWs) i.e., a few 
molecular monolayers of a low-energy-gap “well” 
structure (e.g., GaAs), sandwiched between thin 
wide-bandgap “barrier” layers (e.g., GaAlAs). This 
structure “compresses” the excitonic wave function 
(bound electron-hole pair) due to the ultra-thin 
dimension of the well layer in which the excitons 
reside, forcing the electron-hole pair to be much 
closer to each other. This in turn, results in a much 
higher coulombic energy and, therefore, higher 
ionization energy for the excitons compared to 
their ionization energies in a bulk semiconductor 
material. This allows excitons to exist in these QWs 
at room temperatures. Furthermore, the applica- 
tion of an electric field of the order of 100k V cm! 
causes a partial separation of the electron-hole 
wave functions, thereby altering the energy levels 
of the excitons. This, in turn, alters the effective 
optical resonance of the exciton thereby shifting 
the resonant absorption wavelength. Since the 
wavelength dependence of the absorption curve 
is quite steep near the resonant wavelength, this 
shift can effectively be used to modulate the opti- 
cal absorption of the optical beam (Figure 7.7). 
The importance of this effect for SLM application 
is the fact that such modulation can be effectively 
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Figure 7.7 Field-induced absorption changes in 
an MOW sample. (After Goossen et al., Appl. 
Phys. Lett., 64, 1071, 1994.) 


performed with only a few volts of bias resulting in 
an EA effect with picosecond response times. 

The theoretical treatment of the EA effect in 
a QW predicts a quadratic effect of the resonant 
wavelength dependence upon the applied elec- 
tric field [131]. The resonant excitonic EA effect 
in the MQW structure is expected to result in a 
corresponding resonant ER effect through the 
Kramers-Kronig relationship which links the 
spectral dependence of both the absorption coef- 
ficient and the refractive index [132]. 

Another important type of MQW structure, the 
compositionally doped “nipi” structure, was initi- 
ated and developed by Ploog and Doehler [133,134]. 
This structure is based on alternating composi- 
tionally doped quantum-size layers, e.g., n-GaAs/ 
p-GaAs. Often an insulating layer (“T’-layer) is 
inserted between the n- and p-layers and hence the 
name “n-i-p-I”. The resulting stack of quantum- 
size p-n junctions, when selectively contacted with 
all the n- and the p-layers in parallel, can result ina 
very large EA or ER effect [135]. 

Finally, in reviewing related structures and 
physical effects of MQW, lower-dimensionality 
quantum well structures should also be men- 
tioned. The formation of quantum well structures 
results in the confinement of carriers to 2D “wells”, 
as pointed out earlier. From the standpoint of the 
dynamic behavior, such confinement results in the 
quantization of the density of energy states which 
now behave as a series of step functions. This effect, 
which actually contributes to the steep spectral 
absorption curve and hence to the enhancement of 
the EA, can be enhanced even more by the confine- 
ment of the carriers to a single dimension (“quan- 
tum lines”) or ultimately to zero-dimensional 
“quantum dots”. In particular, quantum dot struc- 
ture has been extensively researched in the last few 
years due to the theoretical potential of yielding 
extremely sharp delta-shaped absorption curves. 
These could result, in turn, in an extremely effec- 
tive switching and therefore in very efficient SLM 
structures. The main issue in the manufacturing of 
these structures has been the finite spread in the 
dot size distribution, which sharply reduces the 
steepness of the absorption curve [136]. 


7.3.3.3 MOQW MODULATOR: DEVICE 
STRUCTURE 


The basic device structure of an electro-absorbing 
MQW modulator in a transmissive mode is shown 
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Figure 7.8 MOW modulator. (After Miller, D.A.B., 
Int. J. High Speed Electron., 1, 19-46, 1990.) 


in Figure 7.8. The MQW structure is fabricated 
(using either MBE or MOCVD epitaxial growth 
techniques) within a p-i-n structure. This con- 
figuration allows the generation of a relatively 
high electric field created by back-biasing the p-n 
junction and extending into the low-doped, high- 
resistivity intrinsic (I) region where the MQW 
structure is situated. 

The medium-high-doped n- and p-AlGaAs 
regions constitute, in effect, transparent conduc- 
tive electrodes, as these AlGaAs-based layers have 
a wider bandgap than the QW excitonic line and, 
at the same time, feature relatively high sheet- 
conductivity due to their high doping levels. 

The transmission contrast ratio of an MQW 
sample can be roughly estimated as CR ~ exp(AaL), 
where Aa is the field-induced (negative) change in 
the excitonic absorption coefficient, and L is the 
optical path length or the MQW sample’s thickness. 

As can be seen from Figure 7.7, an absorption 
coefficient change of: Aa ~ (-)1 pm is induced 
for a voltage change of approximately 10V over a 
sample thickness of approximately 0.7 1m, which 
translates to a field variation of approximately 
~1.4x 10°Y cm”. Based on the estimated CR above, 
this absorption change translates to a contrast 
ratio of around 2:1. This is a typical value for the 
performance of a transmissive-type MQW device, 
having around 50 QW periods, with around 10V 
of applied voltage bias. For a reflective-type device, 
one can theoretically attain up to a quadratic 
enhancement of the CR since the optical path L in 
the CR equation above is doubled. 

This translates to around 5:1 in reflectivity con- 
trast for modulators operating in a reflection mode. 
To go beyond these figures necessitates either a 


large QW stack, (e.g., 200 periods allows over 10:1 
contrast in transmission [130]) or other means of 
amplifying the quantum-confined Stark effect. In 
the following, we will shortly describe two such con- 
figurations. Historically, the first attempt was to use 
the photo-current generated inside the MQW mod- 
ulator to affect the device’s bias voltage in a positive 
feed-back mechanism, so as to enhance nonlinearly 
the QCSE by having the voltage bias of the sample 
be varied in proportion to the photocurrent. These 
early self-electro-optic effect devices (SEEDs) [137] 
were designed to get a low-field nonlinear, bi-stable 
behavior in PIN-type MQW devices. 

A switching energy as low as 180 pj for a 
60 pm X 60 pm SEED has been demonstrated [138]. 
A second, common modification of the QW device 
is the integration of the device within an optical 
resonator cavity. This essentially “amplifies” the 
EA and/or ER effect by effectively extending the 
optical path via the multi-pass effect of the cavity. 

The asymmetric Fabry-Perot QW modulator 
[138-140] is based on the insertion of the MQW 
stack within an optical resonant cavity composed 
of a semi-transparent top mirror (Figure 7.9) and 
a bottom, high-reflectivity, 1/4A-stack dielectric 
mirror. In this case, the resonator’s reflectivity, RR, 
is given by Ra = [Ry (1 — R,/R,)?/(1 — R,)*], where 
Ra = VRrRz exp(—O.d) and R., Rz, are the reflec- 
tivities of the top and bottom mirrors, respectively. 

As can be seen, if the effective bottom-mirror 
reflectivity (R; exp[—2,(E)d]) is sufficiently reduced 
by the field-induced absorption a(E), to the point 
where it matches the top-mirror reflectivity: 
Ry exp(-2a(E)d) = Ry we have a zero reflectivity 
for the resonator. This capability of bringing down 
the reflectivity close to a zero level implies that we 
can expect high reflectivity contrast ratios for the 
relatively modest bias levels. The above estimate 
did not take into account the ER effect that must 
also be considered [138]. 

Contrast ratios in excess of 100:1 were obtained 
for moderate sized QW stacks and biases lower 
than 10V [140]. It should be pointed out, however, 
that such use of a thickness-sensitive, resonant 
cavity makes it difficult to accomplish adequate 
spatial uniformity of the modulator array. 


7.3.3.4 PHASE MODULATION USING MOW 


An earlier attempt to demonstrate phase modula- 
tion in MQW was reported in 1988 [141]. The dem- 
onstration of 0.21 @ 852.5 nm was accompanied as 
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Figure 7.9 Schematics of a Fabry-Perot MOW-SLM (Lenslet Laboratories-Ramat-Gan Israel: Private 


Communication 2002. www.lenslet.com.) 


expected by significant attenuation. As pointed out 
earlier, the attenuation is expected as a direct result 
of the Kramers-Kronig relationship. Thus, since 
the optical modulation effect in an MQW modula- 
tor is based on the exitonic resonance effect, one 
expects the maximum ER effect to always occur 
in the vicinity of the absorption peak. To help 
to maximize the ER effect while minimizing the 
absorption, one can define a figure of merit by 
FM = An(E)/a(E)A, which can be used to search 
for material systems (and/or spectral regions) 
with a high ratio of refractive to absorptive opti- 
cal modulation [142,143]. This formulation implies 
that operation at a shorter-wavelength region 
may hold advantage for this type of modulation. 
However, earlier attempts to use this expression in 
search of material systems exhibiting significantly 
high FM were not too encouraging [142]. A more 
recent attempt indicates a possibility of perform- 
ing phase modulation for beam-steering applica- 
tions [144]. However, the authors do not indicate 
the absorption-associated insertion losses in this 
device. 


7.3.3.5 MOW SLMS 


Earlier attempts to construct small-size arrays of 
MQW-SLM devices were made [145] (6 X 6 array) 
and [146] (128 x 1 array). 

More recent attempts of constructing and dem- 
onstrating larger-size arrays of MQW-SLMs were 
made [147] (128 x 128 array) and the successive 
effort [148,149] (256 x 256 array). The latter was 
reportedly operated at up to 300,000 frames per 


second, allowing up to 600,000 correlations per 
second to be performed on a 128 x 128 array. The 
device was based on GaAs/GaAlAs MQW stacks 
in a PIN geometry, inserted within an asymmet- 
ric FP cavity discussed earlier, with pixel sizes of 
around 40 pm x 40 pm. The group claimed up to 6 
bits of grey levels achievable on their 2048 x 1 1D 
devices. Summary of the MQW-SLM performance 
is given in Table 7.9. 

The demonstration of a 16 x 16 cross-bar 
switch “Amoeba”, for optical communications was 
recently demonstrated [150]. The switch is based on 
flip-chip bonding of a 0.8 um technology CMOS 
chip to an MQW array of detectors/modulators. 

Optically addressed MQW-SLMs were also 
conceived [138] and demonstrated [151]. 

The recent demonstration [151] is based on a 
reflective-mode, GaAs/GaAlAs MQW modulator 
constructed inside a Fabry-Perot cavity, and com- 
bined with a free-carrier trapping layer. The latter 
is sensitive to the incident, near-IR to visible opti- 
cal beam. A near-excitonic resonance probe beam 
reads out the spatial optical signal-modulated 
reflectivity of the Fabry-Perot cavity to form an 
output image. Other efforts in the development of 
MQW-SLM arrays that should be mentioned are 
[152] in developing GaInAs-based MQW modula- 
tor arrays for cross-bar switch applications. 

A recent effort is based on MQW modula- 
tor-array prototypes fabricated using the GaAs/ 
AlGaAs material system. The first prototypes fab- 
ricated are designed with an array size of 128 x 128 
pixels, made with a pixel pitch of 38 ym [153]. 
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Table 7.9 The main performance parameters of a state-of-the-art, electrically addressed MOW-SLM 


Performance parameter Description Units 

Input/output Electrically addressed, readout beam, N/A 
intensity modulated 

Device configuration Reflective-mode N/A 

Optical modulator MQW-electro-absorption mode N/A 

Driver array CMOS driver array N/A 

Array size 256 x 256 Pixels 

Resolution (est.) 25 LP mm-'! @ 50% MTF LP mm-' @ MTF 

Temporal response Opt. switching time: ~10 ps; frame-rate ps, kHz 
(CMOS limited): 300 kHz 

Spectral bandwidth Output beam center-A~ 850, bandwidth~5 nm 

Input sensitivity N/A N/A 

Contrast ratio/grey levels 100:1 (20 dB)/6 bits dB 

Output modulation efficiency ~30 % 

Output Uniformity 1 

Physical array size ~10 x 10 mm 


Source: Trezza, J.A. et al., Proc. SPIE, 3490, 78-81, 1998; Kang, K. et al., Proc. SPIE, 3715, 97-107, 1999. 


Finally, significant improvements in the uni- 
formity of FP-based MQW arrays (Figure 7.9), 
reducing optical nonuniformity to 3.3 nm across 
the 4in. wafer, have recently been demonstrated 
[154]. These devices operate at low voltage (2.1- 
3.9V), producing a reasonably high contrast ratio 
(16:1-98:1) and are designed for a frame rate of up 
300kHz. The array was hybridized to an 8-bit 0.25 
CMOS technology, Si driver. 


7.3.4 Solid-state electro-optic SLMs 


In general, there has not been much progress in 
solid-state electro-optic SLMs. These devices, 
which constituted the “first-generation” SLMs 
in the late 1960s to early 1980s, were quite com- 
plex and bulky due to the very high bias voltage 
required, and the related large pixel size/SLM 
apertures required (see discussion on the funda- 
mental limitations of SLMs in Section 7.6 later). 
Thus, devices based on KD*P and LiNbO,, turned 
out to be commercially unviable for the main dis- 
play applications, and gave way to LCDs as the 
SLM technology of choice for displays and to some 
extent for optical processing. Some references to 
these technologies can be found in the earlier con- 
ferences on SLM technologies [155-157]. 

One electro-optic SLM that should be men- 
tioned is the PLZT-based device. There have 
been recent efforts in trying to form 1D and 2D 


products using this technology [158]. The interest 
in this modulator technology is in the relatively 
high electro-optic coefficient with half-wave volt- 
ages of the order of 100 V [159,160]. 


7.3.5 Acousto-optic SLMs 


The acousto-optic (AO) modulator technology has 
been largely implemented in the form of either 
bulk devices such as the Bragg cell or as sur- 
face acousto-optic (SAW) devices. This is a fairly 
mature technology of 1D modulator arrays which 
has been in existence since the early 1930s [161]. 
The general structure of an AO modulator is 
shown in Figure 7.10. A piezo-electric transducer 
modulated by RF (100-3000 MHz) generates an 
acoustic wave that propagates within the AO crys- 
tal (e.g., quartz). The propagating density grating 
formed in the crystal gives rise to a correspond- 
ing, propagating refractive index grating. The lat- 
ter diffracts an optical beam launched at the AO 
crystal perpendicular to the grating. The grating 
constant, which is essentially the acoustic wave- 
length A, varies inversely with the modulating RF 
frequency f, i.e, A = C,/f, where C, is the acoustic 
(sound-wave) velocity in the material. The Bragg 
diffraction angle 9, which is inversely proportional 
to the acoustical grating period, can be varied by 
modulating the RF frequency of the piezoelectric 
transducer. This device can therefore deflect an 
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Figure 7.10 Schematic of the acousto-optic effect. 


incident optical beam into a range of diffraction specifying the number of available spots or “pixels” 
spots thereby acting as a 1D SLM. is the time-bandwidth product. This parameter is 

A typical performance parameter of an AO _ roughly the product of the frequency bandwidth 
Bragg cell is given in Table 7.10. From the stand- (200MHz in this case), and the time aperture 
point of the SLM technology, a key parameter (5 pus), which is determined by the “time of flight” 


Table 7.10 Typical performance parameters of an AO Bragg cell 


Operating wavelength 


Time—bandwidth product 
Center frequency 

3 dB bandwidth 

Active aperture 

Time aperture 
Interaction medium 
Acoustic velocity 
Diffraction efficiency 
Electrode 

Optical surface flatness 
Input impedance 

Input VSWR 

Optical reflectivity 


Any within the range 442-850 nm 


1000 

300 MHz 

200 MHz 

1.5mm H x 19mm L 

5 ps 

PbMoO, 

3.63 mm ps“! 

10% at 1 W RF power (633 nm) 
Apodized to minimize acoustic walkoff 
wavelength/10 or better 

50Q 

2:1 across RF bandwidth 
5%/surface 


Source: Courtesy of Isomet Co. 
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of an acoustic wave (C, = 3.63mm us) across 
the crystal length (19mm), giving around 1000 
resolved spots. 

For further information on the AO technology, 
we refer the reader to several extensive technol- 
ogy surveys which have been recently published 
[162-164]. 


7.3.6 Magneto-optic SLMs 


The employment of the magneto-optic effect for spa- 
tial modulation is shown in Figure 7.11 [165-167]. 
The device, which is based on the magneto-optic 
effect associated with the magnetic domain reversal 
in a ferromagnetic material [gadolinium-gallium- 
garnet (GGG)], is electrically driven using conduct- 
ing X-Y mesh-lines which form the boundaries of 
the magneto-optical pixels. As a particular mesh- 
node is cross-activated (i.e., both the X-line and the 
Y-line for that node) the resulting current flowing 
around that pixel corner generates a magnetic field, 
which causes a magnetic domain reversal in that 
magneto-optical pixel. 

Although initially developed as a binary (phase 
or amplitude) SLM, the presence of a “neutral” 
state of the magnetic domains later allowed the 
realization of a ternary-state device [166]. Due to 
its relatively high insertion losses in most of the 
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The magneto-optic effect for spatial light modulation. (After Pulliam, G.R. et al., J. Appl. 


visible spectrum, the device was targeted mainly 
for optical processing applications. The relatively 
high current required to switch to the on-state is 
one of the main drawbacks of this device, effec- 
tively limiting its frame rate to the low kilohertz 
range (see Table 7.11). Recently, a renewed effort 
in the development of the MO-SLM was reported 
[168,169]. An overall improvement by a factor 
of three in power consumption using a modified 
shape driveline with detailed device simulation 
was reported. 


7.3.7 Photorefractive SLMs 


This modulation technology is based on the refrac- 
tive index modulation in response to an incident 
optical beam. The effect has been extensively stud- 
ied in the past 30years [170-173] and, hence, only 
a brief description will be presented. Figure 7.12 
describes the process of photo-refraction upon the 
illumination by an interference pattern generated 
by the illumination of two coherent beams (Figure 
7.12a). The spatial periodic illumination results 
in the generation of periodic, local photo-carriers 
(e.g., electrons—Figure 7.12b). These drift apart due 
to diffusion resulting in the formation of a space- 
charge field (Figure 7.12c, d). Finally, the generated 
space-charge field results in a periodic modulation 
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Table 7.11 Summary of the magneto-optic SLM performance parameters 


Performance parameter Description Units 
Input/output Electrically addressed, readout beam, intensity/ N/A 
phase modulated (ternary state: 0, +1) 
Device configuration Transmissive/reflective modes N/A 
Optical modulator Magneto-optic GGG N/A 
Driver array CMOS driver array N/A 
Array size 128 x 128 Pixels 
Resolution Approximately 20 LP mm~' @ 50% MTF LP mm-' @ MTF 
Temporal response Frame rate: 0.5-1.0 kHZ 
Spectral bandwidth 630 (15%)-788 (45%) nm 
Input sensitivity N/A N/A 
Contrast ratio/grey levels > 10,000:1 @ 788 nm 
Output modulation efficiency 15-45 % 
Physical array size ~30 x 30 mm 
Power consumption ~2-4 Ww 
niteeetag the refractive index of the material. As it turns out, 
coherent beams __ the response time of the effect is inversely propor- 
Photorefractive tional to the same figure of merit, Q [174]. 
medium While the effect has largely been studied for 
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Figure 7.12 Schematics of formation of the pho- 
torefractive effect. 


of the refractive index due to the electro-optic 
effect in this material. The resulting diffraction effi- 
ciency is therefore proportional to the appropriate 
linear electro-optic coefficient 7, with a figure of 
merit for the material which can be written as [174]: 
Q=nxt1/e, where s is the dielectric constant and n is 


holographic storage applications, as well as adap- 
tive optical corrections by phase conjugation (see 
Section 7.5), the photo-refractive (PR) crystal can 
actually serve by itself, as a photo-addressed or PA 
SLM. Thus, with the formation of a photo-induced 
grating by the optical input signal, the device acts 
to modulate a probe (readout) beam. Since both the 
input and the output beams can contain spatially 
encoded information, the device actually acts as 
a PA SLM, although in this arrangement another 
input SLM is often used. This property of PR 
crystals has been used for numerous applications 
including phase conjugation [175], optical stor- 
age [176], photorefractive correlators [177], laser 
beam cleaning [178,179], information processing 
[179,180] and novelty filter processors [181]. 

Early attempts to demonstrate spatial modula- 
tion included the photorefractive incoherent-to- 
coherent optical conversion (PICOC) device [182]. 
The process is based on modulating the preformed 
index grating generated in the photorefractive 
crystal by the interference of the two coherent 
beams, by the illumination of the incoherent (sig- 
nal) image. Another demonstration of an incoherent- 
to-coherent spatial modulator was based on a 
grating-encoded phase modulation in a Ce-doped 
SBN crystal [183]. A more recent demonstration of 
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Figure 7.13 Photo-refractive-based MOW-SLM. 
(a) p-doped conductive layer; (b) free carrier trap- 
ping layer; (c) intrinsic MQW layer; (d) n-doped 
Bragg mirror; (e) n-doped GaAs wafer; (f) 
incident probe beam; (g) modulated reflected 
beams and (h) diffracted beams. (After Bowman, 
S.R. et al., J. Opt. Soc. Am. B, 15, 640-647, 1998.) 


this effect was based on the self-phase conjugation 
in an SBN crystal, in which a 28 lp mm” resolu- 
tion of spatial modulation was demonstrated with 
a6 mm X 12mm crystal [184]. 

A more recent employment of the photorefractive 
effect was in the use of an optically addressed MQW- 
SLM bya US Naval Research Laboratory group [185]. 
The researchers developed an GaAs-AlGaAs, reflec- 
tive-mode MQW device, shown in Figure 7.13. This 
device is capable of operating both as a “regular” PA 
SLM and as a photorefractive or holographic mode 
device. In this mode, two coherent pump beams gen- 
erate a periodic photocharge pattern in the carrier 
absorption layer (Figure 7.13b). The resulting space- 
charge field induces refractive index modulation 
within the MQW layers which in turn modulates the 
probe beam (Figure 7.13f). Diffraction efficiencies of 
1.5% were obtained with down to a 7 um spot-size 
resolution @ A = 856 nm. A principal advantage of 
this SLM technology is its very high spatial resolu- 
tion, while the main drawback is its inherently slow 
response due to the energy (time x power) to form 
the photo-refractive grating pattern, including the 
use of fast MQW configurations. 


7.3.8 Smart-pixel SLMs 


A smart-pixel SLM (SPS) can be described as an 
SLM array consisting of opto-electronic pixel 


circuits where each of this circuits is capable of the 
following: 


1. Performing optical detection of the input 
beam. 

2. Performing some level of signal processing on 
the incoming signal. 

3. Optically modulating the output beam. 


Such “smart pixel” arrays clearly integrate some of 
the signal processing functions into the SLM itself 
[186,187]. The main variable parameters of this 
SLM technology are 


1. Detector array technology. 

2. Modulator array/modulated active emitter 
array technology. 

3. Signal processing functionality in each pixel. 


Earlier concepts were based on silicon-PLZT modu- 
lators [188] where the silicon circuitry was used to 
provide both photo-detection and drivers for the 
PLZT electro-optic modulators. With the advent of 
liquid crystals and more recently QW modulators, 
smart SLMs were demonstrated with Si-FLC [189] 
as well as Si-VLSI MQW [187,190] combinations. 
More recent configurations for SPSs involve the use 
of acombined Si-driver and VCSEL source/modula- 
tor [191], shown in Figure 7.14. 

In a recent development of a CMOS-based imag- 
ing smart-array system, a 2.5 MB s* bandwidth, 
over 30dB in dynamic range with 150 pm smart 
pixels in an array size of 64 x 64 has been demon- 
strated [192]. 

Finally, the combination of both functions of 
imaging and display in a single SLM device has 
been demonstrated using CCD circuits operating 
both as an imager and a display [193]. A recent 
development effort based on CMOS technology 
has been proposed with the combined imaging 
and display functions implemented in each pixel of 
the image transceiver array [194]. The application 
of this image transceiver device is targeted at the 
“smart goggle”. 

The main application for the SPS technology is 
the OI or cross-connects [187,195-197]. Another 
important application is that of optical processing, 
where operations such as motion detection, pat- 
tern recognition, SIMD-parallel processing, neural 
net processing and analogue to digital conversion 
have been demonstrated [198-202]. 
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Figure 7.14 VCSEL-Si-based smart pixel. (After Neff, J. et al. http://wwwocs.colorado.edu/research/ 


fsoi/research6.html.) 


7.4 NOVEL DEVICES AND 
EMERGING TECHNOLOGIES 


The very successful application of SLMs in the 
booming area of displays, and the more recent 
interest in their potential use in the exploding 
communication field, has motivated the research 
and development of novel spatial optical modula- 
tor technologies and devices. 


7.4.1 Electro-holograms 


This method is based on controlling the recon- 
struction process of volume holograms by means 
of an externally applied electric field [203-205]. 
The electro-holography (EH) effect exploits the 
voltage dependence of the photo-refractive effect 
in the para-electric phase of a material such as 
KLTN, which results in controlling the process of 
hologram reconstruction. Thus, the use of an elec- 
tric field can result in the activation of pre-stored 
holograms in the photo-refractive material, which 
would otherwise appear optically homogeneous. 

Such an effect when used in an array of such 
devices can be used to modulate, two-dimension- 
ally, the optical intensity of a wave front or of an 
array of optical beams. Alternatively, it can be used 
to steer individually a 1D or 2D arrays of optical 
beams. 

Furthermore, since this is essentially a con- 
trolled photo-refractive effect, the direction of 


the diffracted beam clearly depends on the beam’s 
wavelength. This is of particular importance in 
the optical cross-connect (OXC) switch applica- 
tion for communications (e.g., fora WDM system) 
where one needs to have a wavelength-dependent, 
routing control of the optical beams. This topic is 
addressed in Section 7.5 later. 

Switching speeds of ~10 ns, and diffraction effi- 
ciencies of over 30% in 32 angularly multiplexed, 
volume holograms in a 3 mm X 3 mm X 3mm 
KLTN crystal, which were switched on and off 
electrically, have been demonstrated. 


7.4.2 PBG devices 


This is an extremely interesting optical-physics 
phenomenon introduced in the late 1980s. The 
main feature of photonic band-gap (PBG) struc- 
tures [206,207] is the periodic modulation of the 
refractive index n (or the dielectric constant), along 
one, two or three directions of space. In a compos- 
ite formed by two dielectrics, the periodic modu- 
lation of one of the dielectrics effectively creates 
scattering centers, which are regularly arranged 
in the second medium, resulting in the coherent 
scattering of light. In this case, interference will 
eventually inhibit some frequencies that will not 
be allowed to propagate, thus giving rise to “for- 
bidden bands”. Under certain conditions, regions 
of frequency may appear to be forbidden regardless 
of the propagation direction in the PBG. In sucha 
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case, this region is said to present a full PBG. On 
the other hand, if the forbidden photonic band var- 
ies with the propagation direction in the region, a 
photonic pseudo-gap is spoken of. Furthermore, 
we can also introduce defects into the structure, 
resulting in the introduction of allowed energy 
levels into the gap, analogously to a doped semi- 
conductor. All these facts permit us to establish a 
parallelism between the formalism used for elec- 
trons in ordinary crystals and that for photons in a 
photonic crystal (PX). 

Continuing with this similarity, it is also pos- 
sible to control the “optical conductivity”, i.e., the 
optical transmission of a photonic crystal, by mod- 
ulating the relative indices of diffraction of the two 
materials of which the crystal is composed. In par- 
ticular, if an electro-optical material, such as liq- 
uid crystal, is infiltrated into the optically periodic 
crystal (e.g., periodically etched, porous silicon), 
then controlling the effective refractive index of the 
LC via voltage bias can result in a significant shift 
of the forbidden band frequency. This in turn can 
drastically change the optical transmission. Hence, 
here we have another potential efficient candidate 
for SLM—since minute changes in the effective 
refractive index could lead to very significant varia- 
tions in the optical transmission. This was actually 
demonstrated with a matrix of porous silicon, hav- 
ing an air-pore pitch of 1.58 jim, infiltrated with the 
nematic LC E7. The photonic band-gap transmis- 
sion of this structure then becomes quite sensitive 
to temperature variations, through their effect on 
the effective refractive index of the LC as was dem- 
onstrated by the Japanese as well as the Canadian- 
German groups which studied this effect [208,209]. 


7.4.3 Bubble cross-connect arrays 


A modified technology, originally developed for 
inkjet printers, to be used in an all-optical switch 
[210], is based on the formation of bubbles of gas 
by electrical heating, which then propel droplets of 
ink toward the printer paper. This printer technol- 
ogy is now quite reliable following two decades of 
development and perfection. The device generates 
bubbles in the same way, but uses them instead as 
a quickly appearing and disappearing gas—fluid 
optical interface. 

At each switching point, two silica-based single- 
mode wave guides intersect at a fluid-filled trench 
such that the angle between each wave guide 


and the normal to the trench is greater than the 
angle at which total internal reflection begins for 
a gas-to-fluid interface. When a bubble is created 
at the intersection, light reflects off it; when the 
bubble disappears, light passes straight through. 
Switching time is of the order of 1 ms; the cross- 
talk was measured to be —70dB. The researchers 
have fabricated an array of such elements into a 
32 x 32 optical switch. The device is claimed to 
achieve up to a 20 year lifetime. 

From the standpoint of SLM technology, we 
have a 2D array of binary-modulator switches 
capable of routing an incoming (1D) array of 
beams into an output array with a switching speed 
of 1 ms. This is a particular example of a “cross-bar 
switch” implemented in a planar configuration. 
Similar functionality has been offered by using LC 
and micro-mirror (MEMS) technologies for opti- 
cal communication (see Section 7.5). 


7.4.4 Bio-chemical SLMs 


This novel class of SLM technology is based on the 
spatial optical modulation induced as a result of 
chemical or bio-chemical spatial variations, which 
are usually converted to spatial modulation of the 
electric field/potential. 

An earlier attempt to use bio-chemical agents 
to induce spatial optical modulation was made 
by using bacterio-rhodopsin molecules [211,212]. 
However, while having scientific curiosity, this 
type of modulator has not yet been practically 
implemented. 

Other more recent attempts have focused on the 
use of bio-chemical material used as bio- or chemi- 
cal sensors to affect the optical property of the 
underlying optical modulator either directly or by 
spatially modulating an electric field or potential. 
A fiber-optic DNA sensor for nucleic acid deter- 
mination was demonstrated using a fluorescent 
DNA stain [213]. Such a bio-chemo-optical sensor 
can be replicated in an array allowing the paral- 
lel detection of multiple forms of nucleic acids, and 
hence can be considered as a candidate technology 
for the “bio-chemical SLM”. Another such candi- 
date based on an evanescent wave in a 1024 fiber 
optic array for the detection of oligo-nucleotides 
was demonstrated in 1996 [214]. Demonstrations 
of actual operation of arrays of bio-chemical opti- 
cal modulators started to appear in the late 1990s. 
A fiber-optic-based micro-optode (i-., optical 
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micro-electrode) for the measurement of oxygen 
distribution was demonstrated in 1997 [215]. The 
researchers used a phase-modulation technique 
to determine the phase shift produced by the fluo- 
rescent stain, in response to a sinusoidal optical 
excitation. A biological sensing technique for the 
detection of specific bacteria was demonstrated 
using a micro-mechanical array of silicon nitride 
cantilever beams to which a specific immobiliz- 
ing antibody of a certain bacterium (E. coli) was 
attached. The detection mechanism is based on the 
shift of the resonant frequency of the cantilevers 
which is measured by the optical beam. A limiting 
detection of 16 E. coli cells was demonstrated [216]. 
The simultaneous detection of six bio-hazardous 
agents using a cocktail of fluorescent antibodies 
coupled with the analysis of fluorescence intensity 
was demonstrated [217]. 

A surface-plasmon-based resonance was used 
to construct a bio-sensor array based on multiple 
analytes. By performing parallel, multiple spec- 
troscopic analysis of the plasmon resonance for 
the various analytes, researchers demonstrated 
the bio-sensing ability to detect the affinity of the 
peptide sequence YGGFL to human B-endorphin, 
as well as the affinities between other types of bio- 
agent. The 1D array technology allows the simulta- 
neous evaluation of up to 160 samples [218]. 

An inexpensive technology for the production of 
a colorimetric resonant optical bio-sensor, based on 
the use of multiple micro-titer wells and an array 
of fiber optic transceivers, was demonstrated [219]. 
The demonstration using 96 micro-titer plates 
showed the detection of a protein-protein affinity 
with an antibody detection sensitivity of 8.3nM. 

Finally, the use of a LC-based optical shutter 
array coupled to an aligned array of bio-sensing 
elements constructed on a planar wave guide was 
demonstrated [220]. The LC shutter array selec- 
tively contrOIs the transmission of the fluores- 
cence generated by the biosensor array, activated 
via the optical wave guide. 


7.5 SLM APPLICATIONS 


7.5.1 Optical communication, signal 
processing and interconnects 


7.5.1.1 OVERVIEW 


The most dramatic recent technology develop- 
ment relevant to the SLM field, since the last 


comprehensive report on SLM technology was 
demonstrated in 1995 [4], has been the explosive 
interest and use of communications. The most rel- 
evant applications of the SLM technology for this 
field are twofold: 


1. OXC switch arrays. 

2. SLM-based adaptive optical aberration correc- 
tion systems for point-to-point optical com- 
munication (a part of the so-called “last mile 
communication” problem). 


A third application that is currently under develop- 
ment and which may be quite important for optical 
communication (OC) is that of adaptive ultrashort 
(femtosecond) pulse shaping. The latter is also an 
example for the use of optical signal processing 
(OSP) in optical communication. Indeed, since the 
communication field is by far the most dominant 
application for OSP and OI technologies, we will 
include a brief description of other OSP and OI 
applications as well within this section on optical 
communication. 


7.5.1.2 OPTICAL INTERCONNECT AND 
CROSS-CONNECT 


In a typical operation, the optical cross-connect 
(OXC) unit enables a programmable interconnection 
between an array of N input optical channels (pos- 
sibly implemented by N input fiber-optic channels) 
to an N-output channel array. It is desirable that the 
cross-connect switch allows spatial and wavelength- 
dependent routing operation. The role of an OXC 
unit for communication is to route each of the optical 
signals spatially to its designated end location, based 
on the signal (header) information and the particu- 
lar operating wavelength of the channel. An obvi- 
ous candidate of SLM technology to carry out this 
dynamic cross-connect function is LC technology. 
An earlier work used a PA LC-SLM for a 256-channel 
system [221]. In order to improve the performance 
of the slow-response NLC devices, an eight-channel 
device was demonstrated using FLC in 1995 [222]. 

A significant pioneering work in the area 
of LCDs for OI was done using both nematic 
[223,224] and FLCDs [225], as well as a demon- 
stration of polarization-insensitive switches [226], 
thereby overcoming one of the major weaknesses 
of the LC technology. 

A study on the use of sub-wavelength LC-based 
diffractive optical elements for OXC applications 
was recently published [227]. 
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The MEMS technology with its DMD repre- 
sentative detailed in Section 7.3 earlier is another 
potential candidate for the OXC technology 
[228,229], with numerous companies currently 
attempting to develop MEMS-based cross-connect 
switches. The relatively slow response coupled with 
the reflective-mode constraints is the main draw- 
back of this technology. 

The bubble-SLM technology mentioned in 
Section 7.4 earlier is being developed for crosscon- 
nect application. A 32 x 32 switch has been demon- 
strated with a 10 ms response time [209,230,231]. 

The electro-hologram technology mentioned in 
Section 7.4 earlier is under development for optical 
cross-connect applications. The main advantages 
of this technology over MEMS, LC, or the bubble 
technique is its relatively fast response (of the order 
of nanoseconds) as well as its unique wavelength 
selectivity. 

An 8 x 8 optical crossbar is under development 
[232]. The OXC will consist of 64 inputs that can 
be routed to any one of the 64 outputs, using a 
hybrid IH-V semi-conductor and Si CMOS flip- 
chip bonded technology. The optical switch will 
have 4096 detectors configured as a 64 x 64 array, 
each operating at >200 Mbit s!. Another effort has 
developed a 16 x 16 CMOS/GaAs OI operating at 
0.83 pm [233]. 

Finally, there have been numerous demonstra- 
tions of VCSEL-based smart-pixel OI systems 
[234-237]. A free space, 64 x 64 optical cross- 
connect switch based on FLC-SLM was recently 
demonstrated [238]. The cross-connect interface 
can serve as a programmable parallel array proces- 
sor for a variety of data processing applications. 


7.5.1.3 FEMTOSECOND PULSE SHAPING 


An important as well as an interesting application 
of the SLM technology for optical communica- 
tion is the shaping of ultra-short, femtosecond 
optical pulses first suggested by Weiner [239] 
using a fixed mask, and later refined to include 
an LC-programmable mask or an SLM [240]. The 
concept is to spatially disperse the optical pulse 
using an appropriate grating and then spatially 
phase-modulate the dispersed signal pattern. The 
re-collection of the spatially phase-modulated 
pulse using a second grating, leads to a temporal 
re-shaping of the optical pulse (in particular, pulse 
compression), which can then be used for optical 
communication. In more recent developments, 


temporal as well as spatial pulse shaping was 
accomplished using SLMs [241,242] with a recent 
demonstration of a 5 fs pulse compression [243]. 


7.5.2 Display applications 


This is obviously a very wide application field, 
with a huge billions of dollars market, encompass- 
ing numerous technologies. As we are interested 
only in reviewing the field from the SLM tech- 
nology standpoint, we will confine ourselves to 
nonemissive or passive display technologies, omit- 
ting technologies such as LEDs/OLEDs, electro- 
luminescent and plasma displays. The discussion 
will therefore be limited to innovations in the area 
of LC- and MEMS-based displays. 

One of the very recent applications of the PA 
LCLV discussed in Section 7.3 earlier is that of elec- 
tronic cinema [244]. Here, only high-brightness, 
high-resolution projection systems are required. A 
projection system based on combined LCLV tech- 
nologies can provide up to 3000 ANSI lumens at 
a resolution of 2048 x 1536 pixels. A DMD-based 
projection system is in close competition. 

Another important display technology for 
large-screen, video projection displays and for 
the electronic cinema application is the DMD, 
MEMS device discussed earlier [245]. The use of 
SLM panels (whether LCDs or digital mirrors) 
is implemented in a three-panel configuration, 
where either red, green and blue panels or some 
form of color-subtraction configuration is used. 
This allows high brightness to be accomplished 
as the full bandwidth of the (white) light source 
is utilized simultaneously for all three channels. 
The disadvantage is in the complexity of the color- 
separation system and the cost of three panels 
and their drivers. The other more compact and 
cost-effective method is to use a “color-sequential” 
scheme, where one broad-band SLM panel (either 
LCD or DMD) is used in conjunction with a color- 
sequencing scheme where the red, green and blue 
portions of the filtered broad-band source are used 
sequentially to read out the panel. This requires a 
minimum of 180Hz operation, which, although 
quite demanding, has been successfully demon- 
strated on LCD panels [246]. 

Finally, we should also mention the fast- 
growing field of head-mounted displays (HMDs) in 
which miniature LCD panels play an increasingly 
growing role [247-249]. LC-based, 3D stereoscopic 
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displays are a derivative of both LCD and HMD 
technologies [250,251] where the stereoscopic or, 
3D depth visualization can be accomplished, e.g., 
by ascribing orthogonal polarizations to each of 
the ocular views using appropriate LC shutters 
[250]. Auto-stereoscopic systems where the ste- 
reoscopic imagery is embedded in the displayed 
imagery, obviating the need for the shutter glasses, 
has been a major thrust recently with DMD-type 
SLMs serving as the image source [251]. All digital 
cinema projection is now done via micro-mirror 
SLMs from TI. 


7.5.3 Optical data processing 


As pointed out in the previous section, novel meth- 
ods and technologies related to the application field 
of OSP are covered within the former section on 
optical communication. This section covers inno- 
vations and updates related to the optical process- 
ing of spatial data information (the latter may either 
be optical or electronic) with an emphasis on image 
processing applications. One area, albeit not new, 
has been continuously attracting attention as one 
of the most efficient optical methods of image com- 
putation, namely the optical correlation technique. 
With the introduction of ultra-fast SLM technolo- 
gies such as the MQW-SLM, a natural progression 
has been made to utilize this fast SLM in demon- 
strating high-throughput image correlation. 

The group [148,149] claims a correlation 
throughput of up to 600000 correlations s“! with 
128 x 128 images. This translates into an equiva- 
lent computational throughput of: P..., = 2N? 
logNf ~ 115 GOPS, where N = 128; f= 600,000, and 
GOPS = giga binary operations s". 

While this obviously is a very respectable com- 
putational throughput, it is uncomfortably close (to 
within a factor of approximately x10) to throughputs 
attainable by current dedicated ULSI, system-on- 
chip (SOC) processors. This state of affairs represents 
the on-going dilemma of optical processing—the 
very fast moving electronic competition. 


7.5.4 Adaptive and programmable 
optics applications 


Here, we focus on the use of dynamic, adaptive 
optics and programmable optic techniques for 
noncommunication applications including astro- 
nomical and vision systems. 


7.5.4.1 ADAPTIVE OPTICAL CORRECTION 


The use of adaptive optics for atmospheric aberra- 
tion corrections has been intensively researched in 
the last four decades or so. The particular use of 
SLM technology as a key tool in adaptive, real-time 
correction systems was given in detail by Pepper 
et al. [4, Chapter 14]. 

Phase conjugation, whereby a phase-aberrated 
wave-front is “cleaned up”, is an attractive way 
of correcting phase aberrations of optical beams. 
There have been several attempts to use the PA 
LCLV technology, through its capability of real- 
time phase conjugation, to demonstrate such an 
action first by using nematic LCs [252,253] and 
later by using FLC modulators [254]. More recent 
use of SLM technology for real-time aberration 
correction appears in references [255,256]. 

A recent application of this technology has 
emerged in the area of visual system examination 
and correction. A retinal imaging system, which 
uses a deformable mirror-type SLM for the cor- 
rection of the eye-lens aberration was recently 
developed [257]. Here, a Hartmann-Shack-type 
wave-front sensor senses the ocular lens aberra- 
tions. These wave-front distortion data are subse- 
quently fed to a deformable mirror SLM, to correct 
adaptively the retinal image. This system allows a 
clear, nonaberrated imaging of the retina for per- 
forming retinal examination and subsequent med- 
ical treatment. 

The next stage in the use of this technology is 
an adaptive visual correction system, in which this 
real-time SLM-based, adaptive optics technique 
will be used to attain a “super-normal vision”, thus 
allowing a near-diffracted-limited vision to be 
achieved. 


7.5.4.2 PROGRAMMABLE OPTICAL 
ELEMENTS 


This application, which is related to the previous 
“adaptive optics” one, is based on the use of phase- 
modulating SLMs to perform programmable opti- 
cal element functions e.g., variable focal-length 
lenses or variable slope prisms used for beam- 
steering. Such devices often make use of the fact 
that a given optical phase profile required for a 
desired shaping of a monochromatic optical beam 
wave-front can be substituted by its “modulo-22” 
representation. Thus for example, a linear phase 
profile, (x), of a glass prism is described by 
(x) = tan(a)xny, where a is the apex angle of the 
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prism, ny is the glass refractive index, and x rep- 
resents a point along the length of the prism, will 
have the same effect on a monochromatic beam at 
Xp as a phase profile which “resets” to (x) = A, at 
x-locations in which (x) satisfies: b(x)/A, = 22n, 
where n = 1, 2, .... Note that this allows relatively 
thin optical elements with thickness of the order 
of A, namely a few micrometres, to be substituted 
for the traditional thick (mm-cm) glass elements. 
Also note, however, that such substitution is only 
valid for the particular wavelength i, for which 
o(x)/A, = 2nn. These optical elements are com- 
monly referred to as “binary optics” or “kino-form 
optical elements” [258,259]. 

The ability to structure wavelength-thin, optical 
elements opened the way for using phase modulat- 
ing SLMs thereby allowing programmable optical 
elements to be developed. 

Applications of such dynamic kino-form opti- 
cal elements include LC beam-steering devices 
[260,261], as well as variable focal-length lenses 
[262-264]. The main drawback of these dynamic 
programmable elements, as in the static kino-form 
elements, is their limitation in monochromatic 
beam shaping applications. 


7.5.5 Wavelength image converters 


An additional important application of SLMs is 
the conversion of an imagery from one wavelength 
to another. This can be done using either a single, 
photo-activated SLM whose photo-substrate and 
optical modulators operate at different wave- 
lengths. Alternatively, one can use an imager 
device to record the imagery at a certain wave- 
length and relay the electronic signal to an electri- 
cally addressed SLM operating at a desired output 
wavelength. 

A commonly used example of the latter is 
the regular TV monitor which relays the image 
picked up by a video camera and displays the 
color version video on the screen. Although in 
this case both the input and the output imagery 
are formally based on red-green-blue (RGB) vis- 
ible channels, we can easily manipulate the TV 
color range, and thus can effectively perform an 
image wavelength conversion operation with this 
system. 

PA LCLVs of the type discussed in Section 
7.3 earlier have been successfully used in con- 
verting visible imagery to IR video scenes for IR 


scene simulation applications [265]. Such devices 
were also used for visible- to near-IR image 
conversion [266]. 

The high spatial resolution and array sizes 
attainable in some SLM technologies, combined 
with the capability of UV modulation, naturally 
calls for the use of this technology to perform pro- 
grammable, real-time photo-lithography [267]. 
The use of an NLC-SLM with 600 x 800 pixel 
resolution, 27 mm X 20mm aperture, over 100:1 
contrast and 7/25 ms rise/decay response times at 
the argon ion wavelength of 351.1 nm, were used to 
form a 3D hologram in a photopolymer material. 
The hologram was subsequently read using the 633 
nm ofa He-Ne laser [268]. 


7.6 FUNDAMENTAL LIMITS AND 
FUTURE TRENDS 


7.6.1 Performance trade-offs and 
fundamental limits 


Some important interplays or trade-offs exist 
among the main SLM performance parameters. It 
is important to understand those limits in order 
to realistically design an SLM or make a sound 
projection of its expected performance. The main 
parameter trade-offs of SLMs are briefly described 
below. 


7.6.1.1 DYNAMIC RANGE/SPEED 


This is perhaps the most significant trade-off in the 
SLM technology. It is somewhat intuitive in that a 
large electro-optic coefficient, which is related to the 
dynamic range attainable by the optical modulator 
system, will be associated with a reduced speed of 
response, and vice versa. A good example for two 
modulator systems showing this trade-off would be, 
on the one hand, the class of nematic liquid crys- 
tal material featuring an enormously high (second- 
order) electro-optic effect with an effective half-wave 
voltage of around 2V, but with a typically low fre- 
quency response of around 100 Hz. (The half-wave 
voltage can roughly be taken as inversely propor- 
tional to the electro-optic coefficient.) This is con- 
trasted with solid-state electro-optic crystals such as 
the KD*P or LiNbO, capable of responding to bias 
frequencies well above 1 MHz, but with the penalty 
of a very large, 1000V half-wave voltage. To gain 
some insight into this trade-off—reminiscent of 
the well-known gain-bandwidth product (GBWP) 
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in electronic devices—it can be argued that under 
certain hypotheses this trade-off may in fact, obey a 
“universal” form of the GBWP behavior, for a vari- 
ety of optical modulator material systems, where 
the gain is actually represented by the generalized 
susceptibility relevant to the particular modula- 
tor material system (e.g., the electro-optic coeffi- 
cient, for solid state EO modulators), and where the 
bandwidth is the total frequency bandwidth of the 
modulator system, comprised of both the optical 
(spectral) bandwidth of the modulator and the elec- 
tric field, RF-modulating frequency (see Appendix). 


7.6.1.2 RESOLUTION-SPEED TRADE-OFF 


This constraint is originated in the dynamic range- 
speed trade-off, as discussed earlier. Thus, for very 
fast-responding materials (e.g., solid-state electro- 
optic crystals) the high biasing voltage required, 
due to their low EO coefficients, as a result of the 
above, dynamic range-speed trade-off, will neces- 
sitate the use of large pixel sizes to avoid field 
break-downs. 


7.6.1.3 DYNAMIC RANGE-RESOLUTION 


A common manifestation of this trade-off is the 
familiar “blooming” effect, which occurs in the 
“over-activated” regions of the SLM pixels. This 
is usually due to the finite charge-holding capac- 
ity of the driver array (both in PA and electrically 
addressed devices). The excessive pixel driver 
charge, resulting from over-driving the pixel in 
attempting to attain high output signal (or equiva- 
lently, high dynamic range), spills over to adja- 
cent pixels, resulting in the “blooming effect”. 
However, this trade-off can also be “utilized” in a 
constructive manner by using a cluster of pixels in 
a high-resolution, binary-mode device to produce 
effectively a grey-level modulation at the lower res- 
olution defined by the pixel cluster, in a similar way 
to half-tone image techniques. 


7.6.1.4 SENSITIVITY-SPEED 


This trade-off (for PA devices) is usually the 
consequence of the constancy of the energy flux 
required for the photoactivations (J cm), rather 
than that of the power flux (W cm”). Examples are 
PA SLMs, in which we must compensate for low 
input power flux levels, by reducing their frame 
rate or integration time. Another well-known 
example is the reciprocal relationship between 


the response time for the formation of a grating 
in a photo-refractive material, and the power level 
of the input signal. In general, the consequence 
is the necessity to use longer integration periods 
to accumulate sufficient signal charge and thus to 
lower the speed or the frame rate of the device. 
This is similar to the limitation of imagers under 
low-level illumination. 


7.6.2 Future trends in SLM 
technology 


The last decade has witnessed tremendous changes 
in three main optics-related areas, namely, com- 
munications, information processing and display 
technologies. The explosive growth and the pros- 
pects of even more dramatic future developments 
are certainly expected to impact the trend in the 
related SLM technology. In addition, the trend of 
developing artificial man-made optical materials, 
which started around two decades ago, has led to 
a very successful development of, in fact, the fast- 
est optical modulator in existence today, namely, 
the MQW modulator. Therefore, a continuation 
of all these trends in the next decade is predicted. 
The development of OXC-SLM technology, which 
has already seen the development and adapta- 
tion of MEMS-based devices, will continue to 
expand with a thrust towards the development 
of novel SLM technologies such as the Bubble 
Array or the electro-hologram devices. We have 
already witnessed the redirection of the historic 
optical data processing technology into the use 
of Ols. It is predicted that this will continue as 
a thrust for the development of SLMs in infor- 
mation processing systems. The development of 
relatively slow (~1 ms) phase-modulating devices 
for adaptive and programmable optical element 
applications will continue to grow. At the same 
time, it is expected that the effort in developing 
faster modulators will be expanded. In the boom- 
ing and attractive field of displays, active arrays 
such as organic LEDs (OLEDs) may well chal- 
lenge the traditional passive, SLM-type devices 
such as LCDs. Finally, with the strong interest 
in bio-chemical data processing and biological 
sensors—enhanced by the explosion in genetic 
research as well as the recent threats of weap- 
ons of mass destruction, one can see a signifi- 
cant thrust in the area of bio-chemical SLMs (see 


Appendix: On the dynamic range-speed trade-off 257 


Section 7.4.4) to allow a fast, parallel processing 
and analysis of bio-chemical data. 


APPENDIX: ON THE DYNAMIC 
RANGE-SPEED TRADE-OFF 


The following derivation represents an intuitive 
argument rather than a rigorous proof and should 
be treated as such. We chose to include it despite 
its approximate nature, as the consequences of this 
hypothesis may be far reaching in terms of the fun- 
damental limits of SLMs. 

First let us generalize the term dynamic range 
of the system to be the range of values which the 
appropriate optical response function of the par- 
ticular modulator system can take. It is the range 
of the electro-optic coefficient for an electro-optic 
modulator system or the range of the electro- 
absorption coefficient for the MQW modulator 
system. The speed of response is obviously directly 
related to the frequency bandwidth of the system. 
With this in mind, taking the electro-optic effect 
as an example, the change in the refractive index of 
the material An, upon the application of an electric 
field E, is given in general by [269] 

An ~ n° rE (7.1) 
where r is the appropriate electro-optic coefficient. 
This behavior can be presented more generally, as a 
response of a system parameter P(f), to an external 
driving field F(). 

It can be shown that the temporal response 
function y(#) (or the generalized susceptibility) of 
the system acts essentially as an impulse response 
function to the external driving field F(t), such 
that 


P= Jxe-DF Cade (7.2) 


where in the frequency domain the convolution 
operation of Equation 7.2 turns into a simple prod- 
uct, namely [270,271], 


(7.3) 


This is a generalization of Equation 7.1 above with 
P, x, F, replacing An, 1, E, respectively. 


Now, the DC value of y, namely, yp, = Sx 
dt = y(w = 0), must be finite, based on the material 
stability considerations. This is so since an infinite 
value would imply that a vanishingly small fluctua- 
tion in the external field 5F would result in a finite 
change of the system parameter P (e.g., the refrac- 
tive index) [272]. However, since the latter is deter- 
mined by the structure of the particular material 
system (e.g., an electro-optic crystal), a finite 
constant (DC) change in its value implies a finite 
structural change. Such a finite structural change, 
with a vanishingly small perturbation, indicates an 
inherent instability. Now, by the Parseval identity: 


uo 


and hence the finiteness of po = J uloae 


dt= | x(@) do (7.4) 


2 
implies a finite value for foece dt, and hence for 
Joao Assuming a finite bandwidth, Ao, for 


x(@), and making the approximation: 


1n= [xl@Ndo ~ x. A0 (75) 


where Y,y is an average value for y(@) within the 
finite bandwidth Aw, we conclude that the finite 
value of Iy indicated by Equation 7.4 implies that 
the susceptibility y,, must, in general, be traded 
off against the system bandwidth, Aa, or speed of 
response. Thus, we conclude that there must exist, 
indeed, a trade-off between the dynamic range of 
the generalized electro-optic response function 
and the speed of response of that system. 

Next, we need to consider the frequency range 
of the system, where there are basically two physi- 
cally different frequency regions. The first, ©; =Ogp 
is the RF field (e.g., the low-frequency externally 
applied electric field that controls the electro-optic 
effect). The second frequency region, ®)=@,,,, is the 
optical frequency range encompassing the spectral 
(wavelength) region in which this particular sus- 
ceptibility (say, the EO effect) is nonzero. Typically, 
we have 0, » Og; Let us consider in detail the 
particular EO example with its associated second- 
order susceptibility, (2), related to the EO (Pockel) 
effect. Using results from nonlinear optics, we have 
for the second-order polarization P®(#) [273]. 
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PO) =e9 | do, | dx” (—@;0,0)E, (0)E,(@)e*” 


(7.6) 


where we take: @ = Oo); = 5 © = Opp = 0 (low fre- 
quency); E,(0) = const (=amplitude of the external 
bias field); E(w) = ES” =const (=amplitude of 
the optical field). We now assume, as in the previ- 


ous argument, that [x?@rdo ~YXQA@ is finite 


over the finite (optical) bandwidth Aw and thus we 
have an upper limit to the polarization given by 


P(t) < Aw, A@ xX) E:(0)E3. (7.7) 


now the polarization, or its variation with the 
applied field, is directly related to the variation in 
the electronic or ionic displacement (x) via [274] 


P=Ne(x) (7.8) 


and so again due to material stability consider- 
ations P, or P® (£), must remain finite, given a finite 
(x), for any finite field levels below the breakdown 
limit of the material. Therefore, the conclusion is 
that the quantity: 

SBP = AM,AO:XR = AW: AOREX (7-9) 
must be finite for this case of an EO material 
system. 

So, we see that the GBWP constant takes on a 
new form for the “susceptibility-bandwidth prod- 
uct” (SBP), which involves two frequency ranges 
rather than one, due to the two operating fields: the 
RE and the optical field. 

We can conveniently express this new SBP, in 
units of frequency bandwidth (Hz) x optical band- 
width (cm!) x optical susceptibility, which we take 
as proportional to the linear electro-optic coef- 
ficient or An/AE (cm V-). So, for the EO case we 
have the SBP quantity in units of Hz V7. 

It can be argued that the above considerations 
can be extended to a generalized susceptibil- 
ity (e.g, AO, magneto-optic). This is so since the 
polarization modulation resulting from the appli- 
cation of the appropriate external field and with 
it, the associated ionic or electronic displacement, 


must remain below the structural damage level, 
regardless of the optical modulation mechanism 
(e.g., acoustic or magnetic). Now, since this range 
of acceptable ionic or electronic displacement in 
solids, although not constant, varies within one 
to two orders of magnitude, we expect the SBP for 
all solid state material systems to be also roughly 
within one to two orders of magnitude. 

Let us examine this hypothesis for three mate- 
rial systems: 

Liquid crystals: A@g, = 104Hz (FLC); 
A Dopp ~ 2.5 X 104cm7! (400-10,000 nm); An/ 
AE = 0.2/104 (cm V-') Here, we get for the SBP, -: 


SBPic = AW: AOREX Ry = AWop AWar [An/ AE] 


=5x10°(Hz V") 


MQW modulator (electro-refraction): 
Age ~ 10'°Hz Aw, = 30 cm! (850-852 nm); An/ 
AE = 0.01/10° (cm V1). Thus we get for the SBPyow: 


SBPmqw ~ AMopA@rr[An/AE]= 3 x10*(HzV") 


Solid-state electro-optic modulator (r = 30 pm 
V! = 3 x 10-9cm V!): A@gp = 107Hz (FLC); 
A Wo = 1.5 X 104 cm™ (400-1000 nm); An/ 
AE = nr. = 30 X 3 X 109cm V1 


SBP59 = AM AMre [| An/AE ] = 1.4 x10*(HzV") 


We thus see that the SBPs of all these three mark- 
edly different material systems, with orders of mag- 
nitude variations in their RF frequency responses, 
spectral bandwidths and optical susceptibilities, 
come to roughly the same level, within an order of 
magnitude. This finding supports the “universal- 
ity” hypothesis for this quantity. 

We also note that for systems with compa- 
rable spectral (optical) bandwidths such as liquid 
crystals and solid-state EO materials, the SBP 
constancy simplifies approximately to a trade-off 
between the EO susceptibility or An/AE and the 
RF bandwidth A@,, in analogy to the well-known 
GBWP. 

Finally, an interesting situation arises when we 
can invoke a trade-off between the optical sus- 
ceptibility and the frequency response within the 
same material system, e.g., by incorporating an 
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optical feedback in a PA nematic LCLV [275,276]. 
In this case, it can be shown that the product of 
the open-loop optical gain defined for the phase 
modulating LC-SLM by 


Ad out = G Ain (7.10) 
and the LC frequency bandwidth, 
A® = 1/T (7.11) 


the GBWP G,/t,, remains unchanged as the opti- 
cal feedback mechanism is turned on, namely 
when we have 


AD, = GcLA®, (7.12) 


where for the closed loop system with a negative 
feedback fraction, B, we have 


Go 
= 7.13 
TRG (7.13) 
and the associated LC frequency response is 
A® = I/Tc. (7.14) 


It can be shown that [275]: 
Gc. /Tecr = Go/To. 


This GBWP constancy allows a 50 Hz (open-loop) 
PA nematic LCLV to attain kilohertz frequency 
response using a negative feedback, closed loop 
system [276]. This is in close analogy to the con- 
stant GBWP behavior in electronic systems (e.g., 
operational amplifiers). 
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8.1 INTRODUCTION 


Over the past few years organic light emitting diode 
(OLED) technology has emerged as a major player 
in the established field of displays. Since the dem- 
onstration of efficient light emission from small 
organic molecules [1] and conjugated polymers [2], 
built on earlier studies of organic electrolumines- 
cence in polymers [3] and small molecules [4], 
their potential application to displays has been 
technically compelling. 

OLEDs have several advantages over other dis- 
play technologies. Compared to inorganic semicon- 
ductors, OLEDs offer large-area manufacturing on 


8.3.1 Substrate stack 284 
8.3.1.1 Substrate 284 
8.3.1.2 Backplane 284 
8.3.1.3 Anode 285 
8.3.1.4 Substrate stack 

preparation 285 
8.3.2 Organic layers 286 
8.3.2.1 Vapor phase deposition 286 
8.3.2.2 Solution processing 286 
8.3.3 Cathode and encapsulation 287 
8.3.3.1 Cathode 287 

8.3.3.2 Encapsulation and 
finishing 288 
8.4 Summary 288 
References 289 
Further reading 291 


cheap substrates with easy integration of different 
color devices. Compared to liquid crystal displays 
(LCDs), OLEDs avoid the need for separate back 
lighting and for color filters (although they are 
often still used, for example, to make RGB displays 
using white OLEDs), enabling thinner and lighter 
construction with higher power efficiency. OLEDs 
only emit light where needed, which improves 
both efficiency and contrast. OLEDs also naturally 
offer a good angular distribution of light, which is 
possible but complex for LCDs. However, an OLED 
is not a panacea. OLEDs are current controlled, 
which requires more complex backplane circuitry 
(than an LCD) and tracking able to deliver higher 
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currents to the center of the display. An OLED 
will never offer lower power consumption than an 
electrophoretic display and may not attain compa- 
rable lifetime to the discrete inorganic LEDs used 
in very high-brightness stadium-sized displays. 
Most critically, OLED technology is still relatively 
young and cannot yet compete on cost with the 
mature LCD industry because of the latter’s high 
yields and huge volumes. However, despite recent 
substantial improvements, LCDs still struggle to 
match OLEDs for performance. Certainly you can 
get LCDs with high contrast, or amazingly accu- 
rate color and gray scale, or high speed, or wide 
viewing angles, or high efficiency, but rarely all of 
these at the same time. 

Organic light emitting materials divide into 
two main structural categories (small molecules 
and polymers) and two light generation mecha- 
nisms (fluorescence and phosphorescence). These 
divisions are important for several reasons. 
The material structure determines how it can 
be applied to the substrate: solution process- 
ing for polymers, vacuum deposition for small 
molecules. This has significant implications for 
displays. The light generation mechanism deter- 
mines the ultimate quantum efficiency, fluores- 
cence having a quarter to a half the efficiency of 
phosphorescence [5]. 

This chapter aims to show how OLED devices 
can be applied as a display technology. There are 
many similarities between small molecule and 
polymer-based technologies, so the two are cov- 
ered together, with their differences only brought 
out as necessary. The basic device physics, the 
chemical and material properties of specific 
organic materials, are not covered in detail here. 

The first half of this chapter will discuss what 
distinguishes OLEDs from other display tech- 
nologies, working up from the key properties of 
an OLED pixel to the performance of different 
types of display. The second half then concen- 
trates on the construction of an OLED display 
panel, explaining the structure, fabrication meth- 
ods, and important properties of the constituent 
components. 


8.2 OLED DISPLAY TECHNOLOGY 


An OLED is a thin-film self-emissive device that 
converts an applied electrical current into light. 


By arraying a sufficient number of these devices 
and by providing a suitable means of controlling 
the current through such devices, a display device 
can be constructed. The performance of an OLED 
display can be entirely understood from the char- 
acteristics of the OLED devices and the method of 
driving. 


8.2.1 OLED devices 


OLEDs are thin, large-area LEDs. This statement 
points toward the most important properties that 
govern how an OLED display should be driven, ie., 
an LED requires current to emit light and is, most 
appropriately, current controlled. That it is a thin- 
film large-area structure means that the transport 
time through the device is fast. However, it also 
means that it has a significant capacitance and is 
prone to leakage. The total electro-optic response 
of the OLED device is split into three sections— 
electrical response, optical response, and aging 
effects. 


8.2.1.1 ELECTRICAL RESPONSE 


An OLED is a stack of various organic and inor- 
ganic layers. While the precise layers used vary a 
great deal from device to device, as far as the elec- 
trical properties are concerned the layers can be 
split into three groups, shown in Figure 8.la: hole 
conduction, light emission, and electron conduc- 
tion. Figure 8.1b shows the equivalent circuit for 
this structure. The emission layer(s) typically has 
much lower conductivity than the others and the 
device should only conduct strongly once sufficient 
forward bias is applied, so that both electrons and 
holes can reach the emission layer(s). Charge accu- 
mulation in this thin structure is represented in 
the equivalent circuit as a parallel capacitance. The 
fabrication of real, slightly less than ideal, devices 
also gives rise to other important properties, in 
particular, reverse bias current leakage that adds 
a parallel resistance to the equivalent circuit and 
conduction through all transport layers adding 
series resistance. 


8.2.1.1.1 Capacitance 


Unless the device is forward biased above the 
threshold voltage, no carriers are injected into 
the emission layer and so the OLED structure 
reduces to a thin insulator (<100 nm) sandwiched 
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Figure 8.1 (a) OLED device layers in functional groupings and (b) a simplified OLED equivalent circuit. 


between two large-area conductors, ie., the 
structure for a capacitor, typically with a typical 
capacitance of the order of 300 pF mm”. Under 
steady-state operation (active matrix driven), 
this property matters little; however, when pulse 
drive is required (see the later section on pas- 
sive matrices), the capacitance becomes very 
important. The capacitance could be reduced by 
increasing the thickness of the emission layer; 
however, this would also increase the drive volt- 
age of the diode. As the energy required to charge 
the capacitance varies as the voltage squared, a 
thicker device will tend to have an increased 
power consumption. 


8.2.1.1.2 Diode properties 


An OLED device has a well-defined threshold 
voltage, above which the current flowing through 
the device turns on rapidly. The threshold voltage 
depends on the charge injection at the contacts (i.e., 
the voltage required between the contacts and the 
organic layers for charge injection to occur) and 
on the bandgap of the emission layer. In general, 
the bandgap depends on the wavelength emitted 
and on the type of emission. Shorter wavelengths 
(blue rather than red) require a larger bandgap 
and hence a higher threshold voltage, and at any 
wavelength a phosphorescent emitter requires a 
larger bandgap than a fluorescent emitter. Typical 


turn-on voltages might range between 2 and 
4V. It should also be noted that, in general, the 
threshold voltage will reduce with temperature. 
Unlike typical inorganic LEDs, the (steady-state) 
current-voltage response of the OLED is not 
exponential—at least, not once there is a signifi- 
cant current flowing. 

Figure 8.2 shows a typical I-V curve for an 
OLED device. Under low fields, the device is 
largely injection limited and this does indeed give 
rise to an exponential response. However, once 
the voltage is taken a little above threshold, the 
I-V response follows an approximate power law, 
similar to an old-fashioned thermionic diode, due 
to the current flow becoming space charge limited 
li.e., the injection of charges into the emission layer 
is inhibited by the concentration of like charges in 
the low conduction emission layer(s)]. Unlike a 
thermionic diode, the power law is not necessarily 
quadratic but can be anything from a power of 2-4 
due to the carrier mobility being field dependent. 
The full I-V properties of an OLED device can 
be described by using the Murgatroyd equation 
[6]. However, over a specific region of interest, the 
response can usually be adequately approximated 
as J (V—V,)”, where J is the current density, V the 
applied voltage, V, the threshold voltage, and n an 
exponent depending largely on the field-dependent 
mobility. 
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Figure 8.2 A typical I-V curve for an OLED 
device (diamonds) with an exponential fit of 
current density (dashed line) for low fields and 
a power law fit (solid line, see text) for high 
fields. 


8.2.1.1.3 Reverse leakage 


An OLED is a very thin large-area structure. The 
thin structure and large area results in an inevita- 
ble leakage current even if the device is perfectly 
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Figure 8.3 I-V characteristics of a number of 
similar devices, six with good and two with poor 
reverse leakage characteristics. The voltage 
offset, at low currents, of the good devices is a 
measurement artifact. 


fabricated with uniform layers and no defects. 
Any nonuniformity will increase the local elec- 
tric field and therefore the leakage. This prob- 
lem is termed “reverse leakage” because it is only 
under reverse bias that leakage effects become 
noticeable. Figure 8.3 shows the I-V characteris- 
tics of a number of similar devices, two of which 
show poor reverse leakage properties. For an 
active matrix-driven device, this is not a problem 
as devices are never strongly reverse biased. For 
a passive matrix display, however, leakage has a 
large impact on display performance (see the sec- 
tion on passive matrix driving for more details), 
and so care must be taken with the selection of 
materials and fabrication process to minimize 
this effect. 


8.2.1.2 OPTICAL RESPONSE 
8.2.1.2.1 Light output: Device driving 


The current efficiency of a device is usually 
expressed in units of cd A™ and is typically 
reasonably stable against drive level, device-to- 
device variation, temperature, and only slowly 
degrades as the device ages. This results in a light 
output that is, over a normal operating range, 
linear with current. In contrast, the light output 
against voltage is highly nonlinear and varies a 
great deal from device to device, with tempera- 
ture, and as the device ages. Figure 8.4 shows the 
light output against voltage and current for three 
nominally identical devices, demonstrating the 
reproducibility (and linearity) of current drive. 

It is clear that in order to produce displays 
that are uniform, have a long life, low burn-in, 
and accurate gray-scale control, current drive is 
required. 


8.2.1.2.2 Response time 


OLED devices can switch extremely quickly. 
The specific response time varies with materials, 
device geometry, and applied field, but is typically 
of the order of 10 ns-1 ys, i.e., essentially instanta- 
neous as far as a display application is concerned. 
Any observed motion artifacts will, therefore, 
be a function of the drive scheme rather than a 
fundamental property of the OLED. This will be 
covered in more detail in the section on display 
performance. 
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Figure 8.4 Light output against voltage and current for three nominally identical devices, demonstrat- 
ing the reproducibility of current drive and its superiority over voltage drive. 


8.2.1.2.3 Light output coupling 


OLED devices can be designed to emit light 
through either the anode (known as bottom emis- 
sion as the anode is typically formed on the sub- 
strate), the cathode (top emission), or even both for 
transparent displays. This choice of device type has 
a strong impact on the optical output, both color 
and efficiency, but in either case the out-coupling 
problem for an OLED is the difficulty of extract- 
ing light from a high-index emissive layer out into 
air, whether this is through the substrate or out 
through the cathode. 

Organic semiconductors, whether small-mole- 
cule or polymer, are media of high refractive index. 
When light propagating through such a medium 
meets its interface with a low-index medium such 
as air, if its angle of incidence is too shallow then 
it will undergo total internal reflection and not 
escape. The fraction of light thus trapped will 
depend on the angular emission profile from the 
emitting layer(s). Unless steps are taken to reduce 
this effect, the fraction of emitted light that leaves 
the device is on the order of 17% for small-mole- 
cule OLEDs and 30% for polymer OLEDs [7]. 

In principle, there are two approaches to 
increasing the optical out-coupling: coupling 
light out of the trapped optical modes or increas- 
ing the fraction of light emitted into the out-cou- 
pled modes. To extract light from trapped modes 
typically some kind of scattering structure is 
required. For display applications, the efficiency 
benefits are often outweighed by the side effects 
of increasing the spatial cross talk (guided light 
from one pixel out-coupled at the next) and 


ambient reflection, so that both the dynamic 
and static contrast are reduced (see Section 8.2.3 
on display performance), and so this approach is 
only really considered for lighting applications 
(see, e.g., Reference [8]) and not used in displays. 

Increasing the proportion of light emitted into 
out-coupled modes is now part of the standard 
device design optimization and involves the care- 
ful tuning of the device thickness and emission 
zone with further enhancements possible through 
the addition of partially reflective layers [9]. In this 
way, efficiency and color can be tuned to better 
match the display requirements (e.g., producing 
a deeper blue). Side effects can be some color and 
luminance variation with viewing angle, although 
usually much less than often observed with LCD 
displays. 


8.2.1.3 OLED AGING AND BURN-IN 


As a device is driven, there is a gradual loss of per- 
formance due to degradation of the functional ele- 
ments of the device. This primarily manifests as an 
increase in the voltage required and a reduction in 
the light output at given current density. 

Usually the only quoted lifetime figure is a lumi- 
nous intensity half-life for a given current drive or 
initial brightness; however, voltage aging can be 
as important for some applications. In particular, 
even if a device has a huge half-life, the voltage 
increase can reduce the system lifetime if there is 
insufficient headroom on the supply voltage. 

In all cases, it is the system life, determined by 
the driving conditions and the minimum acceptable 
display performance, which should be calculated to 
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Table 8.1 Required subpixel luminous intensities to achieve a Rec.709 standard white 


Color set A Color set B Color set C 
CIE x, y Y (cd/m?) CIE x, y Y (cd/m2) CIE x, y Y (cd/m?) 
Red 0.64, 0.33 96.1 0.64, 0.33 85.1 0.64, 0.33 121.2 
Green 0.30, 0.60 220.3 0.30, 0.60 286.1 0.20, 0.70 249.2 
Blue 0.15, 0.15 83.6 0.15, 0.06 28.9 0.15, 0.06 29.5 


Assumptions include a 100 cd/m? white average luminance, a CIE white point of 0.3127, 0.3290, and a 25% aperture 


ratio per subpixel. 


compare potential material sets. This is particularly 
the case for RGB display systems (which form an 
image using separate red, green and blue emitting 
sub-pixels) where the luminance half-life is often 
not the most relevant figure, as it is differential 
aging (usually referred to as burn-in), both between 
the colors and between different areas of the screen, 
which is usually most apparent to the user, and can 
become apparent at very modest decreases in device 
efficiency, for example, a degradation of only 3%-5% 
[10]. Furthermore, in comparing different options of 
available materials, it is important to calculate the 
system lifetime of each complete RGB material set 
rather than taking one component in isolation. A 
common example of this is the choice of blue mate- 
rial. It is typically the case that deep blue emitters 
will exhibit a shorter lifetime, at a given luminous 
intensity, than lighter blue alternatives. However, 
the proportion of red, green, and blue subpixel light 
output required to produce a target white point 
depends on the color points of each of the subpixels 
and on the desired white point. 

Table 8.1 shows three examples of RGB color 
sets and the luminous intensities these would 
require. Color set A includes a light blue and set 
B includes a deeper standard blue. Although the 
deeper blue may have a shorter nominal lifetime 
than the lighter blue, the deeper blue will not need 
to be driven as hard and, therefore, will have a lon- 
ger lifetime than the headline figure might suggest. 
The color coordinates of the red and green emitters 
will also affect the result (see, for example, color set 
C, which has a deeper green). The impact of a given 
OLED lifetime on the performance of a display 
can only be properly assessed as part of a complete 


RGB color set. 

A discussion of the potential causes of aging 
effects is beyond the scope of this review, and a state- 
ment of “current” specifications, considering the pace 
of development, is liable to be seriously out of date 


by the time of reading. However, the reader needs 
to be aware that aging of a device depends strongly 
on drive level, duty cycle, and environmental condi- 
tions (particularly temperature). Furthermore, the 
lifetime under one set of conditions cannot always 
be extrapolated reliably to another. It is, therefore, 
important to obtain the aging characteristics for the 
conditions specific to an application. 


8.2.2 Display driving 
8.2.2.1 PASSIVE MATRIX DISPLAYS 


A passive matrix is the simplest possible matrix 
display and consists of arrays of rows and columns 
forming pixels where they intersect. One row is 
selected and all the pixels on that row are driven 
from the columns. The other rows are selected in 
turn until the entire frame has been scanned. From 
their inception OLEDs were, due to their diode 
nature, touted as ideal for the fabrication of passive 
matrix displays, with visions of huge screens for 
very low cost. Of course, life is never that simple. 
As we shall see, the size of a passive matrix display 
is limited by the peak current required for pulsed 
light emission, by current leakage and, in particu- 
lar, by the effects of device capacitance. 


8.2.2.1.1 Passive matrix operation 


Figure 8.5 shows (1) a functional diagram of an 
OLED passive matrix and (2) the current, voltage, 
and light output of one pixel during a line scan. 
The line scan sequence (assuming current drive) is 
as follows, with the step numbers corresponding to 
the labels in Figure 8.5b: 


1. The row is selected (usually switched to ground 
while other rows are held high) and the cur- 
rent, I, is driven into the column. 

2. The voltage across the device will increase lin- 
early until the threshold voltage is reached. The 
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(a) Column driver 


(b) 


Row driver 


Figure 8.5 (a) Functional diagram of an OLED passive matrix and (b) current, voltage, and light output 


versus time of one pixel during a line scan. 


time this will take is t.=CV,/I, where C is the 
capacitance of the column and V, the threshold 
voltage. 

3. The pixel is now emitting light and the voltage 
gradually tends toward the nominal drive level. 
The time constant for this is harder to derive as 
it depends on the specific I-V relationship of 
the device. 

4, The current supply is turned off and the voltage 
across the device rapidly drops to the threshold 
voltage, and then very gradually drops back to 
zero, assuming no other pixels are driven. 


Electrically, to complete the picture, the resistive 
losses from the pixel’s row and column, plus the 
driver compliance, need to be added. Taking all 
of these into account, a simple model can be con- 
structed, which, despite its simplicity, can be a pow- 
erful predictor of the expected display performance. 

Figure 8.6 shows results from an OLED passive 
matrix model, predicting power consumption per 
pixel versus number of rows. This is an iterative 
model based on the electrical response of an OLED 
pixel as described in Section 8.2.1.1. These results 
show that the potential dominance of capacitive 
power consumption in larger displays. With more 
rows comes more column capacitance and more 


charging cycles leading to an approximately qua- 
dratic increase in capacitive losses with row count. 
Furthermore, there is the knock-on effect of an 
increased current demand causing more resistive 
losses and driver compliance losses. It is these capac- 
itive losses that often limit the practical size, resolu- 
tion, and/or brightness of a passive OLED display. 


8.2.2.1.2 Leakage and defects 


All OLED devices in a passive matrix are con- 
nected to one another via the common anode and 
cathode lines. Therefore, a single pixel’s defect or 
current leakage can have an impact on the perfor- 
mance of the whole display. The effects on display 
operation of current leakage (localized or over the 
display) and other display defects depend on the 
details of the specific drive scheme. 

The simplest possible driver with no gray scale, 
an off-state anode clamped to 0V and off-rows 
open circuit, is relatively unaffected by display 
defects. Pixel shorts on their own will, largely, only 
prevent the operation of that specific pixel and 
reverse leakage will, if not excessive, cause a slight 
loss of brightness of the pixel being driven (assum- 
ing current drive is used). With more significant 
leakage, spatial cross talk, apparent as streaking 
along the row and column from the active pixel, 


276 Organic electroluminescent displays 


100% 
90% 
80% 
70% 
60% 
50% 
40% 
30% 
20% 
10% 

0% 


Contribution to total 
power consumption 


32 48 


(7) Diode 
(CN Driver 


=) Cap 


we Total 


Power consumption 
per pixel (uW) 


64 96 
Number of rows (multiplex ratio) 


(TI Resistive 


Figure 8.6 Power consumption per pixel versus number of rows in a passive matrix display. Results are 


calculated from an analytical model. 


can become increasingly visible, although the 
acceptable threshold for this depends upon the 
application and displayed images. The opposite 
extreme might be a driver that uses methods to 
actively drive the deselected rows and columns off 
to minimize the effects of capacitance and improve 
contrast and gray scale but where a shorted pixel 
would cause all other pixels on a common row or 
column to cease functioning. 

The selection of a driver for a display needs to 
be informed not only by the target application but 
also by the level and type of defects and leakage 
likely to be present in the display and the sensitiv- 
ity of the specific drive scheme to such defects. A 
high-fidelity gray scale and contrast capable driver 
can only work well with a zero defect and low leak- 
age display—if the display in question tends to suf- 
fer from defects of any form then, in general, the 
simpler the driver the better. 


8.2.2.1.3 Intensity-scale methods 


There are two ways to modulate the quantity of light 
emitted by a pixel during a line scan—by changing 
the height of the current pulse (analog drive) or its 
duration (pulse width modulation [PWM)). 

Driving an OLED ata lower current for a longer 
time (analog drive) might seem, on first inspection, 
the better option. A lower average drive voltage 
lowers both the direct OLED power consumption 
and the (often considerable) capacitive losses. A 
lower drive level also reduces aging. Unfortunately, 
analog gray scale can also result in poor image 
quality as described below. 


PWM always has a full charge-discharge cycle 
per frame, whereas, with analog drive, the charg- 
ing and discharging are always only the difference 
in drive level between pixels, resulting in another 
power advantage for analog. However, this also 
results in the initial state of the voltage on a pixel 
being dependent (for analog drive) on the state of 
the previous pixel causing potential cross talk in 
the form of vertical edge blurring. 

The most serious difficulty for analog drive, 
though, is dynamic range. Human perception of 
brightness is highly nonlinear and this is accom- 
modated for in displays by making the pixel light 
output proportional to the gray level raised to 
some power (referred to as a gamma), typically 2. 
A display with a modest 32 levels of brightness and 
a gamma of 2 requires a 1024:1 ratio in actual light 
output between its brightest and darkest levels. On 
top of this, there is also often a global brightness 
setting that might require a further 100:1 dynamic 
range. For an analog-driven display, this becomes 
a ~100,000:1 ratio in drive currents, which is 
extremely challenging. For a PWM-driven dis- 
play, however, it is relatively simple to obtain the 
1024:1 dynamic range through the PWM pulse 
timing with a cruder setting of reference current 
for the global brightness setting. Almost all passive 
matrix OLED drivers use PWM control. 


8.2.2.1.4 Capacitance charging 

Pixel capacitance can cause three effects: a power 
loss, an offset in the light output to a given drive 
level and cross talk through residual capacitance. 
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The power loss has been dealt with previously, the 
offset in light output is due to the time taken to 
charge the pixel up to the operating voltage, and the 
cross talk is due to the residual charge left at the end 
of a line scan (and present at the start of the next). 

The simplest way to minimize the power loss is 
simply to do nothing, as corrective actions for the 
other two effects tend to increase power consump- 
tion. However, for some displays the cross talk, in 
particular, can cause problems, and in this case 
some form of precharge and discharge is often used. 

Pixel precharging is typically implemented by 
applying a voltage to the pixel column, prior to 
the current drive signal, which charges the pixel 
to a set voltage. This injects current more rapidly 
than the current drive would be able to and thus 
keeps the charge-up “dead” time to a minimum. 
Pixel discharging ensures that the voltage is rap- 
idly reduced at the end of the PWM pulse. 

Using precharge and discharge can substan- 
tially improve the visual performance of a display. 
Unfortunately, it can also substantially increase 
power consumption. It might be possible to use 
charge recycling methods, such as are used with 
LCD displays, to reduce the power consumption 
of an OLED passive matrix. The limiting factor in 
this case is the anode [typically indium tin oxide 
(ITO)] resistance. If Q is the charge, R is the anode 
resistance, and t is the time that is available to 
recover the charge, then resistive energy loss dur- 
ing recovery is Q?R/t. This means that, although 
such recycling could help reduce power consump- 
tion, in large high-resolution displays, where 
capacitive losses are greatest, there is also the high- 
est anode series resistance and least time in which 
to recover charge. The benefit of charge recycling 
would be limited unless thicker ITO or anode bus 
bars were used to reduce the series resistance. 


8.2.2.1.5 Multiplex control 


There are a number of controller enhancements 
that can be achieved through multiplex control 
and all take advantage of the graph in Figure 8.6— 
the lower the multiplex rate (the number of rows 
over which the column drive is shared), the lower 
the power consumption. 

The simplest method of making a display more 
efficient involves splitting the columns across the 
center of the display and driving them from top 
and bottom, in essence splitting the display into 
two passive matrices. This comes at the cost of an 


increased component count (double the column 
drivers, although they need handle less than half 
the power) and possible lower yield (double the col- 
umn connections to make). Nevertheless, this is a 
valuable method in achieving larger displays, dou- 
bling the row count being practically achievable. 

Selective scanning modes, reducing the multi- 
plex rate by driving only a subset of the rows, can 
also prove to have substantial benefits. This could 
be a fixed display mode, for example, only driving 
the central quarter of a display, or dynamically 
skipping blank rows. 

Multiline addressing schemes have in the past 
provided large improvements to the performance 
of passive matrix LCD screens and some attempts 
[11,12] have been made to achieve the same in 
OLED. Unfortunately, both the algorithms and 
drive schemes required to implement multiline 
addressing are complex, and they were largely 
overtaken by the adoption of active matrix back- 
planes for OLED displays. 


8.2.2.2 ACTIVE MATRIX DISPLAYS 


From the above, it is clear that OLED passive 
matrices, while capable of providing high-quality 
image reproduction for small display formats, are 
unsuitable for expansion to larger display formats. 
The solution to this is to form an active matrix of 
simple per-pixel thin-film transistor (TFT) drive 
circuits patterned onto the display glass, build- 
ing on the active matrix technology developed for 
LCDs. However, as with passive matrices, there are 
particular factors that need to be considered when 
applying active matrix technologies to OLEDs, 
the primary ones being TFT carrier mobility, TFT 
threshold voltage nonuniformities, and OLED 
nonuniformities. The first of these restricts our 
choice of TFT technology, and the other two lead 
us to prefer current-programmed drive circuits 
over voltage-programmed despite their greater 
complexity. 

Table 8.2 shows typical carrier mobility for the 
various TFT technologies available, along with 
crystalline silicon for reference. Figure 8.7 shows 
the driver channel width requirements of a transis- 
tor to control a current through an OLED pixel to 
achieve a 300 cd m” display luminance assuming 
typical device dimensions and OLED efficiencies. 
The >100pm driver channel width requirement 
of amorphous silicon (a-Si) is significant when 
compared to the 300pm pixel pitch. This does 


278 Organic electroluminescent displays 


Table 8.2 Mobilities of available transistor technologies 


TFT technology 

a-Si (amorphous Silicon) 

Oxide TFT (e.g., IGZO) 

LTPS (low-temperature polysilicon) 
CGS (continuous grain silicon) 
Bulk silicon 


Majority carrier mobility (cm? V-' s~*) 
<1 
10-40 
30-150 
300-500 
~1000 


10000 


1000 


100 


10 


TFT channel width (um) 


Crystalline Si 


“O01 1 10 100 1000 


Carrier mobility (cm? V7! 57) 
Figure 8.7 Channel widths of transistors capable 
of driving an OLED pixel at 300 cd m~, assum- 
ing an efficiency of 10 cd A“, pixel pitch of 
300m, and typical values for TFT electrical 
characteristics. 


not rule a-Si out as a potential TFT technology; 
however, it certainly makes the case borderline. 
Low-temperature polysilicon (LTPS) and its vari- 
ants [e.g., continuous grain silicon (CGS)] are the 
favored current production technology but the 
processing required is complex and expensive. 


(a) 
Reflecting cathode 


Qs... Qs 


Possibly the most promising TFT technologies are 
the relatively new, transparent oxides, for example, 
indium-gallium-zinc oxide (IGZO) [13]. These 
seem to offer the simplicity of a-Si with a perfor- 
mance approaching that of LTPS. 

TFT threshold voltage variation is an issue 
with most TFT types. Polycrystalline devices (e.g., 
LTPS) often have grain boundary variations from 
device to device, whereas amorphous devices (e.g., 
a-Si) typically show threshold voltage shifts with 
age. Such variations can, if not compensated for, 
cause significant brightness nonuniformities in a 
display. Of course, variations in the OLED pixel 
response can also cause nonuniformities that, 
although not dependent upon the TFT technology 
directly, is dependent upon the design and opera- 
tion of the pixel circuit, which in turn is limited by 
the choice of TFT technology. 

There are a number of circuit types (and a great 
many variations thereof) that can be used to drive 
an OLED pixel, and these are covered in the fol- 
lowing sections. However, there is one other design 
factor important to the operation of an active 
matrix display and this is the choice of top or bot- 
tom emitting structure. 
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Figure 8.8 Cross sections of bottom emitting and top emitting devices. 
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The standard OLED structure uses a transpar- 
ent anode (e.g., ITO) and reflective metallic cathode 
(see Figure 8.8a). This means that the device must 
emit light through the substrate and therefore, as 
with LCDs, the pixel area must be shared between 
the pixel circuitry and emitting area. However, 
there is another option that uses a metallic anode 
and thin transparent cathode, typically referred 
to as a top emitter (see Figure 8.8b). The resulting 
increase in aperture ratio (the ratio of the emitting 
area to the total pixel area) gives a boost to display 
lifetime as a lower current density in the OLED 
device suffices for a given brightness and can also 
increase efficiency primarily by allowing larger 
TFT areas (less TFT voltage drop). These advan- 
tages are offset by significant disadvantages: the 
encapsulation must both be transparent and pos- 
sess better barrier properties than in the standard 
case as the thin, highly reactive, electron injector is 
no longer protected by a thick metal layer. 


8.2.2.2.1 Voltage programming 


Figure 8.9a shows a schematic of a typical active 
matrix OLED (AMOLED) display panel with 
Figure 8.9b showing the simplest OLED pixel 


Row select driver 


drive circuit. With an LCD all that is required is 
a method to fix a voltage over a capacitor, so the 
only transistor required is a select device acting as 
a switch to connect the cell to the data line when 
the row on which the pixel resides is addressed. In 
the case of an OLED, a drive transistor is needed 
through which current can be controlled, thus the 
simplest circuit uses a select TFT to set a gate volt- 
age on a drive transistor, and the gate voltage is 
set to produce the required luminous output [14]. 
However, the current controlled through the drive 
transistor is dependent not only on the gate voltage 
but also on all the TFT characteristics including, in 
particular, the threshold voltage. 

The alternative way of driving the two-TFT 
cell is to turn on the transistor hard and use dis- 
play subframes to achieve gray scale. This method 
does work well; however, it is essentially a voltage- 
driven method and so suffers from sensitivity to 
OLED variations and accelerated aging (compared 
to current drive), as discussed in Section 8.2.1.2. 

Attempts have been made to compensate for 
variation in TFT threshold voltage; one such [15] 
can be seen in Figure 8.9c. During the line scan, the 
select transistor is first activated and 0 V applied to 
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Figure 8.9 Voltage-programmed active matrix pixel circuits: (a) schematic of an active matrix, 
(b) simplest OLED pixel circuit, and (c) threshold voltage-compensated circuit. 
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the data line. TFT 3 is closed, which discharges the 
charge on the gate of the drive TFT, bringing its 
voltage down to threshold so that when TFT 3 is 
subsequently opened the threshold voltage is held 
at the drive TFT gate. The signal applied on the 
data line will now produce a voltage at the drive 
TFT gate offset by the threshold voltage. The select 
transistor is deactivated holding the drive level and 
TFT 4 is closed allowing current to flow through 
the OLED. Although it compensates for threshold 
voltage changes, this circuit still depends on the 
uniformity of the other parameters of the drive 
TFT, as well as requiring two TFTs that can handle 
the full drive current and two extra control lines. 
Both of the above circuits also suffer, when ana- 
log driven, from the highly nonlinear relationship 
between gate voltage and drive current. 


8.2.2.2.2 Current programming 


All current drive schemes have the advantages of 
linearity and relative insensitivity to transistor and 
OLED variations. Two general methods are pos- 
sible in principle—the setting of OLED current 
directly or indirectly through a current mirror. 
Figure 8.10a shows a typical example of the 
first of these circuits of which there are many 
variations [16,17]. At the start of the addressing 
period, the deselect TFT is opened and the select 
TFTs are closed, diverting the output from the 
drive TFT to the data line. The display control- 
ler sinks a current through the data line and any 
mismatch between this current and that supplied 
by the drive TFT will modify the charge on the 
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capacitor until the currents are balanced. At the 
end of the addressing period, the select TFTs are 
opened and the deselect TFT is closed, holding 
the charge on the capacitor and redirecting the 
drive current to the OLED. This pixel circuit has 
excellent uniformity and extremely good linear- 
ity resulting in a high fidelity of gray scale. Its pri- 
mary disadvantage is that it requires three TFTs 
that must be capable of handling the full drive 
current as well as needing two complementary 
control lines. 

Figure 8.10b shows the current mirror circuit 
that was used in an early 13-in. OLED demonstra- 
tor [18]. The operation of this circuit relies upon 
an amplifying current mirror. The relationship 
between the geometries of the drive and mir- 
ror TFTs results in a scaling factor between their 
drain-source currents when a common voltage is 
applied to the gate. This relationship can only work 
when other TFT parameters are sufficiently simi- 
lar between the two devices. However, this is the 
case when the TFTs have similar layouts and are 
in close proximity. During the addressing period, 
the erase TFT is initially closed to clear the charge 
on the capacitor. The scan TFT is then closed, a 
current is sunk through the data line and the volt- 
age on the common gate capacitor adjusts so that 
the mirror TFT supplies this current. The erase 
and scan TFTs are then opened and the voltage is 
held on the gate, producing a scaled current in the 
drive TFT. This circuit possesses the advantages of 
the previous circuit without the problems of two 
control lines or three large TFTs. Use of either of 


(b) 


Scan 
Vs 
Scan : 
TET Mirror Drive 


TFT 


Erase 
TET 


Erase 


Data 


Figure 8.10 Current-programmed pixel circuits: (a) current sampling circuit and (b) amplifying current 


mirror. 
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these pixel cells can produce displays with excel- 
lent uniformity and a high fidelity of gray-scale 
reproduction. 

The circuits shown and described above are 
examples of the most common approaches taken 
when designing OLED drive circuits. Further 
subtleties in the design of the circuitry and drive 
scheme, not discussed here, can relate to the par- 
ticular choice of TFT technology, OLED devices or 
the specific requirements of the display design, for 
example, to reduce the effects of burn-in and pixel 
aging [19]. 


8.2.3 Display performance 


This section is inevitably much briefer than it could 
be as this is a very big subject. The key display per- 
formance metrics will be discussed in the context of 
AMOLED displays, as passive matrix (PMOLED) 
displays are typically now only used in applica- 
tions where high performance is not an issue. For 
more information, see the further reading section 
at the end, in particular, the Displaymate website, 
which has some excellent analyzes of the perfor- 
mance of different display technologies. 


8.2.3.1 STATIC IMAGE PERFORMANCE 


There are six properties that determine how well a 
static image is reproduced on a display: 

Luminance is the maximum brightness, which 
a display can produce. With an emissive display 
(including OLED) there will always be a trade- 
off with lifetime, and with any display a trade-off 
with power consumption. As a headline figure, it 
can also often be misleading—a display with twice 
the luminance does not appear twice as bright. A 
useful approximation is that perceived brightness 
varies as the cube root of luminance, so twice the 
luminance means only 25% more perceived bright- 
ness. LCDs can often have a much higher lumi- 
nance than an equivalent OLED panel, although 
these days this is often just a design decision (see 
black level) rather than a limitation. 

Black level determines the contrast of the dis- 
play. What makes OLEDs stand out is that, under 
dark conditions, a black pixel has a near zero 
luminance, even if nearby pixels are on full white. 
Contrast is the white-to-black luminance ratio 
and has a very strong impact on perceived image 
quality. A high contrast makes the image appear 
brighter and more colorful. A really high contrast, 


that is, a truly black background on a display, 
removes the gray rectangle we are used to watch- 
ing images within and makes some scenes look 
much more realistic. The contrast often quoted 
with displays is the “dynamic contrast,” which is 
the ratio of an all-white screen luminance to an 
all-black screen, which can be >100,000:1 for an 
LED backlit LCD display. This has little relevance 
to actual performance. Static contrast, that is, the 
white-to-black ratio of patches shown simultane- 
ously, is what really matters. For a good LCD, this 
is often around 1000:1 and for OLED displays, this 
is usually quoted as around 1,000,000:1 [20]. 

The color gamut, that is the range of color, which 
can be reproduced, is determined by the display’s 
RGB color primaries. There is little need to repro- 
duce colors beyond what can be described in the 
image or video format; however, there is also little 
harm so long as the color reproduction is managed 
properly. OLEDs have gained somewhat of a repu- 
tation for artificially saturated color reproduction, 
but in reality this is a function of the image pro- 
cessing rather than the OLED pixels. Both LCD 
and OLED displays can be made with virtually 
perfect color primaries and most good screens have 
accurate color modes, although it is also true that 
some users, and some markets, seem to prefer dis- 
plays with unnatural supersaturated colors (much 
to the despair of displays experts). It also may be the 
case that, historically, products with OLED displays 
were set up to stand out from competing LCD- 
based devices. Both superaccurate and supersatu- 
rated colors are possible with both OLED and LCD. 

Gray-scale reproduction determines how well 
all of the colors between black, white, and the RGB 
primaries are reproduced. Displays can have a per- 
fect color gamut but still fail to achieve good image 
reproduction because they have poor gray-scale 
control. LCD devices are often highly nonlinear, 
requiring careful image processing and signal con- 
trol to achieve good gray-scale control. OLEDs, in 
contrast, are fundamentally linear devices, and so 
very accurate gray-scale control should always be 
achievable with a suitable drive scheme. 

Resolution is the one area where LCD has an 
advantage. An active matrix LCD requires just one 
transistor per pixel, whereas an AMOLED needs 
at least two, and often four, per pixel. A 1080p 
(1920x1080 pixel) display would need 6.2 million 
transistors if it were an LCD, but possibly 25 million 
if it were an OLED. This is not a fundamental issue, 
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but does make the fabrication of OLED backplanes 
much more challenging, and means that some high- 
resolution OLED screens must have a top emitting 
structure so that the transistors sit under the OLED 
allowing both to use the full pixel area. 

Viewing angle has always been quoted as an 
OLED advantage, and indeed OLED displays can 
be viewed from any angle with minimal color or 
luminance changes. Modern LCD screens can, 
however, have wide viewing angles if the appli- 
cation requires it (although this may be at the 
expense of other attributes) and indeed only emit- 
ting light in a restricted viewing range can offer a 
power consumption saving. 


8.2.3.2 DYNAMIC PERFORMANCE AND 3D 


OLED pixels can switch extremely fast. 
Phosphorescent pixels do switch more slowly 
than fluorescent pixels, but still in microseconds. 
Therefore, any motion artifacts are most likely to 
be due to the driving scheme, and in particular the 
screen refresh rate, than anything fundamental to 
an OLED. 


Frame 
i} 
——, 


Screen position Eye-tracked position 


Blur 


|e 


os 
ach: 


8 | \ 
7 | \ 
«6 | \ 
2 5 | \ 
£4 wal S 
: r N 
a NN 


Screen position Eye-tracked position 


Active matrix displays are hold-type displays— 
they hold an image static on the screen for a frame 
before changing it on the next frame (typically on 
a rolling basis taking up to a frame time to update 
the image). If an eye tracks an object moving across 
a screen, the image of the object will be held, for 
the frame time, at a sequence of positions; this will 
cause a blurring or a judder of the moving image, 
as is shown in Figure 8.1la. It would be possible 
to hold the image for a shorter time to reduce this 
effect (Figure 8.11b), but that would either reduce 
the display luminance or, if the luminance were 
increased to compensate, reduce the display life. 
The better option is to use a higher refresh rate and 
indeed many modern TVs have refresh rates of 120 
or 240 Hz (Figure 8.11c). For this to work, the data 
source needs to have high frame rate data or, more 
commonly at present, intermediate frames need to 
be interpolated. LCD screens are often sold with 
120 or 240 Hz refresh rates with the same goal of 
reducing motion artifacts, but the LCD response 
time will still blur the tracked image more than in 
the OLED case (Figure 8.11d). 


(b) 


Frame 


Screen position Eye-tracked position 


(d) Pie 
aN 
6 I XS 
z 5 |! S 
ok \ 
ou S 


Screen position Eye-tracked position 


Figure 8.11 Motion artifacts on hold-type active matrix displays. Each figure shows the displayed 
position of an object (thick black line) for a number of frames relative to the display and to the 
path an observer follows as they track an object across the screen, and the blur length this pro- 
duces. (a) Shows a standard frame rate display with a full-frame position hold, (b) shows a standard 
rate with a half-frame hold, (c) shows a double standard rate display, and (d) shows the case when 


the display pixels have a long response time. 
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The switching time of a display is especially 
important for time multiplexed 3D. Here a user 
wears a special pair of glasses, which have elec- 
tronic shutters in front of each eye synchronized 
to a screen showing alternating left-eye and right- 
eye images (with blanking frames inserted to deal 
with the switching time of the glasses and the roll- 
ing update of the screen). If the screen is at all slow 
to react in switching between images, there will be 
bleed (cross-talk) between the left-eye and right-eye 
views, which can cause significant eye strain in the 
viewer. This is not an issue with an OLED display, 
and indeed current Samsung OLED TVs, for exam- 
ple, the Samsung KN5589, are able to display two 
completely separate video streams and synchronize 
these to two pairs of shutter glasses so two people 
can watch separate TV content simultaneously on 
the same set with no discernible cross talk. 


8.2.3.3 SYSTEM PERFORMANCE 


There are a number of other significant performance 
factors not covered by the above two categories: 

Power consumption is determined by the display 
specification and by the performances of the optical 
display structure, the OLED, and the backplane. The 
typical current consumption will be the peak luminous 
intensity times the average relative luminance content 
of a typical image and the display area, divided by the 
efficiency of the optical stack and the cd/A efficiency of 
the OLED pixels themselves. The voltage over which 
this current needs to be supplied is the sum of the 
(worst case) voltages across the OLED, the drive TFT, 
the panel supply tracks, and any overheads needed, e.g., 
temperature variations and aging. Broadly speaking, 
the OLED is responsible for the current and some of 
the voltage drop and the backplane (TFT and track- 
ing) for the rest of the voltage, so to get a low-power 
consumption it is essential to manage both. The power 
also depends strongly on image content: power con- 
sumption will be lower for user interfaces with a dark 
background. 

At the time of writing, it would seem that for typi- 
cal images OLED screens on mobile devices consume 
somewhat more power than an alternative LED back- 
lit LCD, although the figures are comparable and any 
detailed comparison is masked by a large number of 
design decisions and trade-offs, which differ between 
the two technologies. Of course, this is a rapidly 
changing field for both LCD and OLED so the situ- 
ation will certainly change in future. For TVs, there 
are too few OLED devices so far to make much of a 


comparison, although it is clear that both OLEDs and 
LCDs consume much less power per unit area than 
plasma or cathode ray tube (CRT) screens did. 

Lifetime and burn-in are of course two areas of 
performance, which have been the focus of much 
attention. Although lifetime has been, historically, 
the most discussed, it is really burn-in, which is 
the issue. Due to eye adaptation, a uniform drop 
in luminance, unless very large, is hardly noticed 
under normal viewing conditions. What hurts is 
differential aging between two areas of the screen 
leaving the negative of the average viewed content 
superimposed over the content being observed, 
typically referred to as burn-in or image stick- 
ing. Compared to the typical 50% half-life fig- 
ures quoted, as little as a 2%-3% burn-in [10,21] 
is perceptible to a user. Therefore, OLED displays 
not only need long lives, they also need to ensure 
that this first few percentage drop does not happen 
quickly. This will typically be achieved through 
both improved intrinsic performance and drive 
schemes, which correct for OLED aging (see, e.g., 
Reference [19]). 

OLEDs seem to perform (in terms of life- 
time and burn-in) well enough for at least mobile 
screens. These devices typically have a short prod- 
uct life (are replaced frequently) but are also used 
intensively with repeated fixed graphics (e.g., the 
“home” screen of the device). While burn-in has 
been observed with some OLED smartphones, it 
does not seem to be a significant problem, whether 
through OLED performance or good screen man- 
agement or both. 

Transient effects are related to burn-in. Aging 
and burn-in are long-term permanent effects, but 
some OLED material systems also exhibit short- 
term effects, which can have a similar appearance, 
but from which an OLED will recover. For exam- 
ple, it could be observed on the first commercial 
AMOLED screen, on the Kodak LS633 camera, 
that the white icons could appear inverted over 
a bright scene, but this apparent burn-in would 
fade over a few seconds. These effects were most 
likely due to charge trapping in a device and seem 
to be much less apparent in more recent material 
systems. 


8.3 DISPLAY FABRICATION 


This section describes the structure of an OLED 
display and the key aspects of the components and 
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fabrication process. The discussion will be kept as 
generic as possible, with the different fabrication 
options, e.g., OLED vapor vs. solution deposition, 
brought out in the relevant subsection. The only 
construction option that needs to be brought out 
at the start is the choice between active and pas- 
sive matrix addressing as this has an impact on 
the overall structure. The layers that constitute an 
OLED display will be discussed in the order they 
would be encountered in a fabrication process, i.e., 
from the substrate up, and grouped as layers prior 
to OLED deposition (the substrate stack: substrate, 
backplane, and anode), the organic layers them- 
selves, and finally the cathode and encapsulation. 


8.3.1 Substrate stack 


8.3.1.1 SUBSTRATE 


Glass is the obvious choice of substrate, being used 
in all established display technologies from liquid 
crystal to CRT. Its stiffness and dimensional stabil- 
ity make it straightforward to handle through the 
manufacturing process, and it can easily handle 
the required process temperatures. (For the high- 
est conductivity, ITO requires processing above 
400°C, which is too hot for many plastic substrates.) 

Glass is essentially SiO, with other materials 
added to reduce its melting point while ensuring 
that it remains amorphous (does not crystallize). 
There are very many different kinds of glass, but, 
for the relatively straightforward requirements of 
display substrates, the main contenders are stan- 
dard soda lime and borosilicate glass. Soda lime 
is usually used because it is the cheapest but in 
displays it typically has a layer of SiO, deposited 
on its surface to prevent out-diffusion of sodium. 
Borosilicate has a higher melting point, does not 
have an out-diffusion problem and is tougher, 
particularly against thermal shock. The choice of 
which to use is usually determined by the pro- 
cessing temperature requirements; for example, 
an LTPS backplane would require the use of 
borosilicate. The main disadvantages of glass in 
general are brittleness (displays have a tendency 
to break when dropped), weight, and lack of 
flexibility. 

Plastic becomes a possibility due to the inherent 
flexibility of OLEDs (if not necessarily the other 
display components). Plastics present a difficult 


challenge for liquid crystal-based devices because 
of the need to accurately maintain the thickness of 
the liquid crystal film. The sensitivity to changes 
in this liquid-layer thickness can easily be seen by 
gently pressing on an LCD. Organic light emit- 
ters, being entirely solid, do not suffer from this 
problem. 

The two main requirements on a flexible sub- 
strate are to withstand the processing temperature 
(backplane and electrodes) and offer a sufficiently 
good diffusion barrier to oxygen and water. No 
plastic material can meet the barrier require- 
ments on its own. This is well known in the food 
packaging industry, where the shelf-life of plastic- 
wrapped products (e.g., potato crisps/chips) can be 
significantly increased by using a metallized plas- 
tic bag. Organic displays are incredibly moisture 
sensitive and also require the barrier coating to 
be transparent. Suitable barrier-layer technology 
is being developed, but devices with lifetime close 
to those on glass have not yet been demonstrated. 
Single-layer barrier coatings of transparent materi- 
als, such as silicon dioxide, that are thin enough to 
be flexible, tend to suffer from pinhole defects, so 
the general principle is to apply a multilayer stack 
of alternating barrier layers and plastic layers [22]. 
Care has to be taken to ensure low stress in the fin- 
ished structure. 

It should also be noted that, even without the 
cost of barrier layers, the high-quality plastic mate- 
rials required for displays (typically polyethylene 
terephthalate [PET] or polyethylene naphthalate 
[PEN]) are more expensive than glass, although 
that higher material cost may be compensated by 
lower handling costs. 


8.3.1.2 BACKPLANE 


The practice of making LCDs with a simple elec- 
tronic drive circuit at each pixel is well estab- 
lished. The pixel drive circuits are usually made 
from TFTs of amorphous or polysilicon. a-Si has 
good enough performance for most LCD displays 
(charge carrier mobility up to 1 cm? V7 s"') and 
can be deposited at low temperature (300°C). 
Higher performance TFTs can be made by crys- 
tallizing the a-Si, but this requires a temperature 
of ~600°C, beyond the capability of typical display 
substrates. Instead localized processes have been 
developed, for example, using lasers to heat only 
the top layer of silicon to the required temperature 
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to produce so-called LTPS, which can be formed 
on borosilicate display glass giving a huge boost to 
performance (40-120 cm? V"! s!). OLED displays 
have more stringent carrier mobility requirements 
(see Section 8.2.2.2) and at present LTPS is the 
most common active matrix backplane material 
for OLED displays, but it is expensive, can suffer 
from nonuniformities, and is difficult to scale. In 
2004, a new backplane option based on transpar- 
ent oxides was reported [23]. The performance 
of oxides, particularly IGZO, looks increasingly 
promising for OLED display fabrication [24] and 
has the advantage of a very similar process flow 
to a-Si. 


8.3.1.3 ANODE 


For displays emitting through the substrate, the 
most obvious requirements of the anode layer 
are sufficient transparency and conductivity. 
Inevitably, for a layer of a given material, there is a 
trade-off between these two properties controlled 
by the layer thickness, thicker layers being more 
conductive but less transparent. For a material 
with resistivity p (Q cm) and optical absorption 
coefficient « (cm™'), a layer of thickness t has a sheet 
resistance p/t and optical transmission e“. An 
appropriate figure of merit for the material, indi- 
cating its suitability as a transparent conductor, is, 
therefore, the product pa, the sheet resistance of a 
layer, which would transmit 1/e of the light. There 
are other important criteria for anode materials. 
It must be possible to pattern the material, usu- 
ally with a wet chemical etch. The anode material 
must also be sufficiently smooth. A localized mate- 
rial spike of height comparable to the thickness of 
the organic layer will make it thinner, producing a 
large local electric field across the layer, and hence 
increased current and accelerated degradation. 
This may lead to a black spot in the display or even 
a short circuit. There has also been concern that 
the anode material can act as a diffusion source 
of species that might degrade the device perfor- 
mance with time. The conductive metal oxides can 
be thought of as highly doped semiconductors— 
self-doped due to crystal defects partially caused 
by nonstoichiometry. The species in excess (e.g., 
indium in ITO) can diffuse out and interact with 
the organic layers. 

Although the work function of the anode mate- 
rial might be thought to be an important property, 


this is not critical when a highly doped conduc- 
tive polymer such as polyethylenedioxythiophene 
(PEDOT)/polystyrenesulphonic acid (PSS) (see 
Section 8.3.2.2) is in contact with the anode. As 
both anode and highly doped polymer exhibit 
metallic conduction, there is a negligible barrier to 
charge flow regardless of work function. 

ITO is almost universally used as the anode. 
It can achieve resistivity of 1.6 x 10-* Qcm and 
a~104cm"', although it does need annealing at 
400°C to reach this. Where ITO is not the pri- 
mary conductor, for example, in an active matrix 
backplane with a metal track process, the full tem- 
perature anneal may not be required. ITO can be 
etched in concentrated hydrochloric acid. 

Other candidates for the anode material include 
zinc oxide and tin oxide. For a thorough review of 
transparent conductors, see Hartnagel et al. (1995) 
in the further reading list. 


8.3.1.4 SUBSTRATE STACK PREPARATION 


For a passive matrix display, the display manufac- 
turer generally buys glass sheets, already uniformly 
coated with ITO. The desired pattern is imparted 
to the ITO layer using standard lithographic 
techniques involving a wet chemical etch. For an 
active matrix display, the process and patterning 
steps will depend on the technology used and are 
beyond the scope of this chapter, but the result—an 
array of anodes to be coated with the OLED layers 
and a cathode—is essentially the same. 

To avoid short circuits between anode and cath- 
ode, it is vital that the organic layers are continu- 
ous. However, with organic layers typically less 
than 100 nm thick, it does not take a large piece 
of contaminant material to break the continuity of 
the layer. This means that to make a display with 
reasonable yield very thorough cleanliness must be 
maintained throughout the process. The cleaning 
of the substrate is therefore a critical process. This 
is typically done by a combination of mechanical 
agitation (brushing, ultrasonic, and megasonic) 
in a series of liquids (water with detergent and 
then organic solvents). The final step in substrate 
preparation is an oxidizing surface treatment. This 
can be carried out either by oxygen plasma or by 
UV-ozone treatment. It has the combined effect of 
removing any residual organic contamination (e.g., 
photoresist) and changing the surface energy of the 
ITO so that the first organic layer easily wets it. 
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8.3.2 Organic layers 


8.3.2.1 VAPOR PHASE DEPOSITION 


Thermal evaporation in a high vacuum is the stan- 
dard method of depositing all small-molecule 
organic layers. It is a well-established and reason- 
ably straightforward process and has the benefit of 
also being a purification process (most contami- 
nation will evaporate at a different temperature). 
Unfortunately, as OLED materials are not read- 
ily compatible with patterning after deposition 
the only way to pattern these organic layers is to 
do the deposition through a metal shadow mask 
held very close to the substrate. Shadow masking 
is adequate for simple low-resolution displays but 
becomes increasingly difficult as the substrate size 
increases and the pixel size diminishes. Examples 
of the types of difficulties include shadow mask 
thermal expansion during processing or OLED 
deposition on the mask causing aperture clogging 
and mask distortion. For example, a 4.5” diagonal 
HD mobile display (1920 pixels wide) has subpixels 
161m across and might require a position toler- 
ance of 4 pm. If the shadow mask is made of steel 
(thermal expansion coefficient of 13 x 10-§/°C), 
then the mask temperature would need to stay 
within 3°C to keep that position tolerance. 

Despite this, shadow masking is used in vol- 
ume production. To avoid the problems of shadow 
masking, an alternative, originally pioneered by 
Kodak and then LG, is to deposit a white emitting 
stack and then use color filters to obtain red, green, 
and blue subpixels in addition to an unfiltered 
white subpixel. 


8.3.2.2 SOLUTION PROCESSING 


Solution processing has been used for fabricating 
some commercial displays, but so far this has been 
used only for simple spin-coated monochrome 
devices and has largely, if not entirely, been sup- 
planted by vacuum deposition. There is, how- 
ever, substantial activity in developing printing 
techniques for producing large area RGB OLED 
screens, driven by the difficulties of yielding 
shadow-masked vacuum-processed screens on a 
large scale. Traditionally, solution processing has 
been used for polymer OLED materials only but 
there is now active development of soluble small 
molecule materials systems [25]. The rest of this 
section will discuss polymer solution processing; 


however, much of it should be relevant to suitable 
small molecule material systems. Throughout this 
section, the term ink will be used to describe the 
solution of the organic layer in a solvent. 

Spin coating is possibly the simplest way of form- 
ing an OLED device. The simplest polymer devices 
can use only two layers. The first, known as the hole 
conduction layer, is usually PEDOT and PSS, with a 
large excess of PSS [26], in an aqueous ink. It is spin 
coated to give a layer typically 70 nm thick and then 
baked to speed up the drying process. 

The second layer is the emissive polymer, which 
is spun from an organic solvent, usually toluene or 
xylene. To achieve uniform appearance over the 
area of a display, the light emitting polymerlayer 
must be of uniform thickness, typically 7045 nm. 
To achieve this level of uniformity on large sub- 
strates, a multistage spin coating process (multiple 
speeds and times) is used. As well as simplicity, spin 
coating also has the advantage that the action of 
spinning the ink over the substrate and throwing 
off the excess also tends to clear off some remaining 
contaminants. Its disadvantages are the high mate- 
rial wastage (spun-off ink is not recoverable) and no 
patterning ability. 

Inkjet printing is the most promising method 
for manufacturing multicolor polymer displays 
[27]. As explained later, there are tighter con- 
straints on properties of the polymer solution than 
for spin coating and typically two or more solvents 
are used to make a polymer ink. An inkjet print 
head is essentially an ink-filled reservoir connected 
to a small nozzle. In equilibrium, the ink does not 
flow out of the nozzle due to a combination of sur- 
face tension forces and the reduced pressure in the 
head resulting from having the main ink supply 
below the head. To print, a pulsed pressure wave 
is generated insider the reservoir, typically using 
a piezoelectric actuator, and this causes a drop of 
ink to be ejected from the nozzle and propelled 
toward the substrate. For this process to work well 
in a given design of inkjet head, the surface tension 
and particularly the viscosity of the ink must lie 
within narrow limits. As the total mass of polymer 
ejected in the drop will determine the final film 
thickness, the concentration of the ink is fixed by 
the drop volume, the pixel area and the number 
of drops printed per well. These combined con- 
straints make developing good inks a complex pro- 
cess. There is a further complication relating to the 
requirement that, within each pixel, the polymer 
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Figure 8.12 Cross sections of inkjet printed 
polymer films from a range of different solvent 
blends. Height measurements were taken using 

a white light interference scan. Solvent blends 
were (a) 90:10, (b) 70:30, and (c) 50:50 (low boiling 
point:high boiling point). In this case, the 50:50 
blend gives the flattest film. 


film must be flat over a good proportion of its total 
area. Generally, when drops of solution are allowed 
to dry on a surface, the resulting film is not flat. 
This is commonly seen when spilt coffee is allowed 
to dry leaving a dark perimeter, and the result is 
often referred to as the “coffee ring effect.” The 
actual profile produced depends on a number of 
effects, namely the solvent evaporation rate across 
the drop surface, the speed of liquid flow within 
the drop, and the speed of solute diffusion in the 
solvent. As a rule of thumb, low-boiling point sol- 
vents tend to give a peak at the edge and high-boil- 
ing point solvents give a peak in the middle. This 
means that a sufficiently flat film usually requires a 
blend of two appropriate solvents. Cross sections of 
polymer pixels are shown in Figure 8.12. 


8.3.2.2.1 Other printing techniques 


Various other printing techniques, including 
(among many others) Laser Induced Thermal 


Imaging (LITI) [28], Slot Coating [29], Gravure 
printing [30], and Nozzle coating [31], have been 
proposed and demonstrated for patterning the 
organic layers in a display. Each has shown some 
promise but each has also its own, often unique, 
demands on OLED ink properties. At the time of 
writing, inkjet printing remains the main con- 
centration of both materials and process develop- 
ment for solution-processed patterning of OLED 
displays. 


8.3.3 Cathode and encapsulation 


8.3.3.1 CATHODE 
8.3.3.1.1 Cathode materials 


There are, broadly, two approaches to injecting 
electrons into an OLED device. A cathode metal 
can be chosen with a work function suitable for 
direct injection of electrons into the organic layer, 
or a highly doped organic layer can be used between 
the cathode and the organic layer. In either case it 
is desirable to ensure that all OLED devices in a 
display can use a common cathode structure to 
avoid the need for cathode patterning. 

The energy level of electrons injected into an 
organic semiconductor is known as the LUMO 
(lowest unoccupied molecular orbital) and is 
equivalent to the conduction band of a regular 
semiconductor. The work function (the energy 
required for an electron to escape from this level 
into a vacuum) is typically relatively low (2-3 eV). 
Metals that have a similarly low work function are 
typically the group I and II metals, such as calcium 
(2.9eV), barium (2.7eV), and lithium (2.9eV). 
These would usually be deposited as a very thin 
layer, just thick enough to ensure a continuous 
film (~5 nm), and then coated over with, e.g., alu- 
minum, both for protection and to provide more 
conduction. The protection is required because a 
low work function also means that these metals are 
very reactive precisely because it is easy for a reac- 
tant (e.g., oxygen or water) to liberate an electron 
causing the metal to oxidize. 

Improved device performance has been 
reported for both small molecules and polymers, 
using an interface layer of lithium fluoride (or other 
group-one fluorides) between the organic layer and 
the cathode metal. These materials are insulators 
in the bulk but act to enhance electron injection 
when less than a critical thickness (around 10 nm). 
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The mechanism of this improvement is not fully 
understood but probably relates to interaction with 
both the organic layer and the cathode metal [32]. 

A more recent alternative to highly reactive 
metal cathodes is to use a highly doped organic 
layer [33]. This can be done just on the cathode side 
or can be used to construct a complete PIN diode 
structure (P-type doped at the anode, Intrinsic 
(undoped) in the middle, and N-type doped at the 
cathode). For this to work well, other layers are 
often required, such as selective carrier blocking 
layers, and indeed some designs of vacuum depos- 
ited small molecule devices can have 15 or more 
layers where the doped layers are used to stack 
multiple OLED devices in series often to use mul- 
tiple emitters to obtain white-light emission (see, 
e.g., Reference [34]). 


8.3.3.1.2 Additional metallization 


In many display designs, additional metal track- 
ing is needed to reduce the lateral resistance of the 
transparent anode. Without this, pixels a long way 
from the edge would experience a large voltage 
drop. This metal can be deposited and patterned 
using conventional semiconductor processing 
methods. It is usual to use aluminum with an 
adhesion layer of chrome. 


8.3.3.1.3. Processing 


The standard means of providing a metallic cath- 
ode contact is thermal evaporation in high vac- 
uum, using a separate metal shadow mask in close 
contact with the display plate to define separate 
connections to each row of pixels in the display. 
OLED materials are sensitive and it has been found 
that the simplest method of depositing metal with- 
out damaging the polymer is thermal evaporation 
with a low evaporation rate. If other techniques are 
used, such as electron beam evaporation or sput- 
tering, great care must be taken to prevent expo- 
sure of the organic layers to high-energy particles 
(electrons or ions). The injection of electrons into 
the OLED structure depends on contact with a 
low-work function electrode, for example, calcium 
or barium. However, the low work function means 
that these materials are inevitably highly reac- 
tive. It is usual, therefore, to evaporate the mini- 
mum thickness of the primary electrode material 
required to guarantee a continuous film and then 
provide a backing layer of a less reactive metal, 
such as aluminum or silver. The backing metal acts 


as a diffusion barrier and provides a low resistance 
to lateral current flow. 


8.3.3.2 ENCAPSULATION AND FINISHING 


The active layers in a display require encapsulation 
to provide both mechanical and chemical protec- 
tion. Providing mechanical protection is straight- 
forward, but to chemically protect the OLED 
device from reacting with atmospheric oxygen and 
moisture is extremely challenging. The typically 
quoted requirement is for a water vapor transmis- 
sion rate below 10-6 g/m?/day [22]. 

The standard solution is to attach a metal or 
glass “can” over the active display, using a UV-cure 
resin seal around the perimeter. The edge seal is the 
weak point, which allows some ingress. To reduce 
the impact on the display to an acceptable level, it 
is necessary to include a desiccant or “getter” inside 
the package. This is a reactive material with a high 
surface area that captures most of the offending 
water molecules before they can attack the display. 

A potentially lower cost solution to encapsula- 
tion is well established in the semiconductor indus- 
try to provide the required diffusion barrier by 
vacuum deposition of a thin conformal hermetic 
coating. Silicon nitride, aluminum oxide, and alu- 
minum nitride are examples of suitable material. 
The organic materials in an OLED device require 
that the encapsulation is deposited at relatively 
low temperatures, which makes it much harder to 
produce continuous films with no defects such as 
pinholes. The typical technique for OLED displays 
is to deposit multiple alternating layers of an inor- 
ganic barrier and organic buffer layer [35] to pre- 
vent propagation of pinholes in the same location. 
Such an encapsulation stack could be considered 
on its own if good enough or in combination with 
other encapsulation methods. 

After encapsulation, there are a variety of oper- 
ations, the details of which are outside the scope of 
this chapter. A typical sequence would be test the 
displays, scribe and break the substrate into indi- 
vidual displays, attach electrical connections to the 
drive electronics, and test again. 


8.4 SUMMARY 


In the 30 years since invention, OLEDs have devel- 
oped from a barely visible laboratory curiosity to 
full commercial production for a range of display 
sizes from mobile devices to large TVs. 
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By going step by step through OLED display 
operation, performance and fabrication process, we 
hope to have shown the reader where the strengths 
and weaknesses of the technology lie, both for 
organics in general and the differences between 
polymers and small molecule-based devices. 

Organic devices make bright, attractive, and color- 
ful flat panel emissive displays. They have the funda- 
mental advantages over other display technologies of 
a simple structure and low-voltage operation coupled 
with an intrinsically wide viewing angle, near-ideal 
contrast ratio, and an extremely fast response time. 

Vacuum-processed small molecule devices 
currently represent the vast majority of commercial 
OLED production. Vacuum processing, however, 
has a significant hurdle to cross to achieve a high 
yield on large RGB-patterned OLED displays, partic- 
ularly with the push to 4K and even 8K resolutions— 
at present all large format displays in production use 
an unpatterned white OLED layer plus color filters 
to produce RGB emitting pixels. Solution-processed 
polymer and small molecule devices patterned with 
a printing process such as inkjet printing has, per- 
haps, the greater potential for low-cost large area 
manufacture, but still needs to be proven as a viable 
display manufacturing technology. 
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Today’s three-dimensional display systems pro- 
vide new advantages to end-users; they are able 
to support an auto-stereoscopic, no-glasses, 
three-dimensional experience with significantly 
enhanced image quality over previous generation 
technology. There have been particularly rapid 
advances in personal auto-stereoscopic three- 
dimensional display for desktop users brought 
about because of the opportunity to combine 
micro-optics and LCD displays coinciding with 
the availability of low-cost desktop image pro- 
cessing and three-dimensional computer graphics 
systems. 

In this chapter, we concentrate our detailed tech- 
nical discussion on personal three-dimensional 
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displays designed for desktop use as these are 
particularly benefiting from new micro-optic 
elements. We emphasize the systems aspect of 
three-dimensional display design believing it is 
important to combine good optical design and 
engineering with the correct digital imaging tech- 
nologies to obtain a highquality three-dimensional 
effect for end-users. The general principles dis- 
cussed will be applicable to the design of all types 
of stereoscopic three-dimensional display. 


9.2 HUMAN DEPTH PERCEPTION 


Defining the requirements for three-dimensional 
display hardware and the images shown on them 
is an important first step towards building a high- 
quality three-dimensional display system. We 
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need a clear understanding of how a digital stereo- 
scopic image is perceived by an end-user in order 
to undertake valid optimization during the design 
process. 

Binocular vision provides humans with the 
advantage of depth perception derived from the 
small differences in the location of homologous, or 
corresponding, points in the two images incident 
on the retina of the eyes. This is known as stereop- 
sis (literally solid seeing) and can provide precise 
information on the depth relationships of objects 
in a scene. 

The human visual system also makes use of 
other depth cues to help interpret the two images 
incident on the retina and from these build a men- 
tal model of the three-dimensional world. These 
include monocular depth cues (also known as 
pictorial [18] or empirical [39] cues), whose signifi- 
cance is learnt over time, and oculomotor cues in 
addition to the stereoscopic cue [39]. We consider 
these in turn and introduce in detail binocular 
vision both in the natural world and when looking 
at an electronic three-dimensional display. 


9.2.1 Monocular and oculomotor 
depth cues 


Redundancy is built into the visual system and 
even people with monocular vision are able to per- 
form well when judging depth in the real world. 
Therefore, in the design of three-dimensional 
displays, it is important to be aware of the major 
contribution of monocular two-dimensional depth 
cues in depth perception and aim to provide dis- 
plays with at least as good basic imaging perfor- 
mance as two-dimensional displays. Ezra [12] 
suggests this should include levels of brightness, 
contrast, resolution and viewing range that match 
a standard two-dimensional display with the addi- 
tion of the stereoscopic cue provided by generating 
a separate image for each eye. 

The monocular depth cues are experiential and 
over time observers learn the physical significance 
of different retinal images and their relation to 
objects in the real world. These include 


e Interposition: objects occluding each other sug- 
gest their depth ordering. 

e Linear perspective: the same size object at dif- 
ferent distances projects a different size image 
onto the retina. 


e Light and shade: the way light reflects from 
objects provides cues to their depth relation- 
ships; shadows are particularly important in 
this respect. 

e Relative size: an object with smaller retinal 
image is judged further away than the same 
object with a larger retinal image. 

e Texture gradient: a texture of constant size 
objects, such as pebbles or grass, will vary in 
size on the retina with distance. 

e Aerial perspective: the atmosphere affects light 
traveling through it, for example due to fog, 
dust or rain. As light travels long distances, it 
is scattered, colors lose saturation, sharp edges 
are diffused and color hue is shifted towards 
blue. 


Many of these cues are illustrated in Figure 9.1 and 
can be considered to be two-dimensional depth 
cues because they are found in purely monoscopic 
images. Two other non-binocular depth cues are 
available: motion parallax and oculomotor cues. 

Motion parallax provides the brain with a pow- 
erful cue to three-dimensional spatial relation- 
ships without the use of stereopsis [18,39] and this 
is the case when either an object in the scene or the 
observer’s head moves. Motion parallax does not, 
however, make stereopsis redundant, as compre- 
hending images of complex scenes can be difficult 
without binocular vision. Yeh [66] and others have 
shown that both stereopsis and motion parallax 
combined result in better depth perception than 
either cue alone. 


Aerial distortion 


Figure 9.1 Picture illustrating the depth 
cues available in a two-dimensional image. 
(Photographer David Burder.) 
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Oculomotor depth cues are due to feedback 
from the muscles used to control the vergence 
and accommodation of the eye. They are gener- 
ally regarded as having limited potential to help 
depth judgement [16,39,41] and we will move on to 
consider how human binocular vision works when 
used to view the natural world. 


9.2.2 Binocular depth perception in 
the natural world 


Extracting three-dimensional information about 
the world from the images received by the two eyes 
is a fundamental problem for the visual system. In 
many animals, perhaps, the best way of doing this 
comes from the binocular disparity that results 
from two forward facing eyes having a slightly dif- 
ferent viewpoint of the world [5]. The binocular 
disparity is processed by the brain giving the sen- 
sation of depth known as stereopsis. 

Stereo depth perception in the natural world 
is illustrated in Figure 9.2. The two eyes verge the 
visual axes so as to fixate the point F and adjust 
their accommodation state so that points in space 
at and around F come into focus. 

The vergence point, F, projects to the same posi- 
tion on each retina and therefore has zero reti- 
nal disparity, i.e., there is no difference between 
its location in the left and right retinal images. 
Points in front or behind the fixation point proj- 
ect to different positions on the left and right retina 
and the resulting binocular disparity between the 
point in the left and right retinal images provides 
the observer’s brain with the stereoscopic depth 
cue. Depth judgement is therefore relative to the 


The horopter is all Horopter 
points perceived at the } — 
same depth as the point : 

of fixation, F. 


All objects within Panum’s 
fusion are seen as single 
fused images 


——_—_> 


Panum’s fusion 


Figure 9.2 The geometry of the binocular vision 
when viewing the natural world. 


current vergence point, F, and is most useful to 
make judgements on the relative rather than abso- 
lute depth of objects in a scene. 

Points in space, other than F, which project 
zero retinal disparity are perceived to lie at the 
same depth as the vergence point; all points that 
project zero retinal disparity are described as 
being on a surface in space known as the horop- 
ter. The shape of the horopter shown in Figure 
9.2 is illustrative only; it is known in practice 
to be a complex shape and to have non-linear 
characteristics [3,18]. 

Geometrically, we can define angular disparity, 
a, as the difference between the vergence angle at 
the point of fixation, F, and the point of interest. 
Considering Figure 9.3: Points behind the fixation 
point, such as A, have positive disparity. 


O,= f- a. (9.1) 
Points in front of the fixation point, such as B, have 
negative disparity. 

o,= f-b. (9.2) 
The smallest perceptible change in angular dispar- 
ity between two small objects is referred to as ste- 
reo acuity, 5 [65]. The advantage of defining stereo 
acuity as an angle is that it can be assumed to be 
constant regardless of the actual distance to and 
between the points A and B. However, it is also 
helpful to know how this translates in terms of the 
smallest perceived distance between objects at the 
typical viewing range of a desktop three-dimen- 
sional display. This will allow us to compare the 


Horopter 


a 


Figure 9.3 Angular disparity is defined relative to 
the current fixation point. 


296 Three-dimensional display systems 


ability of the eye to perceive depth with the ability 
of different displays designs to reproduce it. 

Considering Figure 9.4 when points A and C 
can just be perceived to be at a different depth, then 
stereo acuity will be 


d=a-c. (9.3) 
Various studies [28,31,65] show the eye is able to 
distinguish very small values of 6, as little as 1.8 in. 
(seconds of arc). As the exact limits vary between 
people, Diner and Fender [8] suggest that a practi- 
cal working limit is to use a value of stereo acuity 
5=20in. Using this value, we can calculate the size 
of the smallest distinguishable depth difference 
at a given distance from the observer. We choose 
m=750mm as the distance from the observer as 
a common viewing distance for desktop stereo- 
scopic displays and use an average eye separation, 
e=65mm. 

Calculating along the center line between the 
visual axes, we can find the minimum distinguish- 
able depth, n, at distance m by considering points 
A and C. The angle a can be calculated as 


a= 2arctan{ 2xarctan( 27°) (9.4) 
750 


m 


by the definition of stereo acuity we know that 


tan(c/2) = tan 2?) tan( 202.) (9.5) 


and if n is the distance between A and C we also 
know that 


(9.6) 


tan(ci2)=( a } 


m+n 


Figure 9.4 Stereo acuity defines smallest depth 
difference an observer can perceive. 


rearranging Equation 9.6 we have 


ie ( e/2 )- s 
tan(c/2) 
Substituting Equation 9.4 in Equation 9.5 and 
using the result to solve Equation 9.7 gives n=0. 
84mm. 

We can conclude that a person with a stereo 
acuity of 20in. and an eye separation of 65 mm will 
be able to perceive depth differences between small 
objects of just 0.84mm at a distance of 750mm 
from the eyes. 

It is also possible to calculate a geometric value 
for the furthest possible range of stereo vision 
which occurs when the vergence angle between the 
two visual axes is equal to or less than the stereo 
acuity. 

The distance m from the observer to the point A 
when the angle a= is given by 


( e/2 
m= 
tan(a/2) 


Again taking 5=20in. and e=65mm we get 
m=670m. 

This means that points such as C at a distance of 
670m or more from the observer will not be able to 
be distinguished in distance from A using binocu- 
lar vision alone. Just before this limit is reached, 
the smallest distinguishable depth difference 
between points will have increased to over 300m 
and it is clear only gross differences in depth will 
be perceived at the furthest limits of stereoscopic 
perception. 

To summarize the above, binocular vision uses 
the stereoscopic depth cue of retinal disparity to 
perceive an object’s depth relative to the fixation 
point of the two eyes. At close and near range 
this provides a high degree of depth discrimina- 
tion and even at tens of metres from the observer 
enables relative depth perception for larger objects. 


(9.7) 


(9.8) 


9.2.3 Depth perception in electronic 
stereoscopic images 


Wheatstone [60] demonstrated that the stereo- 
scopic depth sensation could be recreated by show- 
ing each eye a separate two-dimensional image. The 
left and right eye views should be two-dimensional 
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planar images of the same scene from slightly dif- 
ferent viewpoints; the difference in the viewpoints 
generates disparity in the images. When the images 
are subsequently viewed, the observer perceives 
depth in the scene because the image disparity cre- 
ates a retinal disparity similar, but not identical, to 
that seen when looking directly at a natural scene. 

Wheatstone was able to demonstrate this effect 
by building the first stereoscope and many devices 
have since been invented for stereoscopic image 
presentation each with their own optical configu- 
rations. Reviews of these devices and the history 
of stereoscopic imaging are available in several 
sources [23,30,32,40,53]. 

To help characterize and compare the perfor- 
mance of different electronic three-dimensional 
display designs, we will consider the perception 
of depth in planar stereo image pairs and how this 
differs from the stereoscopic perception of depth in 
the natural world. 

A key physiological difference is that although 
the eyes need to verge off the stereoscopic image 
plane to fixate points in depth, their accommoda- 
tion state must always keep the image plane itself 
in focus. This requires the observer to be able 
to alter the normal link between vergence and 
accommodation and is one reason why images 
with large perceived depth are hard to view. This 
suggests that the perceived depth range in stereo- 
scopic image pairs needs to be limited to ensure 
the observer will find a stereo image pair com- 
fortable to view. 

While there are several studies of the comfort- 
able perceived depth range on electronic three- 
dimensional displays [17,64,65], it can be difficult 
to factor out variables relating to display perfor- 
mance from the results. Display variables include 
absolute values, and inter-channel variations, of 
brightness and contrast in addition to stereoscopic 
image alignment and crosstalk. All of these can 
affect the comfortable range of perceived depth on 
a particular display. For example, high-crosstalk 
displays generally do not support deep images as 
the ghosting effect becomes more intrusive to the 
observer as screen disparity is increased. 

An analysis of the geometry of perceived depth 
assuming a display with ideal properties helps 
identify the geometric variables affecting per- 
ceived depth independently of the display used. 
Geometric models of perceived depth have been 
studied by Helmholtz [23] and Valyus [53] and 


more recently in [8,24,27,64]. We present a sim- 
plified model in Figure 9.5 for discussion pur- 
poses which helps emphasize the key geometric 
variables affecting the perception of stereoscopic 
images. 

Figure 9.5 shows the geometry of perceived 
depth for a planar stereoscopic display; for sim- 
plicity, we consider the geometry along the center 
line of the display only; more general expressions 
are available [23,64]. The viewer’s eyes, Land R, are 
separated by the interoccular distance, e, and are at 
a viewing distance, z, from the display plane. The 
screen disparity between corresponding points in 
the left and right images, d, is a physical distance 
resulting from the image disparity which is a logi- 
cal value measured in pixels. Image disparity is 
constant for a given stereo pair; however, screen 
disparity will vary depending on the characteris- 
tics of the physical display. Screen disparity in a 
pair of aligned stereo images is simply the differ- 
ence of the physical x coordinates of correspond- 
ing points in the right x, and left x; images: 


d = x,-X, (9.9) 


Two key expressions relating screen disparity to 
perceived depth can be derived from the similar 
triangles in Figure 9.5. Perceived depth behind the 
screen plane, i.e., positive values of d, is given by 


(9.10) 


Pp; perceived depth 
z, viewing distance 
e, eye separation 

d, screen disparity 


Figure 9.5 Perceived depth behind (top) and in 
front (bottom) of the display plane. 
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Perceived depth in front of the screen plane, i-e., 
negative values of d, is given by 


Pa. (9.11) 
(gle 


Equations 9.10 and 9.11 provide several insights into 
the geometric factors affecting perceived depth: 


e z, the viewing distance to the display. Perceived 
depth is directly proportional to the view- 
ing distance, z. Therefore a viewer looking 
at the same stereoscopic image from differ- 
ent distances perceives different depth. How 
important this is, is application dependent, but 
applications such as CAD, medical imaging 
and scientific imaging may critically depend on 
accurate and consistent depth judgements. 

e d, the screen disparity. Perceived depth is also 
directly proportional to screen disparity, d. 

The screen disparity for any given stereoscopic 
image varies if the image is displayed at differ- 
ent sizes, either in different size windows on 
the same screen or on different size screens. 
Again this is important to note in applications 
where depth judgement is a critical factor. It 
means stereoscopic images are display depen- 
dent and an image displayed on a larger display 
than originally intended could exceed com- 
fortable perceived depth limits or give a false 
impression of depth. 

e e, individual eye separation. Perceived depth is 
inversely proportional to individual eye separa- 
tion which varies over a range of approximately 
55-75 mm with an average value often taken as 
65mm. Children can have smaller values of eye 
separation and therefore see significantly more 
perceived depth in a stereoscopic image than 
the average adult. It may be particularly impor- 
tant to control perceived depth in systems 
intended for use by children, as they will reach 
the limits of their vergence/accommodation 
capabilities sooner than most adults. 


For display design, controlling these variables so 
that the viewer sees a consistent representation 
of depth ideally requires tracking head position, 
identifying eye separation and controlling screen 
disparity. These are challenging goals in addition 
to designing a display with imaging performance 
as good as a two-dimensional display. 


9.2.4 Benefits of binocular vision 


An important question is what advantages does 
binocular vision provide in the real world? As a 
visual effect, it clearly fascinates the majority of 
people when they see a three-dimensional picture. 
Beyond the attractive nature of stereoscopic three- 
dimensional images, they provide the following 
benefits over monocular vision: 


e Relative depth judgement. The spatial relation- 
ship of objects in depth from the viewer can be 
judged directly using binocular vision. 

e Spatial localization. The brain is able to con- 
centrate on objects placed at a certain depth 
and ignore those at other depths using binocu- 
lar vision. 

e Breaking camouflage. The ability to pick out 
camouflaged objects in a scene is probably 
one of the key evolutionary reasons for having 
binocular vision [47]. 

e Surface material perception. For example, lustre 
[23], sparkling gems and glittering metals are 
in part seen as such because of the different 
specular reflections detected by the left and 
right eyes. 

e Judgement of surface curvature. Evidence sug- 
gests that curved surfaces can be interpreted 
more effectively with binocular vision. 


These benefits make stereo image display of consid- 
erable benefit in certain professional applications 
where depth judgement is important to achiev- 
ing successful results. In addition, the effect of 
stereopsis is compelling enough that stereoscopic 
images have formed the basis of many entertain- 
ment systems. 


9.3 THREE-DIMENSIONAL DISPLAY 
DESIGNS USING MICRO-OPTICS 


The possible combinations of LCD and micro- 
optics provide many degrees of freedom for display 
design; the ideal three-dimensional display design 
will depend on specific application requirements. 
However, there are characteristics that all display 
designs should give consideration to and we briefly 
review these here. 

There is a need to compare the basic image qual- 
ity of a three-dimensional design to that achieved 
by current two-dimensional displays; ie., the 
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two-dimensional characteristics of a three-dimen- 
sional display should match the performance of 
two-dimensional displays as closely as possible. 
Key characteristics are 


e Brightness typical of a current LCD display is 
150 cdm~? 

e Contrast typical of a current LCD display is 
300:1 

e Color reproduction, measured white points 
and measured CIE coordinates of primaries 


These values are typical of current two-dimensional 
displays but are clearly a moving target as two- 
dimensional displays improve. 

In addition, there are a number of important 
characteristics unique to three-dimensional dis- 
plays. The first is that two-dimensional character- 
istics need to be matched between all the viewing 
windows of the three-dimensional display. Each 
viewing window should also be matched spatially 
and temporally so that there is no noticeable posi- 
tion or time difference between corresponding 
images. 

Inter-channel crosstalk appears to an observer 
as a ghost image, which will be particularly visible 
at high-contrast edges in images. It is an unwanted 
feature in most display designs because high values 
of crosstalk are known to be detrimental to three- 
dimensional effect, particularly on high-contrast 
displays showing large values of perceived depth 
[42]. Ideally crosstalk measurements need to be 
no more than 0.3% if the ghosting effect is to be 
imperceptible to an observer. Crosstalk, although 
often due to optical effects in the display, can also 
result from poor separation of the two image chan- 
nels in the display driving electronics, image com- 
pression formats or the camera system generating 
the images. 

An observer of a two-dimensional display will 
usually expect to be able to see a good quality 
image at a wide range of positions in front of the 
display. Because of the need to direct images sepa- 
rately to the two eyes, many three-dimensional 
displays have a more limited viewing freedom. 
Consideration needs to be given to the targets for 
lateral, vertical and perpendicular freedom in a 
display design. Three-dimensional display systems 
capable of supporting multiple observers will often 
do so at the expense of viewing freedom. Improved 
viewing freedom can be found in designs with 


multiple viewing windows or using head tracking 
to steer viewing windows to follow the observers’ 
head movements. When head tracking is used, a 
design needs to consider targets for the maximum 
supported head speed as this directly determines 
key tolerances. 

Some displays have the capability to operate 
in either three-dimensional or two-dimensional 
modes switching electronically or mechanically 
between the two. In this case, the image quality in 
each mode needs to be considered against the per- 
formance of a standard two-dimensional display, 
as a display in three-dimensional mode will often 
have different optical performance to the same dis- 
play in two-dimensional mode. 

The capability of a three-dimensional dis- 
play to represent perceived depth is probably the 
single most important design target; however, we 
will return to how to quantify and compare this 
between displays after presenting details of repre- 
sentative three-dimensional display designs. 

We would like three-dimensional displays to 
provide the ability for the observer to accommo- 
date naturally at the fixation point. However, this 
is not a feature supported in stereoscopic images 
and has been attempted in very few display designs. 


9.3.1 Stereoscopic systems 


Stereoscopic displays require users to wear a 
device, such as analysing glasses, that ensures left 
and right views are seen by the correct eye. Many 
stereoscopic display designs have been proposed 
and there are reviews of these in several sources 
[30,32,34,40,53]. Most of these are mature systems 
and have become established in several profes- 
sional markets but suffer from the drawback that 
the viewer has to wear, or be very close to, some 
device to separate the left and right eye views. This 
has limited the widespread appeal of stereoscopic 
systems as personal displays for home and office use 
even when the three-dimensional effect is appeal- 
ing. However, stereoscopic displays are particu- 
larly suited to multiple observer applications such 
as cinema and group presentation where directing 
individual images to each observer becomes diffi- 
cult compared to providing each observer with a 
pair of analysing glasses. 

As stereoscopic display systems are well described 
elsewhere, we limit ourselves here to a summary of 
the major types using electronic displays: 
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A spatially multiplexed image (SMI) with left (L) and right (R) image pixels is 
placed behind a patterned micro-polariser (uPol) element. 


Polarized 
pixels 


Multiplexed 
: pixels 


SMI uPol 


Right (R) pixels 


Left (L) pixels 


Viewing glasses 


When viewed with polarized glasses the P1 polarized pixels 
are seen only in the left eye and P2 polarized in the right 


Figure 9.6 The micro-polarizer stereoscopic 


@ Wheatstone mirror stereoscopes using CRT 
displays or LCD displays. 

© Polarized glasses in combination with a 
method of polarizing the two views. 

e Shutter glasses working in synchronization 
with a view switching display. 

e Analglyph glasses analysing different color 
channels to obtain the images. 

e Brewster stereoscopes, of which head mounted 
displays are up to date examples. 


A series of stereoscopic display designs that use 
polarizing micro-optics have been produced 
[14], as shown in Figure 9.6. The micro-optics 
split a single display into two differently polar- 
ized views, which are viewed correctly by left 
and right eyes when the observer wears a pair 
of analysing polarized glasses. This requires two 
half resolution views and may be achieved using 
a chequerboard pattern of image multiplexing 
and polarization as shown in Figure 9.6 as the 
spatially multiplexed image (SMI) and patterned 
micro-polarizer (Pol). 

A drawback of the design, particularly for direct 
view LCD-based displays, is the parallax between 
the display pixels and the micro-polarizer when 
the micro-polarizer is mounted over the LCD 
due to the layer of substrate between the two ele- 
ments forming the gap g in Figure 9.6. If the head 
moves from the nominal viewing position, part of 


the adjacent view’s pixel becomes visible result- 
ing in crosstalk. One way to reduce this is to use 
interlace the images in alternate rows so at least 
lateral head movement is not affected by parallax. 
As noted by Harrold [20] this problem can only be 
fully solved in the long term by manufacturing the 
micro-polarizer element within the LCD pixel cells 
reducing the parallax between polarizer and pixel. 


9.3.2 Auto-stereoscopic systems 


Auto-stereoscopic displays are those that do not 
require the observer to wear any device to sepa- 
rate the left and right views and instead send them 
directly to the correct eye [6]. This removes a key 
barrier to acceptance of three-dimensional dis- 
plays for everyday use but requires a significant 
change in approach to three-dimensional display 
design. Auto-stereoscopic displays using micro- 
optics in combination with an LCD element have 
become attractive to display designers and sev- 
eral new three-dimensional display types are now 
available commercially. The key optical reasons 
[62] for combining micro-optics with LCD ele- 
ments are 


e LCDs offer pixel position tolerances better than 
0.1 um. 

e LCD pixels, unlike CRT pixels, have high posi- 
tional stability. 


9.3 Three-dimensional display designs using micro-optics 301 


e LCD elements have carefully controlled glass 
thickness. 


Auto-stereoscopic displays have been demon- 
strated using a range of optical elements in combi- 
nation with an LCD including 


e Parallax barriers, optical apertures aligned 
with columns of LCD pixels. 

e Lenticular optics, cylindrical lenses aligned 
with columns of LCD pixels. 

e Micropolarizers are found in several auto-ste- 
reoscopic three-dimensional display designs. 

e Holographic elements have been used to create 
real images of a diffuse light source. 


In the following, we introduce how these elements 
are used in auto-stereoscopic three-dimensional 
display designs including two-view and multi-view 
designs. We begin by looking at auto-stereoscopic 
two-view designs using twin-LCD elements. 


9.3.3 Two-view twin-LCD systems 


A successful approach to building high-quality 
auto-stereoscopic displays has been to use two 
LCD elements and direct the image from one to 
the left eye and from the other to the right eye; the 
principle is illustrated in Figure 9.7. Several designs 
have adopted this approach including [12,13,22]. 
Ezra [12,13] describes one of the designs, which 
produces bright, high-quality, full color moving 
three-dimensional images over a wide horizontal 
viewing range. As shown in Figure 9.8, the dis- 
play produces two viewing windows using a single 


Each eye is able to see 
the appropriate image on 
the display through a 
viewing “window” 


Display plane 


« 


Left and right images are 
displayed in different sets 
of pixels on the display 


Figure 9.7 Two-view displays create two viewing 
windows. 


illuminator. The arrangement of optical elements 
generates horizontally offset images of the illumi- 
nator at a nominal viewing distance to form the 
viewing windows. An observer's eye placed in one 
of the viewing windows will see an image from just 
one of the LCD elements. 

Ifa stereo pair of images is placed on the left and 
right LCD elements, respectively, then an observer 
will see a stereoscopic three-dimensional image. 
The image appears in the plane of the left LCD as 
the observer looks at the display and depth is per- 
ceived in front and behind this plane. As the two 
LCDs are seen separately, each eye has a full reso- 
lution image and the interface is simply two syn- 
chronized channels of digital or analogue video 
which can be generated at low cost on a desktop 
PC system. 

This basic configuration can be enhanced in 
several ways: if the light source is moved then 
the viewing windows can be steered to follow the 
observer’s head position. In order to implement 
window steering, new technologies for tracking 
head position have also been developed [25]. The 
effect of implementing head tracking linked to 
window steering is to increase the viewing free- 
dom of the display and if the images are updated 
the design has been demonstrated to provide a full 
look-around effect. This allows the observer to look 
around the display and see different views of the 
scene as they would in the natural world. Image 
generation for look-around can be implemented 
by using a three-dimensional computer graphics 
system to generate the new views when given head 
tracking position information. 

Another possible development [13] is to have 
multiple light sources providing multiple stereo 
views to multiple viewers. This could be imple- 
mented either by sending the same image pair to 
each viewer, or by time slicing the light source 
and the displays to send a different image to each 
viewer in rapid succession. 

The system uses bulk optics and therefore has 
a large footprint, particularly as the LCD display 
diagonal size increases. This led to the micro-optic 
twin-LCD display [61] which provides the same 
effect in a smaller footprint and is more practical 
for scaling to larger display sizes. 

The micro-optic twin-LCD display is illustrated 
in Figure 9.9. The two LCD elements remain in the 
design with a half mirror acting as a beam combiner 
between them. The arrangement of optical elements 
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Figure 9.8 The twin-LCD display [12]. 
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Figure 9.9 The micro-optic twin-LCD display [61]. 


behind one LCD panel directs light so that it forms 
one viewing window at a nominal viewing distance 
from the display, another is formed adjacent to this 
from the backlight of the other LCD panel. As with 
the bulk optic display the observer placing their 
eyes in the viewing windows will see the appropri- 
ate image in each eye and experience a stereoscopic 
three-dimensional effect. 

As discussed in Reference [61] the micro-optic dis- 
play produces a better viewing window profile than 
the bulk-optic display. This is because the micro-optics 
form a wider and more even illumination distribu- 
tion for each viewing window so that, when steered, 
the windows can be moved further laterally before 
aberrations reduce their quality. This also results in 


Lens array 


a ee ee ee Slit array 
es Backlight 


side lobes of better quality, which in untracked dis- 
plays can be used by additional observers. 


9.3.4 A note on viewing windows 


One of the key influences on the perceived perfor- 
mance of auto-stereoscopic displays is the quality 
of the viewing windows that can be produced at 
the nominal viewing position. Degradation of the 
windows due to unresolved issues in the optical 
design can lead to flickering in the image, reduced 
viewing freedom and increased inter-channel 
cross-talk. All of these reduce the quality of view- 
ing experience for observers in comparison to the 
two-dimensional displays they are used to using. 
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Figure 9.10 Viewing freedom in an auto-stereoscopic display [61]. 
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Figure 9.11 The characteristics of a viewing window [63]. 


In addition, in head tracked systems degraded 
window quality can lead to harder constraints on 
the accuracy and response speed of the tracking 
and window steering systems, increasing system 
costs [25]. 

The auto-stereoscopic displays considered 
so far produce two viewing windows in space 
typically at a nominal distance from the dis- 
play in a plane parallel to the display surface, as 
shown in Figure 9.7. Although often illustrated 
in two dimensions, the viewing windows have a 
three-dimensional shape and from above appear 
as diamonds tapering away from the nominal 
viewing plane as shown in Figure 9.10. As long 


as an observer’s pupils stay within these dia- 
monds, and the display is showing a stereo image, 
the observer will see a three-dimensional image 
across the whole of the display. 

Experimentally the window intensity profile 
can be determined by measurements using a 1mm 
pinhole, a photometric filter and a detector. To 
fully characterize a display performance, the pro- 
file measurements should be repeated at a range 
of positions vertically and longitudinally offset 
from the nominal window position. The variables 
characterizing the quality of the viewing windows 
are discussed in [61] and are summarized here in 
Figure 9.11. 
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The useful width of the window determines how 
far an observer can move before the image quality 
degrades. Larger useful width, up to the interoc- 
cular separation, typically 65mm, provides more 
comfortable viewing in fixed position displays as 
there will be a small but useful lateral range of 
head positions at which a good three-dimensional 
image can be seen. 

A systems benefit of wider viewing windows is 
that it helps relax the tolerances required for win- 
dow steering and tracking mechanisms in head- 
tracked displays such as [12,61]. This is because a 
wider viewing window allows more time and/or 
distance before the steering and tracking mecha- 
nisms have to respond to user head movement in 
order to prevent the user moving out of the useful 
width and seeing a degraded image on the display. 


9.3.5 Two-view single-LCD systems 


Even with the advantages of a micro-optic design 
twin-LCD three-dimensional displays have a com- 
ponent cost that must include two LCD elements. 
This cost is acceptable in some applications when 
image quality is the key requirement; however, for 
the mass market, ie., personal office and home 
use, it is desirable to find display designs based on 
a standard single LCD element. 

We will group the single-LCD auto-stereo- 
scopic designs by the type of optical element used 


Parallax 
barrier 


Display 


to generate the viewing windows, beginning with 
the parallax barrier. 


9.3.5.1 PARALLAX BARRIER DESIGNS 


Typical emissive displays have pixels with dif- 
fuse radiance, that is they radiate light equally 
in all directions. To create a twin-view auto- 
stereoscopic display, half the pixels must only 
radiate light in directions seen by the left eye and 
half the pixels in directions seen by the right eye. 
The parallax barrier is perhaps the simplest way 
to do this and works by blocking light using strips 
of black mask. 

The principle of the two-view parallax bar- 
rier is illustrated in Figure 9.12. The left and right 
images are interlaced in columns on the display 
and the parallax barrier positioned so that left and 
right image pixels are blocked from view except in 
the region of the left and right viewing windows, 
respectively. Although not illustrated the viewing 
windows repeat in side lobes to each side of the 
central viewing position and can be used by more 
than one observer if the optical quality remains 
high enough. 

The pixels and barrier are arranged so the center 
of each pair of left and right view pixels is visible at 
the center of the viewing windows. The geometry 
defining the design of the parallax barrier pitch, b, 
can then be determined from considering similar 
triangles in Figure 9.12. 


e Right viewing 
window 


Left viewing 
window 


e, window width 

i, pixel pitch 

b, barrier pitch 

& gap between 
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z, distance to 
viewing windows 


Figure 9.12 The principle of the front parallax barrier. 
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b 21 


esti. (9.12) 
Z-g Zz 
which can be rearranged to give 
b=2i(*-8) (9.13) 
Zz 


The result, Equation 9.13, is that the barrier pitch 
for a two-viewing window display is just less than 
twice the pixel pitch on the display. This small 
difference between the pixels and the barrier 
pitch accounts for the variation in viewing angle 
between the eyes and the pixels across the display 
and is often referred to as viewpoint correction. 

Viewing distance, z, for the best quality viewing 
windows is another design factor and again from 
similar triangles in Figure 9.12, we can deduce a 
geometric relationship for this. 


e 


_ (9.14) 
&§ 2-8 
which can be rearranged to give 
z=a(*"} (9.15) 
I 


The window width is typically set to the aver- 
age eye separation, e=65mm, the pixel pitch, i, 
is defined by the display and the gap, g, between 
display and barrier is defined by the thickness of 
the front substrate on the LCD. For example, pixel 
width might be of the order i=0. 1mm and the 
gap, including front substrate and polarizer, g=1. 
15mm. The result is relatively little control of the 
closest possible viewing distance and given current 
LCD substrate thickness many current parallax- 
barrier-based displays have optimal viewing dis- 
tances of z=750mm. 

More recent two-dimensional displays could 
use a substrate such as Corning Eagle?°°° with 
thickness from 0.4 to 0.63 mm and given a polar- 
izer of thickness 0.2 mm may then be able to reduce 
viewing distance for a front parallax barrier to 
zZ=390 mm. This compares favorably with the typi- 
cal viewing distance of two-dimensional displays 
of 300-350mm although care would be needed 
to avoid artefacts at the edges of the screen plane 
where the viewing angle increases with decreasing 
viewing distance. 


Variations on the basic twin-view parallax 
barrier design and further practical issues are 
described by Kaplan [29] including a discussion of 
multi-view parallax barrier displays and aperture 
design. 

Okoshi [40] notes that problems with paral- 
lax barriers include the reduced brightness due 
to blocking the light from pixels, reflection from 
the glass surface of the parallax barrier and the 
design of the parallax apertures to avoid diffrac- 
tion problems. However, these disadvantages have 
been addressed and recent LCD-based designs 
overcome the first two problems by using bright 
light sources and antireflection-coated optics. The 
result is parallax barriers are now widely used for 
two-view displays such as described [62,63] and 
illustrated in Figure 9.13. 

The diffraction problem is more serious but 
has also recently been addressed. An ideal display 
would have viewing windows described by a top 
hat function; however, in practice they have the 
characteristics shown in Figure 9.11. A number of 
factors determine this and an important one is the 
detailed design of the parallax barrier apertures, w, 
shown in Figure 9.13. A wider aperture results in a 
brighter image but reduces the geometric perfor- 
mance of the aperture and creates less well-defined 
windows. A narrow aperture results in a less bright 
image with better window definition; however, too 
narrow an aperture suffers from diffraction effects 
which in turn results in less well-defined windows. 
In both cases, the crosstalk performance, useful 
width and uniformity of intensity at the viewing 
window are affected. 

A detailed study of the barrier position, aperture 
design and related diffractive effects is presented 
in [35,62]. In [62] a comparison was made between 
placing the parallax barrier behind and in front of 
the LCD element. The analysis uses a model of the 
parallax barrier accounting for Fresnel diffraction 
and compares this to a set of experimental mea- 
surements. Placing the parallax barrier behind the 
display results in lower crosstalk while placing it in 
front of the display has very much better intensity 
uniformity and useful width at the window plane. 
These factors are decisive for tracking displays and 
hence the front position was adopted to build a 
single-LCD observer tracking display [62]. 

In [35] several apodization modifications to the 
parallax barrier are analysed; these include soft 
aperture edges, multiple sub-apertures at aperture 
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Figure 9.13 Detail of a single-LCD front parallax barrier design [62]. 


edges and combinations of the two techniques. 
The analysis concluded that choosing the correct 
apodization can make a substantial improvement 
to the window profile improving both the cross- 
talk performance and viewing freedom of the 
display. In particular, crosstalk of less than 1% is 
theoretically achievable using an improved paral- 
lax element; this is a significant improvement over 
the value of 3.5% achieved using unmodified aper- 
tures. These new studies show it is now possible to 
overcome the limitations of parallax barriers iden- 
tified by Okoshi. 

A practical problem encountered by users of 
two-view parallax barrier displays without head 
tracking is how to find the best viewing position. 
One reason is the parallax barrier produces not 
just the central two viewing windows but also 
repeated lobes to each side of these as illustrated in 
Figure 9.14a. An observer in position A will see an 
orthoscopic image (left image to left eye and right 
image to right eye) as will an observer in position 
D. However, an observer in the intermediate posi- 
tion C sees a pseudoscopic image (the left image in 
the right eye and the right image in the left eye). 
This causes problems as typically pseudoscopic 
images show false depth effect and it can be hard 
for novice observers to determine if they are seeing 


a correct three-dimensional image or not. A num- 
ber of devices have been proposed to help observ- 
ers determine when they are in the correct viewing 
position; the VPI (viewing position indicator) dis- 
play described in [62,63] achieves this by integrat- 
ing an indicator into the parallax element. 

The parallax barrier in the VPI display is 
divided into two regions: the image region, which 
is most of the display, and the indicator region, 
which may cover just the bottom few rows of pixels 
on the display. The result is shown in Figure 9.14a 
and b, respectively. In the image region the con- 
ventional barrier design allows the left and right 
views to be seen at the nominal viewing position 
A. In the indicator region, the display shows a pat- 
tern of red and black stripes and the barrier design 
is modified so that the indicator region shows 
black to both eyes only when the observer is in a 
position to see an orthoscopic three-dimensional 
image as at A. If the observer is approaching, as 
at B, or in, as at C, a pseudoscopic region he will 
see red in one eye in the indicator region indicat- 
ing he should move laterally until returning to the 
orthoscopic zone. A drawback of the VPI design is 
that when the observer is in viewing zone D, she 
can see an orthoscopic image but the indicator will 
still show red. However, the observer is guaranteed 
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Figure 9.14 The VPI display operation (a) in the image region and (b) in the indicator region [62]. 


that whenever a black indicator region is seen, he 
will see an orthoscopic image on the display and 
this seems a reasonable trade-off. 

The indicator region is implemented by using 
a barrier pitch in the indicator region double that 
used in the image region. As a result, the VPI dis- 
play requires little additional design or manufac- 
turing cost and uses only a few lines of pixels to 
display the appropriate indicator pattern. It has 
the benefit that once the parallax barrier is aligned 
for image viewing, the indicator mechanism is 
automatically aligned. The VPI also works to help 
guide observers find the best longitudinal viewing 
position if the aperture width, w, is kept the same 
in both the image and indicator regions of the par- 
allax element. 

A range of designs using parallax barrier optics 
in combination with LCD elements has been pro- 
posed, prototyped and commercialized. 

A large range of display designs was developed 
using parallax barriers [19]. One example uses both 


a rear and a front parallax barrier with the aim of 
reducing crosstalk, although no window profile 
measurements are given to say how successful this 
was. Because the combination of two parallax bar- 
riers reduces display brightness, the rear barrier 
was mirror coated on the side facing the illumi- 
nator to recirculate light. A further design using 
just a rear parallax barrier places an electronically 
switchable diffuser between the parallax barrier 
and the LCD element. This allows instantaneous 
switching between two-dimensional and three- 
dimensional modes and if the diffuser is program- 
mable also allows three-dimensional windows 
to appear on a two-dimensional display. Several 
designs also combine a window steering mecha- 
nism and head tracker to increase lateral viewing 
freedom; one of these [26] uses an electronically 
programmable LC parallax barrier. 

A design also based on an electronically pro- 
grammable parallax barrier is described by Perlin 
[44-46]. A key goal for the design is to steer the 
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viewing windows to track the viewer in three 
dimensions by varying the pitch and aperture of 
the parallax barrier in real time. The aim is to gen- 
erate real time viewpoint correction so the viewer 
can vary position and still see a three-dimensional 
image across the whole display surface. The poten- 
tial benefit of the design is in extending longitu- 
dinal movement with respect to the display and 
it is also capable of accounting for head rotation, 
which effectively varies the observer's eye separa- 
tion. The design is relatively complex and before 
choosing this approach, it would be wise to make a 
comparison with the longitudinal freedom already 
available from a fixed aperture display with good 
quality viewing windows. In practice, realizing the 
display presents a number of challenges including 
the optical quality achievable from the program- 
mable parallax element and the speed and latency 
targets with which the tracking and steering mech- 
anisms need to work. 


9.3.5.2 LENTICULAR ELEMENT DESIGNS 


Lenticular elements used in three-dimensional 
displays are typically cylindrical lenses arranged 
vertically with respect to a two-dimensional dis- 
play such as an LCD. The cylindrical lenses direct 
the diffuse light from a pixel so it can only be seen 
in a limited angle in front of the display. This then 


Lenticular 
element 


allows different pixels to be directed to either the 
left or right viewing windows. 

The principle for a two-view lenticular element 
stereoscopic display is illustrated in Figure 9.15 
and described in [50]. This shows the geometry for 
a viewpoint-corrected display where the pitch of 
the lenticular is slightly less than the pitch of the 
pixel pairs. As with parallax barrier displays the 
effect of viewpoint correction is to ensure pixels at 
the edge of the display are seen correctly in the left 
and right viewing windows. The lenticular pitch 
needs to be set so that the center of each pair of 
pixels is projected to the center of the viewing win- 
dows and this can be found by considering similar 
triangles where 


(9.16) 


(9.17) 


=a =F} 
Z 


Typically, the pixel pitch i is set by the choice of 
two-dimensional display and the minimum focal 
length, f, determined in large part by the substrate 
thickness on the front of the display. The view- 
ing distance can again be derived from similar 
triangles: 


e, eye separation 
and window width 

i, pixel pitch 

Z, lenticular pitch 

Ff focal length 

z, distance to 
viewing windows 


Figure 9.15 Front lenticular auto-stereoscopic display principle [50]. 
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Lag & (9.18) 


which can be easily rearranged to give 


z=1(£+1} 
1 


Typically, the window width for a two-view system 
is taken to be the average eye separation, e=65mm, 
to give some freedom of movement (up to e/2) 
around the nominal viewing position. Combining 
this factor with the display-related values of i and 
fit may be that there is again little choice over the 
closest possible viewing distance. 

Lenticular elements have been used less often 
than parallax barriers in recent two-view display 
designs; one exception is the range of displays 
designed by the DTI corporation. 

The DTI display design described by Eichenlaub 
[10,11] uses light guide and lenticular elements 
behind an LCD display to generate light lines that 
are functionally equivalent to having a rear paral- 
lax barrier. The principle of creating viewing win- 
dows using the light lines is shown in Figure 9.16. 
The pitch required for the light lines can be calcu- 
lated using similar triangles as for the parallax bar- 
rier example discussed earlier. 


(9.19) 


et (9.20) 
Zz 


Z+g 


Light lines Display 


which can be rearranged to give 


p=2( **8), 
Z 


In this case, the pitch of the light lines, b, is slightly 
larger than twice the pixel pitch to achieve view- 
point correction. Again the gap, g, will determine 
viewing distance and is likely to be constrained by 
the substrate glass thickness when using an LCD. 
The backlight construction that creates the 
light lines is shown in Figure 9.17. A modified light 
guide uses a series of grooves to generate an ini- 
tial set of light lines, which are then re-imaged by 
the lenticular element to form a larger number of 
evenly spaced light lines in front of the light guide. 
A two-dimensional/three-dimensional switch- 
ing diffuser in front of the lenticular element is 
made of polymer dispersed liquid crystal (PDLC) 
which when on is transparent allowing the display 
to operate in three-dimensional mode. When the 
PDLC is off it becomes a diffuser, scattering light 
and preventing the initial set of light lines reaching 
the lenticular lens. The result is a diffuse illumina- 
tion for the display, which will operate with similar 
performance to a normal two-dimensional display. 
Various size displays have been constructed with 
5.6 and 12.1 in displays having crosstalk of 3% and 
6% and uniformity of 20% and 24%, respectively. 
The DTI design has the advantage of being able 
to electronically switch between two-dimensional 
and three-dimensional illumination modes as well 


(9.21) 


Viewing 
windows 


e, window width 

b, light line pitch 

i, pixel width 

g, gap between 
light lines and 
pixels 

z, distance to 
viewing windows 


Figure 9.16 The geometry of rear parallax illumination by light lines. 
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Figure 9.17 The DTI compact backlight allowing two-dimensional/three-dimensional illumination. 


as being small enough to be used in portable dis- 
play devices. In addition, there are no optical ele- 
ments in front of the display surface allowing the 
observer to directly view the LCD display. Against 
this are some trade-offs and the three-dimensional 
mode has higher crosstalk than a well-designed 
parallax barrier system. 

Other designs for single-LCD three-dimen- 
sional displays using lenticular optics include 
[37,38,43,61]. 

A novel design using micro-prism elements was 
proposed [48,49]. The D4D display uses an array 
of vertically oriented micro-prisms as the parallax 
element and the left and right images, vertically 
interlaced in columns, are directed to two viewing 
windows by the micro-prisms. A commercial dis- 
play based on this principle included a head track- 
ing device and both electronic image shifting and 
mechanical movement of the micro-prisms were 
investigated as ways to steer the viewing windows. 


9.3.5.3 MICRO-POLARIZER DESIGNS 


Displays using polarization to create light steer- 
ing optical elements have been proposed by several 
groups. The stereoscopic display design described 
by Faris [14,15] can also be configured to have an 
auto-stereoscopic mode by using a series of stacked 
micro-polarizer elements to create a switchable 
parallax barrier. However, despite this potential for 
auto-stereoscopic operation most of the commer- 
cial products from VREX have been stereoscopic 
systems. 


Harrold et al. describe display designs using 
micro-polarizers in [20,21]. The design exploits 
the polarized light output from an LCD element 
over which is created a patterned retarder array. 
A final polarizing layer is placed over the retarder 
array effectively creating a front parallax barrier 
and hence an auto-stereoscopic display. If the final 
polarizing layer is constructed so that it is remov- 
able, the display can be mechanically switched 
between a two-dimensional display mode and an 
auto-stereoscopic three-dimensional display mode 
(Figure 9.18). 

Key to the success of this design is the construc- 
tion of the patterned retarder array to an accu- 
racy of better than 1 part in 2000 for the 13.8 in 
XGA display prototype. This was achieved using 
a process based on standard LCD manufacturing 
techniques to create a manufacturable patterned 
retarder array that is front mounted onto the LCD 
element. 

A stereoscopic display design is also described 
by Harrold in [20] where the patterned retarder 
and polarizer are constructed inside the LCD ele- 
ment to avoid the parallax problems of the Faris 
design [14,15]. A prototype LC cell demonstrated 
the feasibility of this approach. 

A micro-polarizer display described by Benton 
[1,2] uses a combination of polarization and 
bulk optics to create two viewing windows that 
can be steered electronically if a suitable head 
tracker is available. An LCD panel with the ana- 
lysing polarizer removed acts as an electronically 
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Without the analysing polarizer the 
display operates as a normal 2D 
display as the eye is insensitive to 
polarized light 


With the analysing polarizer at 
135° in place the retarder acts 
as a parallax barrier as shown 

and the display operates in 
autostereoscopic 3D mode 


Analysing os . 
polarizer 135° orientation 
Substrate 
| 45° axis 
Patterned see 8 
retarder [-] 90° axis 
Polarizer | 45° orientation 
LCD Pixel layer 
panel 
Polarizer 135° orientation 


Light that passes through the LCD emerges with linear 
polarization at 45° and is either rotated a further 
45 or 90° by patternedretarder element 


Figure 9.18 The Sharp micro-polarizer display with two-dimensional/three-dimensional switching 


capability [20]. 


programmable polarizing light source: light com- 
ing from the light source LCD will be either rotated 
at 90° or not rotated. An illumination pattern of 
two blocks of light is displayed on the light source 
LCD, each polarized differently. A micropolar- 
izer array arranged as rows behind an image LCD 
display allows alternate rows of image to be illu- 
minated by differently polarized light and hence 
appear in the viewing windows for the left and 
right eyes. A large lens after the LCD produces an 
image of the viewer-tracking LCD (polarized light 
source) at the intended viewing distance of about 
1m creating the two viewing windows. 

Benton notes there can be problems with the 
lens (a Fresnel lens) creating Moire patterns in 
association with the image display LCD. In com- 
mon with many auto-stereoscopic displays the 
viewer has to be at or close to the nominal view- 
ing distance, which at 1m is significantly further 
than typical two-dimensional display viewing dis- 
tances. No measurements of crosstalk or window 
brightness uniformity are given 


9.3.5.4 HOLOGRAPHIC ELEMENTS 


Holographic optical elements (HOEs) have been 
used [51,52] to create three-dimensional dis- 
plays in conjunction with LCD elements. When 


illuminated the HOE acts to form the viewing 
windows. The HOE is arranged in horizontal 
strips to reconstruct a real image of a diffuse illu- 
minator; the strips are arranged so alternate strips 
reconstruct left and right viewing windows. When 
placed behind a display with two horizontally 
interlaced images, the observer will see an auto- 
stereoscopic image. 

A number of practical problems in the optical 
design are discussed in [51] and in particular color 
fringing due to the diffractive nature of the HOE 
could prove difficult to overcome. Otherwise, this 
design has several advantages and can be modi- 
fied to track users by moving the light source and 
also constructed so that it can be switched between 
two-dimensional and three-dimensional using a 
modified light source. 


9.3.6 Multi-view systems 


The viewing freedom of a three-dimensional dis- 
play is a key requirement in certain applications, 
for example public information kiosks, where ease 
of viewing is needed to attract and retain the atten- 
tion of passers-by. Multi-view systems, as in Figure 
9.19, provide viewing freedom by generating mul- 
tiple simultaneous viewing windows of which an 
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Figure 9.19 Multi-view displays create multiple 
viewing windows. 


observer sees just two at any time. Multi-view sys- 
tems can also support more than one observer if 
enough horizontal viewing freedom is available. 
Bulk optic multi-view displays have been devel- 
oped and are reviewed in the literature [7,36]. The 
display was designed to use temporal multiplexing 
of the view images and because the basic switch- 
ing speed and interface bandwidth of LCD displays 


Lenticular 
element 


Display 
pixels 


were not sufficient, this led to the use of highspeed 
CRT technology. 

Micro-optic multi-view designs using standard 
two-dimensional displays have been proposed 
where the images are spatially multiplexed. The 
Heinrich-Hertz-Institut has a well-established 
programme investigating lenticular three-dimen- 
sional displays and Borner [4] describes a number 
of multi-view designs. 

The principle for a multi-view LCD display 
using a front lenticular element, similar to the 
two-view lenticular design described previously, 
is illustrated in Figure 9.20. This shows a five-view 
lenticular display, where each pixel in every group 
of five pixels is directed to a different viewing win- 
dow. As with the two-view displays the system 
should be viewpoint corrected so that the viewing 
windows are aligned with pixels across the whole 
display. 

To use the display, five images are sliced verti- 
cally into columns and interlaced appropriately. 
The images will then be visible separately in the 
five viewing windows V1-V5 in Figure 9.20. The 
viewing windows can be designed as shown so 
pairs of image separated by one image, for example 
V2 and V4, are seen simultaneously by the left and 
right eyes and if these form a stereo pair, then an 
observer sees an image with stereoscopic depth. In 


Multiple 
viewing 
windows 
v2 
e V3 
V4 


e, eye separation 
i, pixel pitch 
J, lenticular pitch 


ff focul length 
z, distance to 
viewing windows 


Figure 9.20 The principle of amulti-view front lenticular auto-stereoscopic display. 
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addition, if the observer moves laterally they can 
see a different pair of images, for example V3 and 
V5, and therefore a different stereoscopic view of 
the scene. 

Using a similar geometrical argument as for 
two-view lenticular displays, the pitch of the lenses 
can be determined by 


1=Nyi(2—*) 
Z 


where N, is the number of viewing windows 
required. 

There are several drawbacks to the basic multi- 
view approach that are particularly apparent when 
electronic displays are used [55]. The first is there 
is a black mask between LCD pixels and this is 
imaged into dark lines between each view win- 
dow which is distracting to observers when the eye 
crosses a window boundary. Also images with any 
significant depth will result in an image-flipping 
artefact as the observer moves the eye across one 
view window and into the next. Finally as more 
views are used the horizontal resolution of the 
images decreases rapidly. To overcome these prob- 
lems, a new approach to multi-view LCD display 
was proposed [55]. 

Several multi-view systems based on lenticu- 
lar micro-optics and single LCD displays were 
proposed [54-56]. A significant step forward was 
made by positioning the lenticular array at an angle 
to the LCD pixel array; this mixed adjacent views 
reducing image flipping problems and spreading 
the effect of the black mask making it less visible. 
The other benefit of this design is that each view 
has a better aspect ratio; rather than splitting the 
display horizontally into many views both hori- 
zontal and vertical directions are split. 

The arrangement of one lenticule and the 
underlying pixels in the slanted lenticular design 
is shown in Figure 9.21. The slanted lenticular 
arrangement means that all pixels along “a” line 
such as a will be imaged in the same direction. In 
this case all view three pixels are seen in the same 
direction. The arrangement shown allows seven 
views to be interlaced on the display and imaged 
in different directions by the lenticule. As the eye 
moves from position “a” to “b” the eye sees a grad- 
ual transition from view 3 to view 4. At most view- 
ing positions the eyes will see a combination of 
more than one view; while this inherent crosstalk 


(9.22) 


limits the depth that can be shown on the display 
it does hide the transition between views at bound- 
aries and blurs the appearance of the black mask 
so that it is no longer an obvious visual artefact. 
For the seven-view display described in [56] the 
magnification of the lenticules is designed so that a 
viewer at a distance of approximately 700 mm from 
the display sees views 3 and 5 in left and right eyes 
respectively, i.e., views separated by one view form 
a stereo pair. 

Analternative design where the pixels are slanted 
instead of the lenticular element is described in 
[57]. However, such a major change to LC display 
design is unlikely to happen unless there is a sub- 
stantial worldwide market for three-dimensional 
displays or an advantage of slanted pixels for two- 
dimensional LCD operation is found. 

The multi-view display design [33] adopts a 
similar solution, citing an earlier reference [59] as 
the source of the idea for using a lenticular slanted 
with respect to the vertical image axis. This display 
generates nine viewing regions, through which the 
user can see nine equal resolution images. Based 
on an SXGA (1280 1024) LCD display this results 
in each viewing window image having a two- 
dimensional resolution of 426 by 341 pixels. 

Experience with lenticular optics [61] suggests 
displays based on lenticular optics have to make 
additional design trade-offs. An important one is 


Single lenticule 


Figure 9.21 The slanted arrangement of the 
lenticular lens and pixels in the multi-view display 
[55]. 
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the difficulty of anti-reflection coating the lenses, 
which can lead to distracting reflections on the dis- 
play surface. Another is that scattering of light in 
the lenses generates a visible artefact looking to the 
user like a light grey mist present throughout the 
three-dimensional scene. 

To summarize multi-view displays: 


e Temporally multiplexed displays with high 
resolution per view suffer a number of draw- 
backs: they need high-speed display elements 
and high-bandwidth image generation and 
interface circuits. This seems likely to delay 
their widespread adoption in personal three- 
dimensional display applications. 

e Spatially multiplexed designs have lower reso- 
lution per view than twin-view displays and 
recent designs build in crosstalk limiting the 
three-dimensional depth. Despite this they are 
attractive commercially because of the benefit 
of viewing freedom they provide and their 
relatively low cost and manufacturability. 


A solution for the future is to build a system with 
an intermediate number of views, say three, not 
requiring mechanical view steering and use a head 
tracking device to keep the images up to date with 
the observer’s head position. One such system, 
known as PixCon, is described in [61] and another 
design is presented in [48]. A similar idea for using 
view switching in a twin-view system was pro- 
posed in [50]. A prerequisite for this is low-cost, 
accurate, observer head tracking and some good 
progress is being made in this area [25]. 


9.4 THREE-DIMENSIONAL DISPLAY 
PERFORMANCE AND USE 


9.4.1 Comparing perceived depth 
reproduction 


Perceived depth reproduction is the single most 
important reason for building three-dimensional 
displays but system characteristics in this respect 
are rarely reported in the literature. In this section, 
we consider three generic designs, the twin-LCD 
and single-LCD two-view systems and a single- 
LCD multi-view system, and analyse their ability 
to reproduce perceived depth. Similar real exam- 
ples of these designs are the twin-LCD display [61], 


the single-LCD VPI display [62] and the nine-view 
multi-view display, but our discussion abstracts 
from the details of specific display implementa- 
tions for clarity. We compare the ability of the three 
generic designs to reproduce depth to each other 
and to the performance of the human eye; we also 
consider the demands on the graphics and imaging 
systems supplying the displays with content. 

The generic three-dimensional display designs 
are assumed to be based on the same underlying 
LCD element, a 1280 by 1024 pixel display with a 
horizontal pixel width of i=0.3mm approximat- 
ing an 18.8in. diagonal SXGA display. The three- 
dimensional displays can then be characterized by 
the effective pixel width in the image seen by one 
eye: 


e The twin-LCD twin-view display has two over- 
laid images and the pixel width in each view is 
the same as the base panel at i=0.3 mm. 

e The single-LCD twin-view display has two 
horizontally interlaced images and the pixel 
width in each view is double the base panel at 
i=0.6mm. 

e The single-LCD multi-view display has nine 
views, interlaced horizontally and vertically, 
and the pixel width in each view is triple the 
base panel at i=0.9 mm. 


We assume the latter two displays overlay the left 
and right eye images to simplify discussion, but 
note in practice it will be necessary to consider the 
exact interlacing of RGB components. 

The following set of characteristics provides a 
basis for comparing display designs. Our aim is 
to capture the characteristics that are important 
in the human perception of three-dimensional 
displays. 


Total display resolution: However a stereoscopic 
display is designed to provide views to each 
eye, the total display resolution, i-e., the sum 
of all pixels in all views, largely determines the 
computational effort required to generate the 
images for display and the bandwidth required 
in interface circuits. Displays which require 
image interlacing will also require additional 
functionality in interface circuits as pixels 
from different views typically need to be inter- 
laced at the RGB component level. Bandwidth 
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requirements can be determined from total 
display resolution and the desired frame rate. 

Resolution per view: The resolution per view is 
a key characteristic of a three-dimensional 
display. Having stereo three dimensions does 
not replace the need for high spatial resolution 
and anyone used to 1280 x 1024 monoscopic 
displays will notice the step down when divid- 
ing these pixels between two or nine views. 
However, a three-dimensional display can 
often look better than a monoscopic display 
with the same resolution as a single view on the 
three-dimensional display because the brain 
integrates the information received from the 
two views into a single image. 

Perceived depth voxels: As shown in Figure 9.22, 
a pair of corresponding pixels in the left and 
right images represents a volume of perceived 
depth; we will call this a stereoscopic voxel or 
voxel as in [24]. Of particular interest is the 
depth of a voxel that a display can represent 
for a given screen disparity between corre- 
sponding pixels. We can use this to compare 


ti 


Pixels 


Observer 


the depth representation abilities of different 
displays in depth and to compare displays with 
the ability of the eye to perceive depth. The 
perceived voxels are arranged in planes from in 
front to behind the display; as they recede from 
the viewer the cells increase in depth [8,24]. 


The depth span of a voxel can be found by using 
Equations 9.10 and 9.11 as appropriate to calculate 
difference in depth of points 1 and 2 in Figure 9.22. 

Consider zero pixels disparity as in Figure 
9.22(0). For point 1, a pixel width i=0.3 mm implies 
screen disparity of d=—0.3mm and assuming 
z=750mm and e=65mm then the perceived depth 
in front of the screen plane is 


7 =3.45 mm (9.23) 


P=7——.— 
cae a 
|1-0.3] 


For point 2 the screen disparity is d=0.3 mm and 
the perceived depth behind the screen plane is 


(a) Zero pixels disparity represents a 
volume of perceived depth as overlapping 
corresponding pixels have non-zero width 


Typical values for the generic displays: 
Pixel width, i = 0.3 mm, 0.6 mm or 0.9mm 
Eye separation, e = 65mm 

Viewing distance, Z = 750 mm 


(b) Two pixels disparity represents a 
volume of perceived depth as shown 


Figure 9.22 The perceived depth represented by corresponding pixels of 0 and 2 pixels screen 


disparity. 
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Therefore, the total perceived voxel depth in this 
case is 6.93 mm. This is the perceived depth repre- 
sented by corresponding pixels with zero disparity 
at the screen plane. In practice, it tells us this dis- 
play cannot reproduce a depth difference between 
objects at the screen plane of less than 6.93 mm. 
Results of similar calculations for all three generic 
displays are given in Figure 9.24. 


Perceived depth range: The perceived depth range, 
that is the nearest and furthest points a display 
can reproduce, is of interest. Geometrically, 
this can be calculated from the maximum 
screen disparity available; however, for most 
displays of any size the geometric range is 
much more than can be viewed comfortably 
by the majority of observers. Instead, it is 
important to determine the comfortable per- 
ceived depth range experimentally and for our 
discussion we adopt results reported in [27]. 
This suggests a comfortable working range for 
the majority of people is from 100 mm in front 
to 100 mm behind the display surface and this 
range could probably be extended to 200 mm 
in front and 500mm behind and still be com- 
fortable to view for the majority of observers. 
We take the +100 mm range for our calcula- 
tions here without affecting the generality of 
the discussion. 
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Stereoscopic resolution: Identifying the comfort- 
able working range of perceived depth ona 
display also allows us to define the resolution 
of perceived depth within this range. Perceived 
depth voxels of equal screen disparity form 
planes of voxels parallel to the display surface 
as illustrated in Figure 9.23. We will define ste- 
reoscopic resolution to be the number of planes 
of voxels within the range of +100 mm. 


Stereoscopic resolution can be calculated iden- 
tically for each of the generic displays, which have 
the same viewing distance, by finding the screen 
disparity, d, that generates voxels at 100mm. The 
sum of these values is then divided by the width of 
a stereoscopic pixel, i, on the display in question. 

The table in Figure 9.24 shows values of the 
characteristics discussed here for the three generic 
displays. Not surprisingly the twin-LCD display 
with the most pixels per view has the best results 
for depth reproduction with an ability to repro- 
duce depth differences of 7mm at the screen plane 
and a stereoscopic resolution of 60 planes of depth 
in the working depth range +100mm. However, 
the eye is much better at perceiving depth than the 
best display is at reproducing it with a minimum 
detectable depth difference of 0.84mm and an 
equivalent stereoscopic resolution of 240 planes of 
depth in the working range +100 mm. 

This difference suggests significant improve- 
ments are still possible to the depth reproduction 
characteristics of stereoscopic displays. It is also 
important to keep in mind when using the displays 
if depth judgement is critical to task performance. 


Right 
eye 


Left 


Figure 9.23 Stereoscopic resolution is defined by planes of stereoscopic voxels. 
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Total resolution 1280(h)x1024(v) 


1280(h)x1024(v) 


oe — Twin-LCD Single-LCD Single-LCD oi 
Chipptersic Twin View Twin View Multi (9) View | ‘fuman Vision 
2x 


1280(h) x 1024(v) 


Voxel depth: 
0 pixels disparity 
Stereo resolution 
(in + 100 mm) 


14mm 


View resolution 1280(h)x1024(v) 640(h)x1024(v) 426(h)x341(v) 
View pixel width 0.3 mm 0.6 mm 0.9 mm 
Viewing distance 750 mm 750 mm 750 mm 


21mm 0.84 mm 


7mm 
fail sa 


The calculations for the table assume an observer eye separation of 65 mm. 


Figure 9.24 Table comparing characteristics of the generic displays and the eye. 


In the next section, we briefly review how to cre- 
ate images that account for the available working 
depth range and resolution of three-dimensional 
displays. 


9.4.2 Perceived depth control and 
image generation 


As discussed three-dimensional displays have lim- 
its on the comfortable perceived depth range they 
can reproduce. This results in a working volume of 
space around the display plane that content pro- 
ducers can use to present a scene in. The work- 
ing volume available on the display is unlikely to 
match the volume in the scene being captured. As 
a result, several approaches to mapping depth from 
a scene onto the available depth range of the target 
display have been proposed. 

Traditionally, this has involved a discussion of 
whether to set cameras at eye separation or not 
and whether camera axes should be parallel or 
verging. However, recent methods approach the 
problem as a mapping of a volume of scene space 
onto the available working volume on the target 
display. These methods automatically calculate 
stereoscopic camera parameters given a camera 
position, scene volume to capture and target dis- 
play specifications. Wartell describes one approach 
in [58] while a simpler and more general method 
is given by Jones [27]. These have significant ben- 
efits in ease of image generation for content cre- 
ators and guarantee that depth mappings will be 
geometrically consistent even on head-tracked dis- 
plays. The result is stereoscopic images should no 
longer be produced with excessive perceived depth 
or unwanted distortions. 


Despite the long history of stereo image genera- 
tion, it is only recently that new technology, in the 
form of three-dimensional computer graphics and 
digital camera technology, has been able to give 
enough control over the image creation process to 
use these methods. The methods are particularly 
important to apply when creating images for test- 
ing and improving a display design because poorly 
created images, or even just images produced cor- 
rectly for a different three-dimensional display, 
can cause the highest-quality display to become 
uncomfortable to view. 


9.5 SUMMARY 


Advances in micro-optics, display technologies 
and computing systems are combining to pro- 
duce an exciting range of new opportunities for 
three-dimensional display designers. To achieve 
a good three-dimensional display design requires 
a systems approach combining optical, electrical, 
mechanical and digital imaging skills along with 
an understanding of the mechanism of binocular 
vision. 

The characteristics and geometry of binocu- 
lar vision define limits on the maximum range 
of binocular vision and the minimum depth dif- 
ferences it is possible to perceive in the natural 
world. Because the perception of depth is relative 
to the current fixation point, binocular vision is 
best suited to making relative depth judgements 
between objects. 

Stereoscopic images do not provide the same 
stimulus to the eyes as the natural world and the 
implications of this affect three-dimensional dis- 
play design and use. In particular, while the eyes 
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verge to fixate different depths in a stereo image 
the eye’s accommodation must keep the image 
plane, rather than the fixation point, in focus. This 
places measurable limits on how much perceived 
depth is comfortable to view on a particular three- 
dimensional display. 

As well as the stereoscopic depth cue the 
brain uses many two-dimensional depth cues to 
help it understand depth information in a scene. 
Therefore, the first aim for a three-dimensional 
display design needs to be to keep the same basic 
image quality as a two-dimensional display includ- 
ing values of brightness, contrast, spatial resolu- 
tion and viewing freedom. 

We have introduced two-view and multi-view 
auto-stereoscopic display designs based on micro- 
optic elements including parallax barriers, len- 
ticular arrays, micro-polarizers and holographic 
optical elements. These provide different trade-offs 
in cost, system complexity and performance. Key 
characteristics that define the performance of dif- 
ferent displays include 


e Perceived voxel depth at zero disparity, i.e., 
minimum reproducible depth at the screen 
plane 

© Stereoscopic resolution, i.e., the number of 
discrete voxel planes in +100mm depth 

@ Viewing window characteristics particularly 
inter-channel crosstalk and uniformity 


As three-dimensional display quality continues 
to improve, it becomes increasingly important to 
consider the quality of the stereoscopic images 
used to evaluate displays. This requires the adop- 
tion of new methods for image generation based 
on an improved understanding of the human per- 
ception of stereo images to define the mapping of 
depth from a scene onto the working depth range 
available on the three-dimensional display. 
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Optical scanning is familiar to the general public 
through such widespread applications as super- 
market barcode scanners, desktop color scan- 
ners, laser lightshows and desktop laser printers. 
However, scanning technology is also applied to 
commercial printing processes, thermal imaging, 
medical diagnostic equipment, biochemical anal- 
ysis, and quality control of such diverse items as 
sheet steel, drawn wire, windshield glass, semicon- 
ductor wafers and rice—to name just a few of the 
many applications. 

Optical scanning in electro-optical systems 
began to be developed for pre-press commercial 
print processes in the 1940s, and for airborne map- 
ping and reconnaissance in the 1950s, but the field 
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really took off after the invention and commercial- 
ization of the laser in the 1960s. Since then the field 
has developed rapidly, with military investment in 
forward-looking infrared (FLIR) scanners, and a 
steady stream of new commercial applications. 
Scanning systems can be broadly divided into 
input and output systems. Input systems acquire 
electronic data in one, two or three dimensions 
from physical objects, by scanning a detector 
across the object. Output systems create an image 
from the electronic data by scanning a modulated 
light beam (usually a laser) across a light-sensitive 
medium. Input systems can be further subdivided 
into remote/local sensing and passive/active scan- 
ning. Although the various types of system have 
often developed separately, and in some cases are 
described by different terminologies, there is a 
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great deal of overlap and common ground in the 
technology used. 

Scanning technology is very varied, both in the 
components employed and in their configuration 
into optical systems, and no treatment of the tech- 
nology can hope to be complete. Typically, to meet 
a given requirement, many possible solutions exist, 
and comparing potential solutions can involve 
analysis of a complex trade-off of optical, mechan- 
ical, electronic and software issues. In addition, 
non-scanning solutions may also be possible, espe- 
cially those involving detector and light source 
arrays. In some instances, electronically scanned 
arrays can be considered to be part of scanning 
technology, although analysis of the system in this 
way is more meaningful when the electronic scan 
is combined with optomechanical scanning. 

This chapter is organized as follows: An over- 
view of the most commonly encountered scanning 
configurations is followed by a description of the 
most important scanning system performance 
requirements. Then follow descriptions of scan- 
ning system deflector and lens components, and 
finally more detailed descriptions of a few exam- 
ples of practical input and output scanning sys- 
tems. An attempt has been made to generalize, as 
far as possible, although inevitably many detailed 
issues are specific to a particular category of scan- 
ning system. 


10.2 SCANNING SYSTEM 
CONFIGURATIONS 


Scanning systems can take many forms, and it is 
helpful to group them together in related funda- 
mental configurations. There are several commonly 
used ways of classifying scanning systems, which 
are used in different circumstances, although none 
of them is perfect in the sense of including all pos- 
sible types of scanning system. 


10.2.1 Classification by deflector 
position 


The most commonly applied classification, due 
to Beiser [1], is based on the relative positions of 
the scanning deflector and the objective (focusing 
lens) subsystems that make up the basic optical 
configuration (Figure 10.1). 

Objective scanning is defined as the coinci- 
dence of the deflector and objective, and it implies 


Lens 


Scan 


3 2 1 


Figure 10.1 Scan configurations by deflector 
position. (a) Objective scan, (b) Pre-objective 
scan, and (c) Post-objective scan. 


translation of either the complete optical system or 
the scanned object. Examples of scanning systems 
employing this classification include push broom 
(linear translation of a linear array), XY (orthogo- 
nal linear translation stages) and external drum 
scanning (see Section 10.2.2). The scanning speed 
is limited by the mechanical inertia of large struc- 
tures, which must move at the scanning speed. 

Since this particularly affects acceleration and 
deceleration, objective scanning systems are either 
slow or continuous, as in the cases of external drum 
scanners (see Section 10.2.2) or airborne or satellite 
push broom systems, where the aircraft flight over 
the ground provides the scan motion. To overcome 
this limitation, configurations based on angular 
deflectors, giving an optical lever advantage, must 
be used. 

In pre-objective scanning, the deflector precedes 
the lens in the optical path. This usually requires 
a complex lens, as it must operate over the range 
of off-axis field angles defined by the angular scan 
range. However, this is the only scanning configu- 
ration that can generate a fast-scanned flat image 
plane via a suitably designed flat-field lens (see 
Section 10.5.3), which is the usual use for this con- 
figuration. Either a single-axis deflector producing 
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a straight-line scan, or a two-axis deflector to scan 
a flat plane can be used. 

In post-objective scanning, the deflector fol- 
lows the objective lens in the optical path. This 
allows relatively simple on-axis optics, but leads 
to a curved field, which can be circular if there is 
no separation between the rotation axis and the 
deflection point. This curvature can be acceptable, 
provided that the system either has a large depth 
of focus or is imaging a curved (e.g., cylindrical) 
surface. These curved bed systems are described 
further in Section 10.2.2. 

Additional optics (usually based on large 
aspheric mirrors) can be inserted between the 
deflector and the image to flatten the image field. 
These are often described as post-objective field 
flatteners, but the resulting systems have more of 
the characteristics of pre-objective scanners. 

This classification system deals with individual 
scanning mechanisms for a single scan direc- 
tion. In two-dimensional scanning systems, two 
orthogonal scanning mechanisms are combined. 
In the case of a raster scan pattern, fast scan and 
slow scan axes are defined. Often, the fast scan 
mechanism uses angular deflectors in a post-or 
pre-objective configuration, and the slow scan 
mechanism uses objective scanning. 


10.2.2 Classification by image 
geometry 


Particularly in applications that involve flexible 
object or image media, it is useful to classify scan- 
ning systems by the (input or output) imaging 
medium surface form, which can be either flat or 
curved when mounted for scanning. The medium 
could either be the surface of a solid object (e.g., a 
cylinder) or a thin, flexible sheet of material, which 
is constrained to take the shape of a mounting sur- 
face, or bed. This classification system deals with 
complete scanning system geometries. 

A flatbed scanner usually has the benefits of 
convenient material geometric form and simple 
handling mechanisms, together with the possi- 
bility of a compact, lightweight system. Usually, 
a pre-objective scanner or linear array is used, 
often requiring a complex lens design (see Section 
10.5). It is possible to flatten the curved field of a 
post-objective system by using dynamic focus of 
the objective, but this is limited to slow scanning 


speeds because of the mechanical inertia of the 
lens. A closely related form is the capstan scan- 
ner, which combines a straight-line scanner with a 
transport mechanism for the input or output sheet 
that moves it across the scan line using motorized 
rollers. Typically, this winds a photographic film 
from a roll to a take-up roll. 

Non-flatbed scanners are based on cylindrical 
surfaces, either convex or concave. Convex cylin- 
der scanners are called external drum, as the scan- 
ning optics is external to the cylinder surface. In 
these, the optical system is usually stationary and 
the cylinder rotates (Figure 10.2). In this configura- 
tion, there is great system flexibility for the optical 
design, which can vary from simple (single-beam, 
fixed resolution) to complex (multiple beams, 
variable resolution). In addition, the (mechani- 
cal) scanning system is separate from the optical 
system, so it is relatively easy to convert an exist- 
ing system by changing the optics module, or to 
change the scanned image format by changing the 
size of the cylinder. As the focal length of the opti- 
cal system can be small, very high resolution and 
good image quality can be achieved, but the scan 
speed is limited by the very high rotational inertia 
of the large cylinder drum. This causes problems 
for acceleration and deceleration, as well as safety 
and stability concerns that limit rotation speeds to 
about 2000 rev/min. 

An alternative form of external drum scanner 
uses a line scanner to generate a line along the cyl- 
inder, which rotates only once for each complete 
scan of the drum surface. This is the configuration 
for typical xerographic laser scanners. 

Concave cylinder surface scanning systems 
are based on post-objective scanners, which have 
simple on-axis lens optics, and which can take one 
of the two forms, as described above. When the 
deflector rotation axis is parallel to, and collinear 
with, the optical axis, an internal drum scanner 
results (Figure 10.3). This has the advantage that 
as the deflector is small, very high rotation rates 
can be attained. In addition, a large scanning angle 
range (up to 270°) is easily achieved, and fairly high 
resolution scanning is possible over a fairly large 
area. The image surface is stationary, which sim- 
plifies and speeds up material changeover between 
scans. 

Two forms of internal drum scanner optical 
layouts may be identified. If the laser and opti- 
cal systems are small, the complete optics may be 
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(a) 


Scan 


Scan 


Figure 10.2 External drum configurations. (a) Solid drum output, reflective input and (b) Transparent 


drum, transparency input. 


mounted on the linear traverse slide, in a carriage- 
mounted configuration. When the optics is too 
large or heavy, only the final focusing lens and the 
spinner mirror is mounted on the carriage, and 
the preceding optics generates a collimated beam 
collinear with the cylinder axis. The beam must be 
sufficiently large that the beam divergence is small 
over the carriage traverse length. This is known as 
a beam-riding configuration. 

Multiple-beam internal drum systems are diffi- 
cult to realize, because of the image rotation along 
the scan line inherent in the scan geometry. A half 
speed contra-rotating dove prism before the spin- 
ner mirror can correct this [2], although the speed 
control requirements are very demanding. By 
using a two-axis acousto-optic deflector, derota- 
tion of up to three beams is possible [3]. 

A post-objective scan configuration with the 
deflector axis perpendicular to the optical axis 
(such as in Figure 10.1c) is classed as curved bed. 
Compared with the internal drum configura- 
tion, only a small angular scan is possible, but 
the deflection angle is double the mirror rotation 
angle, resulting in a faster scan. Multiple-beam 


scanning is possible, as there is no rotation on the 
scanned beam axis. For faster scanning, a pris- 
matic polygon mirror may be used in place of a 
single-facet mirror. However, in this case, the 
separation between the rotation axis and mirror 
surface results in distortion of the image surface 
curvature and the scan linearity. 


10.2.3 Classification by scan pattern 


The main division is between raster and vector 
scanning systems. Raster scanning repetitively 
scans a beam ina regular pattern of adjoining lines 
over a rectangular area, like a TV display. Raster 
scanning systems are characterized by 


e Fixed scan speeds, often set by a system elec- 
tronic clock 

e Orthogonal scan mechanisms—one being 
a fast line scan and the other being a much 
slower page scan 


For multiple-beam or multiple-detector systems, a 
further sub-classification can be applied, depending 
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Figure 10.3 Internal drum configurations. (a) Carriage-mounted optics and (b) Beam-riding optics. 


on the method of overlapping the beams. Ina swathe 
scanning system, each set of multiple beams forms 
an overlapping scan pattern or swathe, as in a push 
broom scanner. To scan a larger width, multiple 
scans are made with the first beam of each swathe 
adjoining the last beam of the preceding swathe. 
Some systems allow adjacent elements of detectors 
(e.g., CCD) or light sources (e.g., LED), but when 
some physical separation of elements is necessary, 
the array must be angled with respect to the scan 
direction, to allow overlap of adjacent elements. In 
this case, a timing correction must be made to the 
electronic signals to or from the array elements. 

If an array of n elements has a spacing between 
elements of (n+ 1)p, where p is the required separa- 
tion between adjacent scan lines, an interconnect- 
ing scan pattern will eventually result if each pass 
of the array is advanced by a distance np in the slow 
scan direction. This is known as an interlaced scan. 

In page width segmentation, in a single pass 
of the scanner the beams are equispaced across 


the image width. On each pass of the scanner, the 
beams are moved across by one beam width so that 
each beam adjoins the corresponding beam from 
the previous scan, until it reaches the starting posi- 
tion of its following neighbor. 

Random-access and vector scanning systems 
require agile, low-inertia deflectors to move a scan- 
ning spot along arbitrary paths within a scanning 
area. Typically, orthogonal galvanometer deflec- 
tors or electro-optical deflectors are used for very 
high speed scanning applications. In vector scan- 
ning, the two deflectors are synchronized to move 
the spot at constant speed, to maintain a constant 
exposure. In random access, the spot is quickly 
switched to a new direction, requiring sophisti- 
cated control to minimize the switching time with- 
out causing oscillation. 

Complex scan patterns, similar to Lissajous fig- 
ures, may be generated by simple scanning systems 
comprising of two fixed-speed oscillating deflec- 
tors. The deflectors could be resonant galvanometer 
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scanners, or rotating transmissive glass wedges 
known as Risley prisms. Different combinations of 
the relative speed and phase of the two deflectors 
produce a wide range of scan patterns [4]. 


10.3 SCANNING SYSTEM 
PERFORMANCE 
SPECIFICATION 


In this section, the performance parameters that 
are relevant to scanning systems are examined. 
Scanning system performance requirements vary 
widely in terms of resolution, speed and quality, 
and a thorough analysis of these requirements is 
essential before any design is attempted. 


10.3.1 Resolution 


Resolution of a scanning system can be expressed 
in several different ways. The addressability of 
a digital system is determined by the sampling 
frequency of acquisition or output of pixel data 
points, and is usually measured in units of dots per 
inch (dpi). The image is usually sampled at finer 
intervals than the optical resolution of the system. 
Hence, addressability can give a misleadingly high 
value for the system resolution if not interpreted 
with care. 

The spot size (of the scanning laser beam, or 
of the detector element image) is a measure of the 
optical resolution of the system. The cross-scan 
(perpendicular to the scan direction) resolution 
is well characterized by spot size, but the in-scan 
(in the direction of the scan direction) resolution 
of a flying spot scanner is degraded by ‘smearing’ 
of the scanned spot as a result of its motion. In the 
cross-scan direction, the ratio of the spot size to 
the scan line pitch is significant, as this determines 
the intensity ripple in this direction, and hence the 
visibility of scan lines. 

In a laser scanning system, the beam usually 
has a Gaussian intensity profile, and the spot size 
is determined at either the 50% (referred to as full- 
width, half-maximum (FWHM)) or 1/e? (13.5%) 
level. In some systems, the spot is elliptical in 
shape, with the size optimized for both in-scan and 
cross-scan resolutions, but where a round beam is 
used, the spot size is a compromise between the 
two requirements. 

For an angular scanned system, the concept 
of angular scanned resolution is useful. Angular 


scanned resolution is defined in terms of the num- 
ber of resolved spots per scan line, N. 


N=6/A0 (10.1) 


where 0 is the maximum scan angle and A@ the 
scan angle increment corresponding to the reso- 
lution limit of the system. For an optical aperture 
width D, the resolution is determined by the dif- 
fraction limit 


A@=ad/D (10.2) 


where A is the wavelength and a a constant that 
depends on the aperture shape, beam uniformity 
and resolution criterion used. (For the common 
case of an untruncated Gaussian beam defined by 
the FWHM spot size, a=0.75.) Hence, the funda- 
mental scanning equation [5] is obtained 

N=O0D/ai. (10.3) 
A consequence of this equation is that by increas- 
ing the scan angle with additional optics such as 
an afocal beam expander does not improve the 
number of resolved spots per scan line, as 8D is an 
optical invariant. 

The calculated modulation transfer function 
(MTF) can be very useful when applied to a two- 
dimensional raster scanning system used for imag- 
ing [6]. Although MTF is only strictly applicable 
to linear systems, and some elements of scanning 
systems (especially electronic and software image 
processing) can be highly non-linear, the calcula- 
tions can still provide valuable insights if treated 
with caution. 

In-scan and cross-scan MTFs are different, 
because of the time-dependent factors that affect 
in-scan resolution, which are linearly related to 
spatial frequency response by the scan speed. 

For an input system, the model must include 


e Lens performance, or aperture, if diffraction 
limited; calculated or measured 

e Detector element size and shape; calculated 
from the Fourier transform of the detector 
dimensions 

e Scan motion; calculated from velocity as a sinc 
function [6] 


In addition, electronic responses of scanner system 
components must be included 
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e Detector frequency response 

e Detector amplifier frequency response 

e Software processing of the output data (fre- 
quency response is linearly related to MTF) 

e For output systems, the key optical factors in 
the MTF calculation for the cross-scan direc- 
tion are 

e Optical spot size and intensity profile— 
calculated from the Fourier transform 

e Output medium—measured (related to the 
microscopic grain structure of the active 
material) 


Additional system factors for the in-scan direction 
are 


e Spot motion 

e Modulator frequency response, due to the 
optical modulation device and the electronic 
amplifier response 


Measurement of scanning system MTF is difficult 
for both input and output systems, due to prob- 
lems in generating sinusoidal spatial frequencies 
and because of the complex sampled nature of the 
image. In practice, spatial resolution is usually 
assessed by subjective methods, using test images 
and resolution charts. 


10.3.2 Scan duty cycle 


Duty cycle is an important measure of the effi- 
ciency of the scanning system. It is usually defined 
for a line scanning subsystem as the proportion of 
time that the system is actually performing input 
or output of scanned data during the scanning 
operation. It arises because every scanning mecha- 
nism has a ‘dead’ time, which arises from the scan- 
ning geometry. 

Examples of how dead time arises for specific 
scanning mechanisms are 


© Galvanometer: acceleration/deceleration at 
start/end of linear angular scan range, and 
flyback time if the scan is unidirectional. 

e Polygon: transit time of the beam across cor- 
ners of the polygon mirror. 

e Internal drum: obscuration of the deflected 
beam by the spinner carriage. 

© Acousto-optic deflector: acoustic fill time. 


The duty cycle has a significant effect on the sys- 
tem electronic bandwidth requirement, and on 
the radiometry. The electronic bandwidth of the 
system must be higher than the average data rate 
due to this inefficiency in the optical system. For 
example, the optical modulation rate f,, of an out- 
put system is related to the average data rate f, by 


(10.4) 


The effect on radiometry is that higher opti- 
cal power is required to illuminate or expose the 
scanned object than if the scan were perfectly effi- 
cient. In some cases, this can limit the achievable 
scanning speed of the system. 


10.3.3 Radiometry, input systems 


The critical radiometric performance measure for 
input systems is the dynamic range of the system. 
This is defined as the ratio between the system’s 
white and black levels I, and I,, which are the max- 
imum and minimum possible detector signals. 

The white level can depend on the following 
factors: 


e The brightness of the optical source, including 
its condenser optics 

e The optical properties (e.g., reflectivity or 
transmissivity) of the scanned object 

e The efficiency (e.g., numerical aperture, trans- 
mission) of the detector optical system 

e The size, responsivity and saturation level of 
the detector element 


The dynamic range may be expressed as a simple 
ratio, but is more commonly expressed in density 


units 
Iw 
D= logn( 4} (10.5) 
I, 
or as an integer number of bit levels 
Nee (10.6) 


In2 


Optical density is often a more relevant measure 
of signal, because of the logarithmic response of 
many physical processes, including photographic 
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emulsions and the eye. Np, as defined here, is the 
maximum number of physically significant bit lev- 
els that can be achieved in a digitized output. 


10.3.4 Radiometry, output systems 


For exposure of an image on an area A, the optical 
power of the beam is given by 


f ata 
Nopt Nt 
where s is the sensitivity of the exposure material, 
1, the scan duty cycle, n,,, the optical throughput 
efficiency and Tt the total exposure time. 

For most scanning systems, the material sensi- 
tivity must be valid for very short exposure times. 
This can differ significantly from manufacturers’ 
measured values at shorter exposures because of 
reciprocity failure. For photographic materials, the 
sensitivity usually increases at shorter exposures, 
whereas for thermal materials it decreases. 

The value of the constant k in the equation 
depends on the beam profile and the required 
overlap between adjacent scan lines. When k=1, a 
threshold exposure results, when only a thin line 
at the center of the scan line is exposed. A typical 
value for a practical system is approximately 2. 

Note that, for a fixed scanner speed in terms of 
the number of scan lines per second, as the system 
resolution increases, the exposure time increases. 
Hence, the power requirement also increases. 


(10.7) 


10.4 SCANNING DEFLECTORS 


Angular scanning systems use deflector subsys- 
tems to deflect the optical axis of a beam from a 
point within the scanning system in the form of a 
rotation about a scanning axis. A variety of deflec- 
tion subsystems have been developed to meet 
the needs of different scanning applications. The 
deflector subsystem, with or without auxiliary 
focusing optics, is itself often described as a scan- 
ner by manufacturers. 

Distinction may be made between high-inertia 
deflectors, which can only be used for continuous, 
constant speed scanning, and low-inertia deflec- 
tors, whose deflection angle can be varied quickly. 
The latter category often has a limited angular 
deflection range, which limits its application to 
low-resolution systems. Low-inertia deflectors 


include fast-steering, or tip-tilt mirrors, which are 
two-dimensional beam steering devices primar- 
ily used for line-of-sight stabilization or tracking 
applications. 


10.4.1 Galvanometer mirror 


This important class of deflector employs the elec- 
tromagnetic effect of a permanent magnet and the 
magnetic field created by an electric current in sta- 
tor coils to produce a rotary torque, which drives 
a spindle through a limited angular range [7]. A 
mirror is mounted on the end of the spindle to 
deflect the optical beam. The mirror is a significant 
contributor to the mechanical inertia of the rotor, 
which limits the maximum speed and/or angular 
range of the scanner. Angular range is typically 
no more than 40° (80° optical deflection) for low- 
accuracy applications, and less than 20° for high- 
end applications. 

The rotor can be configured as moving iron, 
moving magnet or moving coil [8]. Originally, 
galvanometer scanners were based on moving 
iron designs [9], but modern designs are based on 
moving coil or moving magnet designs. Moving 
coil designs have relatively high inertia, but high 
and stable torque, and are typically used for 
high-accuracy applications with large mirrors 
(greater than 30mm aperture). Moving magnet 
galvanometer scanners, aided by recent develop- 
ments in high-energy-density rare-earth mag- 
netic materials, have the highest torque-to-inertia 
ratio, and hence the highest speeds. The angular 
deflection speed and position are controlled by 
sophisticated closed-loop servo electronics for 
high-accuracy applications, angular position 
being accurately measured using accurate, capac- 
itive or optical sensors with fast response times. 
Resolution of the order of one microradian can 
be achieved. 

For two-dimensional scanning, two deflec- 
tors are often used, mounted close together with 
their axes at right angles. In this configuration, 
the second mirror must be larger than the first, 
to accommodate the beam movement. Because of 
the separation of the scan axes, the scan pattern is 
geometrically distorted, but this can be computed 
[10] and compensated by software correction of the 
drive signals to the galvanometers. 

Galvanometer scanners are most often used 
in one of two modes: vector scanning or regular 
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periodic form. In vector scanning, the two deflec- 
tors of a two-dimensional scanner are controlled 
in synchronism to move the optical beam along a 
programmed path in response to a required pat- 
tern (e.g., alphanumerical characters), usually at a 
constant spot speed. For regular (e.g., raster) scan- 
ning, the deflector is driven so that the deflection 
is linear with time over a large proportion of the 
scan period. 

For a high duty cycle a sinusoidal variation is 
inadequate, and the deflector angle is controlled to 
a sawtooth form, where the beam that is scanned at 
linear velocity in one direction is, quickly returned 
to the starting point. The scan must be unidirec- 
tional because of the slight ellipticity in the scan 
line due to bearing hysteresis. 

For very high fixed scanning rates, resonant 
scanners have been developed. These use flexure, 
torsion or taut-band mounts to enable very high 
scan speeds of up to 8kHz or more. Although 
these deflectors have low inertia, the scan speed 
and sinusoidal oscillation are fixed, so electronic 
signal timing correction is required to obtain an 
undistorted scan with reasonable scan duty cycle. 


10.4.2 Polygon mirror 


Deflectors based on continuously rotating spindles 
are very commonly used in scanning systems, and 
may have either a single mirror facet (monogon) or 
many facets (polygon). 

These deflectors have much higher mechanical 
inertia than galvanometer scanners, but can rotate 
at very high speeds, thanks to the development of 
precision air bearings. These bearings have largely 
superseded ball bearings for high-end applications, 
having much higher speeds, much lower vibration 
and longer lifetime. Bearings can be hydrostatic, 
ie., externally pressurized, requiring a source of 
compressed air. However, many bearings nowa- 
days are hydrodynamic (or self-acting), requiring 
no external air supply. These require much tighter 
manufacturing tolerances, but modern manufac- 
turing techniques, coupled with developments in 
bearing design (especially materials and surface 
finish), make these a practical choice for most 
high-speed applications. 

At very high rotation speeds (>30,000 rpm), air 
resistance (windage) and turbulent flow of air at the 
mirror edges limit the achievable speed, and very 
high-performance scanners are often enclosed in 


partial vacuum or in a helium atmosphere, which 
has much lower viscosity than air [11]. 

Polygon mirrors may be pyramidal or (more 
commonly) prismatic. Prismatic polygons have 
their facet surfaces parallel to the rotation axis, 
while the facets of pyramidal polygons are at an 
angle (usually 45°) the rotation axis. Prismatic 
mirrors deflect the optical beam by twice the 
mechanical rotation, whereas pyramidal mirrors 
deflect it by the rotation angle. 

Prismatic polygons are most commonly used 
in a configuration that has the input and output 
beams in the same plane, perpendicular to the 
polygon rotation axis (Figure 10.4). This configu- 
ration avoids scan line bow. Careful calculation 
is required to design the polygon dimensions to 
achieve the required system performance with the 
minimum size [12,13]. 

Active facet tracking can be used to further 
reduce the size of the polygon facets. In this tech- 
nique, a subsidiary small-angle deflector is inserted 
before the polygon. This high-bandwidth deflector 
is programmed in such a way that it always deflects 
the input beam to the center of the active polygon 
facet. 

Pyramidal mirrors (Figure 10.5) are more 
expensive to fabricate, but can achieve very high 
scan speeds in a compact configuration. They are 
often used overfilled, i.e., with an input beam larger 
than the mirror facet. This results in a scanned 
spot of constant power and a high scan duty cycle, 
but the optical transmission efficiency is poor. It is 
possible to design a coarse two-dimensional scan 
system by varying the pyramidal angle between 
adjacent facets to produce an angular step between 
successive scan lines. 

The single-facet (monogon) versions of pris- 
matic and pyramidal deflectors are shown in 
Figure 10.6. The pyramidal type is the more com- 
monly used, usually in a post-objective internal 
drum scanner, and many innovations have been 
applied to increase the speed. Optically transpar- 
ent enclosures have been developed to overcome 
windage limitations [14], and special mirror shapes 
have been developed to reduce dynamic distor- 
tions of the cantilevered mirror surface [15]. 


10.4.3 Holographic 


Holographic deflectors [16] are based on rotating 
(flat or curved) surfaces containing fine diffraction 
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Figure 10.4 Prismatic polygon scanner configuration. 
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e Figure 10.6 Single-facet (monogon) deflectors. 
Cn, (a) Pyramidal form and (b) Prismatic form. 
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gratings, which can be either linear gratings 
to deflect collimated beams, or more complex 
curved structures that have optical power and 
act as lenses. Thick blazed transmissive gratings 
Figure 10.5 Pyramidal polygon scanner can have high transmission efficiency in a sin- 
configuration. gle diffraction order for a single wavelength. A 
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Holofacet (grating) 


Figure 10.7 Disc hologon. 


single surface that combines several holographic 
elements is referred to as a hologon (from holo- 
graphic polygon). 

The disc hologon is a form in which hologram 
elements are arranged radially around a flat circu- 
lar disc, which rotates about its axis (Figure 10.7), 
and performs a similar function to a prismatic 
polygon [17]. The figure shows a hologon with opti- 
cal power, but linear gratings and a separate lens 
are often employed in a pre-objective scanning 
configuration. In this case 

. : r 
sin8; + sin®, ve (10.8) 
where t is the grating period. 

If the hologon is tilted by a small angle da, the 
change in diffracted angle is given 

d0, =[1- 0-480 a (10.9) 
cos(6, — da) 
from which it can be shown that sensitivity of the 
output angle to wobble in the hologon tilt is mini- 
mized when 0,=0,, which is the Bragg diffraction 
condition. 

An alternative form of holofacet deflector is 
based on holograms arranged around a cylindri- 
cal surface that rotates about the cylinder axis 
(Figure 10.8), and is analogous to a pyramidal 
polygon. 

Holographic deflectors have several potentially 
attractive fundamental features, compared with 
polygon mirrors: 


e Thin lightweight elements, with improved 
aerodynamic shape (elimination of sharp edges 
in the rotating part), therefore potential for 
higher-speed rotation. 

e Reduced centripetal deformations and/or 
reduced sensitivity to such deformations, 
therefore improved accuracy at high rotation 
speeds. 

e Reduced wobble sensitivity (at Bragg angle), 
therefore potential elimination of anamorphic 
deflection error correction optics. 


In addition, they offer increased design flexibility 
to enable compact configurations, and are poten- 
tially more accurate and/or economic to be manu- 
factured in quantity. 

However, they have some limitations: 


e Wavelength sensitivity of deflection angle, 
which must be corrected by auxiliary diffrac- 
tive optics when used with typical laser diodes 
[18]. 

© Specialized and technically demanding design 
and fabrication requirements, few suppliers 
having the necessary skills and facilities, and 
typically a lengthy, expensive development for 
a new system. 

e The most useful configurations are radially 
asymmetric, leading to complex and difficult 
optical system design issues; in particular [19], 
careful consideration may need to be given to 

e Scan bow and scan linearity; arising from 
complex, asymmetric scan geometry. 
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Figure 10.8 Cylinder hologons. (a) Transmissive and (b) Reflective. 


e Radiometric uniformity; diffraction effi- 
ciency varies with angle of incidence. 

e Polarization: diffraction efficiency depends 
on the relative orientations of the input 
beam polarization and the grating, unless 
the beam is circularly polarized. 


A single-facet holographic deflector, comprising a 
single linear grating, has been successfully used as 
a monogon deflector [19]. This is designed to have 
a diffraction angle of approximately 90°. The grat- 
ing is mounted at 45° to the input beam, when the 
wobble insensitivity is most effective, in a mount 
that rotates about the beam axis. The rotating 
mount can include a lens after the grating, for use 
in internal drum configuration, or the deflector 


can be used with a fixed flat-field lens in a pre- 
objective scan configuration. 


10.4.4 Optoelectronic 


The term optoelectronic is used here to denote any 
deflector that has no mechanical moving parts, 
and whose operation depends on the interaction 
of an electrical signal with the optical properties 
of the deflector material to produce an angular 
deflection. These deflectors can have effectively 
zero inertia, the response being limited only by the 
speed of the drive electronics, making them ideally 
suited to broadband position control and ultra- 
high speed scanning. However, in practice, they 
are only rarely viable alternatives to mechanical 
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scanning mechanisms, due to their limited angu- 
lar deflection range and optical aperture, and 
hence poor angular resolution, typically much less 
than 1000 spots per scan line. Their application is 
therefore generally limited to low-resolution very 
high-speed imaging and active error correction 
subsystems. 

Electro-optic (EO) deflectors use the electro- 
optic Pockels effect of some crystal materials 
to deflect a linearly polarized beam by an angle 
proportional to an applied voltage. Electro-optic 
coefficients of even the best crystal materials have 
low values, so angular deflection is small. Because 
of the birefringence and thermal sensitivity of 
available EO materials, sophisticated designs are 
required, often passively compensating the unde- 
sirable effects of temperature and mechanical 
stress using additional crystals. 

One commercial implementation produces a 
uniform refractive index gradient across the crys- 
tal using a quadrupole array of electrodes [20]. An 
alternative form of EO deflector works by inducing 
a refractive index change in a prism (Figure 10.9). 

The deflection angle may be expressed as 


§=—— (10.10) 


where An=(n,—n,) and n is the refractive index 
at the final air interface. For an EO material, the 
change in refractive index with applied voltage V, 
is given by 


7 
An= amr (10.11) 


where 1; is the material electro-optic coefficient 
and t the thickness of the cell in the z direction 
(perpendicular to the paper). 

The deflector performance is improved by 
fabricating the deflector in the form of several 
sequential prisms of alternating polarity, increas- 
ing the length of the cell while allowing practically 
achievable crystal sizes. This has been implemented 
using semiconductor-manufacturing technology, 
growing LiTaO, crystal prisms using photolithog- 
raphy [21]. 

Acousto-optic (AO) deflectors use diffraction in 
a crystal material, produced by pressure waves, to 
deflect the beam by an amount dependent on the 
applied frequency f: The deflection results from a 
thick diffraction grating created within a crystal 
material by a radio-frequency traveling pressure 
wave generated by a piezo-electric acoustic trans- 
ducer bonded to the side of the crystal (Figure 
10.10). The first order deflection angle is given by 
the diffraction Equation 10.8 above, where the 
grating period t=f/v,, where v, is the acoustic 
velocity of the cell material. 

For high diffraction efficiency, Bragg diffraction 
is used, where the deflection angle of the diffracted 
beam at the center of the scan line is twice the angle 
of incidence. Varying the frequency of the acoustic 
wave about the mean frequency varies the deflec- 
tion angle, producing a scan. 


Figure 10.9 Prism-type electro-optic deflector. (a) Single prism and (b) Multiple prism. 
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Figure 10.10 Acousto-optic deflector. 


The number of resolved spots per scan line for 
an acousto-optic deflector is given by [22] 


_ Ar 
a 


N (10.12) 


where T is the transit time of the acoustic wave 
across the optical aperture and Af the change in 
transducer frequency from the center frequency. 

To maximize T, a material with a slow acoustic 
velocity (usually TeO,) is used and the size of the 
laser beam in the acoustic wave direction is made 
as large as possible, although these factors increase 
the response time of the device. To keep the size 
of the crystal within practical limits, cylindri- 
cal lenses are used to shape the beam within the 
deflector. Practical considerations of crystal size, 
acoustic attenuation and deflector speed limit the 
beam width to about 50mm. 

To ensure that the deflected beam has con- 
stant power over the scan range, it may be neces- 
sary to use a phased-array transducer design [23] 


Bragg angle 0, = + 


Relative deflection angle = 0 = a Af 


a 


to maintain Bragg diffraction over the range of 
deflection angles. 

During scanning, the acoustic wave frequency 
varies across the beam aperture. The frequency 
gradient produces a cylindrical lens effect with a 
focal length of 


2 
Vat Va 


oe ae, (10.13) 
"Or; Mdf/dt) 


f. 


The output beam is converging or diverging, 
depending on the scan direction. This can prevent 
bidirectional scanning, and requires the use of an 
external compensating cylindrical lens in the opti- 
cal system. 

Two-dimensional AO deflectors have been pro- 
duced, using transducers on orthogonal faces of a 
single crystal, although practical limits to crystal 
size mean that the achievable angular resolution is 
severely limited. 

Recent research has employed optoelectronic 
devices with high spatial resolution, such as liquid 
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crystal or micromirror arrays, to produce deflec- 
tion via phased array phase shift [24,25] or vari- 
able-pitch diffraction gratings [26]. These devices 
are still in the early stages of development, and 
show some promise of enabling fast beam steer- 
ing with lower-cost devices and drivers, but show 
little improvement in resolution performance 
over the existing acousto-optic and electro-optic 
technology. 


10.5 SCANNER OPTICS 


10.5.1 Reduction-type array imaging 
lens 


In a reduction-type CCD scanner, a high-quality 
multi-element imaging lens images the scanned 
document on the detector array. This lens must be 
specially designed for the purpose, because a stan- 
dard photographic quality camera or enlarger lens 
is not suitable. 

The requirements that must be addressed in the 
design include 


e Magnification: consider both the ratio between 
document width and array length and the ratio 
between required spatial resolution (in terms of 
addressability) and detector element spacing. 
In some systems, the lens must be corrected for 
a range of magnifications, requiring multi- 
configuration optimization. 

e Focal length: a compromise between mechani- 
cal layout and achievable optical performance. 
Short focal length increases the field angle, 
which makes the design more difficult. 

e Spatial resolution: MTF must be specified for 
both sagittal and tangential directions, for 
the full range of field positions, and for each 
wavelength. The effect of sampled imaging 
must be considered; in particular, the Nyquist 
frequency acts as a limit to resolution. 

e Color correction: for color scanners, excellent 
color correction is required, often involving 
secondary spectrum correction and there- 
fore careful glass type selection. Lateral color 
is often a limiting aberration, which causes 
highly visible color fringes on off-axis image 
features. 


e Aperture: often determined by the radiomet- 
ric requirements of the detector. The effect on 
diffraction-limited resolution and lens perfor- 
mance must also be considered. 


The complexity of lens designs ranges from rela- 
tively simple Cooke triplet types to complex dou- 
ble-Gauss derivatives with six or more elements. 


10.5.2 Contact-type array scanner 
optics 


Input scanning using a long array detector with 
small elements can be achieved without complex 
lenses, if the scan line length and resolution can 
be identical to those of the array. In this case, a 
configuration with the detector in contact with the 
scanned object is usually not feasible because 


e Movement of the detector across the object will 
cause scratching. 

e Fora reflective object, there is no clear path for 
illumination of the object around the detector. 

e Detector arrays are usually packaged ina 
hermetically sealed form, with a window and 
air space between the sensor surface and the 
object. This gap results in a loss of light collec- 
tion efficiency and resolution. 


The usual solution is to include a micro-optical sys- 
tem that images the object on the detector at unity 
magnification. This system can be based on either 


e A linear array of glass microlenses and 
microprisms 

e A linear SELFOC (gradient index lens) array 
lens 

e An optical fiber array 


The magnification in these systems must be non- 
inverting, so that the images from the array lenses 
overlap to form a linear image at the detector array. 
This results in a compact, low-cost configuration, 
and is commonly used in hand-held and desktop 
scanners. Compared to reduction-type systems, the 
numerical aperture and the resolution of these opti- 
cal systems are low, and the performance of the very 
long arrays is inferior to wafer-scale CCD arrays. 
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10.5.3 Flat-field laser scanning lenses 


In a pre-objective flatbed scanning system, a spe- 
cially designed lens is required to focus the scanned 
beam to a line. A theoretically perfect thin lens 
produces a curved field of radius f, so a flat-field 
lens must be designed with the correct amount of 
negative field curvature to compensate the natural 
image curvature. 

In addition, in most cases, it is highly desirable 
that a constant rotation rate of the scanner deflec- 
tor should result in a scan spot with linear velocity 
at the lens focus. Thus, the linear spot deflection y 
at the image plane must be related to the scan angle 
by the relationship, known as the f—0 characteristic 


y= fo (10.14) 


where fis the lens focal length and 0 the scan angle, 
so that 


cy 27 


‘ 10.15 
dt dt ( 


A distortion-free lens has the characteristic 


y= tan0 (10.16) 


so the lens must additionally be designed with the 
correct amount of barrel distortion to produce the 
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desired f—0 characteristic. Such a lens is therefore 
referred to as an f—O lens. 

Another common requirement is for telecen- 
tricity in image space, ie., the central, or chief rays 
of the beam are parallel with the lens axis for all 
scan angles. For a thin lens it can be seen from 
Figure 10.11 that this is achieved when the entrance 
pupil of the lens is positioned at the back focus of 
the lens. Telecentricity ensures that the focused 
spot is not elongated into an ellipse by the angle of 
incidence at the image plane. The drawback is that 
at least the final lens element must be longer than 
the scan line. In practice, true telecentricity is often 
not required, and a reasonable divergence of chief 
rays from the scan lens can be tolerated, greatly 
reducing the size of lens elements. The fractional 
increase of spot dimension in the y-direction is 


As_ 1 


s  cos() 7 cee 


so, for example, an angle of 15° increases the spot 
dimension by only 3.5%. In practice, this would be 
acceptable in many systems. 

With modern lens design software, it is quite 
possible to design lenses that are simultaneously 
corrected for field flatness, f—0 distortion and tel- 
ecentricity. However, excellent optical correction is 
also usually required, to ensure uniform spot size 
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Figure 10.11 Telecentric scan lens. 


10.5 Scanner optics 337 


over the scan line. The fewer the constraints placed 
on the design, the better the achievable lens resolu- 
tion performance. Although it is sometimes pos- 
sible to design around an existing lens design, the 
system-specific requirements for wavelength, scan 
line length and resolution most often dictate a need 
for a custom lens. 

Conventional, spherical-surfaced glass lenses 
typically comprise three or more lens elements. 
Although these can be of a single glass type, better 
correction is obtained, even for a monochromatic 
requirement, by using more than one glass. Using 
a combination of crown and flint glasses, color 
correction can be achieved for two, or even three, 
wavelengths. Usually, diffraction-limited perfor- 
mance is required; this becomes more difficult 
to achieve as the spot size decreases and the scan 
angle increases. The number of resolved spots per 
scan line is a good measure of the lens quality, with 
anything over 25,000 spots/scan line being difficult 
to achieve. 

For systems that are manufactured in large 
numbers, one-or two-element injection-molded 
polymer lenses are usually used. As the mound 
tool determines the lens shape, aspheric surfaces 
are economic, and the lens can therefore achieve a 
good optical performance with fewer surfaces than 
can be achieved with spherical surfaces. By using 
very complex, asymmetric surfaces, it is possible to 
correct even the effects of pupil wander—the small 
movement of the beam relative to the lens axis dur- 
ing polygon rotation [27]. 

Because of the limited choice and range of 
materials, color correction is not possible with 
plastic lenses, and the number of spots per scan 
line is usually less than 10,000. Thermal stability 
is often an issue, even within a normal office envi- 
ronment, because of the high temperature coeffi- 
cients of expansion and refractive index of plastics. 
It is possible to passively compensate for tempera- 
ture variations by carefully designing hybrid glass/ 
plastic optics [28]. 


10.5.4 Deflector error correction 


A number of design techniques have been devel- 
oped to reduce, or eliminate, cross-scan deflection 
errors caused by manufacturing and alignment 
tolerances of prismatic polygons. These techniques 
allow relaxation of what would otherwise be pro- 
hibitively tight tolerances. 


Active deflection error correction uses a small- 
angle, high-speed deflector in the optical path just 
before the main scanning polygon. This controls 
the input beam angle in the cross-scan direc- 
tion, to compensate for cross-scan angle errors 
in the polygon. In an open-loop control system, 
the fixed pyramidal errors of the polygon mirror 
are programmed in a calibration procedure. In 
a closed-loop control system, a separate optical 
measurement system is used to measure the cross- 
scan deflection angle, and a servo control system 
dynamically adjusts the input angle to correct any 
deviation. In the latter case, non-repeatable errors 
such as bearing wobble can be corrected in addi- 
tion to polygon fabrication errors. 

Passive deflector correction can be achieved at a 
much lower cost using non-rotationally symmetric 
optical surfaces in the flat-field lens. The principle 
of the correction system is to use a cylindrical lens 
to focus the input beam on the polygon facet only 
in the cross-scan plane, and then to use cylindri- 
cal or toroidal elements within the flat-field lens to 
make the facet surface conjugate with the image 
plane in the cross-scan direction (Figure 10.12). 
Where the optics consists of conventionally pol- 
ished glass optical elements, a toroidal lens is added 
to the usual flat-field lens either before the lens, or 
close to the image plane. However, where molded 
plastic optics is used, the toroidal elements, incor- 
porating aspheric surface forms, may be integrated 
into the lens design, minimizing the number of 
optical elements and achieving better cross-scan 
error correction. 

Other correction systems are possible, using 
multiple reflections to cancel the deflection errors. 
A pentaprism has the well-known property that 
the deflection angle is a constant right angle, 
independent of the angle of incidence, and this 
is generally true for double-reflection systems. A 
pentaprism [29], or any of its open mirror equiva- 
lents [30], might be used as the deflector element 
of an internal drum scanner. These internal cor- 
recting reflector systems have the disadvantage 
of requiring significant complexity in mechani- 
cal mounting and balancing. This disadvantage 
does not apply to external correction systems. An 
external double-reflection element may be used to 
correct deflection error in a polygon. In practice, 
this greatly compromises the system design, sig- 
nificantly increasing the polygon size and flat-field 
lens aperture and reducing scan duty cycle. 
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Figure 10.12 Use of anamorphic lenses for deflection error correction. (a) Uncorrected and 


(b) Corrected. 


10.6 INPUT SCANNING SYSTEMS 


In this section, we look at examples of designs of 
scanning systems used for input of data. These are 
essentially electronic imaging systems, but they 
can give far more information than a visual image, 
providing precise measurement of color, scattering 
properties, size or fluorescence at all points within 
the scanned area. 

Scanning systems have been developed for one- 
dimensional (e.g., supermarket barcode scanners 
and wire diameter measurement systems), two- 
dimensional (e.g., desktop document scanners) 
and three-dimensional (e.g., reverse engineering of 
aerodynamic automobile models) measurements. 
Three-dimensional scanners are mainly one of the 


three types. The first type uses laser radar princi- 
ples, i-e., a two-dimensional spatial scan combined 
with high-speed time-of-flight measurement of 
fast laser pulses. 

The second type uses optical triangulation to 
measure step changes in the height of a projected 
laser line, measured by a two-dimensional sen- 
sor array at an angle to the line projection plane. 
Lastly, confocal scanning microscopy [31,32] uses 
a small-aperture detector at the focus of a retro- 
reflected beam to produce a scanned image with 
very small depth of focus, enabling high-resolution 
three-dimensional images when the object is 
stepped through focus on successive scans. 

Input systems can be classed as either passive 
(ie., using ambient or fixed illumination with a 
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scanning detector) or active (ie., providing illu- 
mination via a scanned beam for a fixed or scan- 
ning detector). FLIR thermal imaging systems [33] 
detect infrared thermal emissions in the 8-12 um 
band. Active input systems that use the scanned 
illumination beam (usually a laser) to define the 
scanned image resolution are known as flying spot 
scanners. Examples of applications that use these 
two classes of scanning are discussed later. 


10.6.1 CCD document scanner 


CCD scanners are based on linear array CCD 
detectors, which image the width of the document, 
with a linear mechanical scan stepping along the 
length of the document. Scanners are classed as 
either contact type, where the resolution is essen- 
tially that of the detector elements, or reduction 
type, in which higher resolution is achieved by 
imaging the document on the detector array with 
a magnification less than unity. Reduction-type 
scanners have higher performance, but are more 
expensive and bulky. 

A schematic optical layout of a typical large- 
format document scanner is shown in Figure 10.13. 
Illumination along a line of the document (which 
could, for example, be a poster, a map or an 
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Figure 10.13 Large-format document scanner. 


engineering drawing) is provided by two cylindri- 
cal fluorescent lamps, concentrated by cylindrical 
reflectors. In a camera assembly, a multi-element 
lens images the line on a tri-linear CCD device. 
This simultaneously produces three red, green and 
blue color-separated images of a line of the docu- 
ment, immediately after the time-delays between 
the lines are corrected. This provides sufficient 
data to accurately measure the color of the docu- 
ment at each point, after calibration. 

The spatial resolution is determined by the 
number of elements in each line of the CCD array, 
typically 5000, and the lens magnification. The ele- 
ments are typically 8 jum square, and there is no 
gap between elements in a line. To obtain higher 
resolution, two or more camera systems are opti- 
cally butted together, to increase the number of 
resolved spots per scan line. This is achieved by 
slightly overlapping the images of the two CCD 
sensors at the document, and ‘stitching’ the elec- 
tronic images together by software processing. 

The CCD array scans the line image data elec- 
tronically, and the line is scanned along the docu- 
ment by a motorized pair of pinch rollers that 
transport the document across the stationary 
optical system. To achieve the necessary precision 
in this transport system, a sophisticated control 
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system is required to ensure accurate speed and 
constant tension in the scanned document. 


10.6.2 Flying spot scanners 


The term flying spot scanning was originally 
applied to CRT scanning, but nowadays it is more 
often applied to laser scanning. The resolution of 
the system is defined by the laser spot size at the 
object, and a non-imaging detector system collects 
the light that is transmitted, reflected or scattered 
by the object. Laser barcode scanners work on this 
principle. 

A more complex system, used for on-line 
inspection of defects in float glass, is shown in 
Figure 10.14. In this system [34], a 633 nm HeNe 
laser beam is scanned by a rotating 12-facet pris- 
matic polygon across the 2m wide glass sheet 
during manufacture, as the glass is moved on a 
conveyor in a direction perpendicular to the scan- 
ning direction, producing a continuous raster scan 
of the sheet. Glass defects are detected and ana- 
lyzed on-line by measurement of the transmission 
and scatter of the 0.45 mm diameter laser spot. 


Polygon 
mirror 
Detector 1 


In this system, the detector system is contained 
within the scanner head, by employing a retro- 
reflecting screen of micro spheres to return the 
light transmitted through the glass to the scanning 
system. The reflected light forms stationary images 
of the scanning spot at two detector positions, 
returning via two facets of the scanning polygon. 
One of the detectors is partially masked by a cir- 
cular obstruction corresponding to the laser spot 
size, and this measures the scattered light. The 
other detector simply measures the transmission 
through the glass. 

The polygon facet size is determined by the light 
collection requirements of the detector system, 
rather than by the size of the illuminating laser 
beam. As the retro reflecting sheet produces a cone 
of about 2° angle, a large polygon with facets of 
50 mm X25 mm is required, which makes the poly- 
gon manufacture challenging. 


10.7 OUTPUT SCANNING SYSTEMS 


In output scanning, a light source (almost always 
a laser) is used to write image data to a screen or 
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Figure 10.14 Glass sheet inspection scanner. 
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to a light-sensitive output medium. Systems vary 
greatly, depending on the size and characteristics 
of the laser, whose wavelength can be between 
350nm and 10.6ym, and whose power can be 
between 1 mW and 1kW. 

Most output systems are two dimensional, 
imaging to a flat or cylindrical surface. Image size, 
resolution and speed are the key scanner charac- 
teristics. Laser displays take advantage of the eye’s 
persistence of vision to create a two-dimensional 
image for advertising or entertainment. 

The writing process can be based on thermal 
(e.g., laser marking) or photonic (e.g., photo- 
graphic) processes. In some processes, the image 
density is a monotonic function of incident power 
density, in which case the image is described as 
continuous tone, or contone. Other processes are 
sharply thresholded, so that the image density 
takes one of two (or rarely, more) discrete values 
depending on the incident power density. In the 
latter case, the image is made up of a fine array of 
dots, and is described as halftone. 

The power density variations are produced by 
modulating the continuous-wave laser power as it 
is scanned across the image surface. Modulation 
may be controlled by an external acousto-optic or 
electro-optic modulator, or by controlling the laser 
current in some cases, particularly in laser diodes. 
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10.7.1 Desktop laser printer 


A typical xerographic laser printer is shown sche- 
matically in Figure 10.15. The configuration may 
be described as a single-pass external drum, or as 
a flat-field line scan, with drum rotation providing 
the slow axis scan. In this example, two adjacent 
scan lines are imaged from independent low-power 
780 nm laser diodes LD1 and LD2, doubling the 
imaging (drum rotation) speed. The image is pro- 
duced by a spatial variation of electrostatic charge 
on the pre-charged photoconductive drum sur- 
face, which is discharged by incident light. Toner 
particles are attracted to the charged areas, and 
transferred by contact to sheets of paper to produce 
the finished print. 

In the example shown, an eight-facet polygon 
produces the scan, rotating at 30,000 rpm. A three- 
element f—0 lens produces an elliptical spot of 
50pm in the scan direction and 60 pm in the cross- 
scan direction, for a printer resolution of 600 DPI 
over a scan length of 312mm. This allows a print- 
ing speed of 20 A3 copies per minute. 

Key features of the design are 
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efficiently combined using a polarizing beam 
splitter, aligned in such a way that the beams 
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Figure 10.15 Laser printer. 
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overlap at the polygon—which is also the 
entrance pupil of the f—9@ lens—and are at an 
angle to each other to separate the beams by 
exactly one scan line at the image drum. 

A photodiode just beyond the image area 
monitors the beam power and position, as part 
of the control feedback loops for the laser diode 
and polygon motor drivers. 

Deflector error correction is achieved using an 
anamorphic lens design, with a cylindrical lens 
before the polygon and different f—0 optical 
power in in-scan and cross-scan planes. 

One of the f—0 lens elements is glass, with 
spherical surfaces. The other two elements are 
injection-molded plastic with anamorphic, 
aspheric surfaces. The use of glass enables 
thermal compensation of the lens performance 
over a 10°C-40°C range, which would be a 
problem with an all-plastic lens. 
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11.1 INTRODUCTION 


It is said that man has five senses—sight, touch, 
smell, taste, and hearing—at his disposal. Clearly, 
the one he finds most generally valuable is his sight. 
This is hardly surprising, as it is excellent for remote 
sensing, gives an effectively instantaneous response, 
has truly enormous parallel information capacity, 
and provides far more reliable and quantitative data 
than any of the others—it is common to say “you 
believe the evidence of your own eyes.” Optical 
sensor technology may still have some way to go 
to match the same compact design and all-round 
performance as the eye/brain combination, but 


11.5.10 Distributed sensing using a 
Sagnac loop interferometer 
Distributed sensing using a 
counter-propagating optical 
pump pulse 
Use of the “synthesis of the 
optical coherence function” 
method for distributed sensing 
by monitoring the gain process 
associated with stimulated 
Brillouin scattering (K Hotate, 
RCAST, University of Tokyo) 
11.5.13 Distributed sensing to 
determine profiles of strain 


404 
11.5.11 


407 
11.5.12 


408 


and temperature in long 
in-fiber Bragg gratings 
11.6 Applications of OCT and low- 
coherence interferometry 
11.6.1. Instrument design 
11.6.2 Center thickness, sagittal 
depth, and refractive index 
measurements of contact 
lenses in solution 
Automotive HUD windshield 
wedge angle and thickness 
assessment 
Measurement of cell phone 
displays 
Imager flatness assessment 


410 


410 
411 


414 
11.6.3 


415 

11.6.4 

417 

11.6.5 
and simultaneous thickness, 
surface and index profiling 418 

11.6.6 On-line coating thickness 

measurements 

11.7. Conclusions 

References 


Bibliography 


419 
422 
422 
430 


the promise is clearly there! As an aside, it is worth 
noting that we, as humans, are not able to remotely 
sense electrical or magnetic fields well (although 
migrational birds are believed to use magnetic sen- 
sors), so, if one believes that nature often chooses the 
best methods, the technology of optical sensors may 
perhaps evolve to overtake that of electrical ones in 
time. Modern camera and CD player technologies 
have shown that even highly complex optoelec- 
tronic systems can be manufactured cheaply in vol- 
ume, so cost should not present a major barrier for 
large-scale application of standard devices. 

This chapter reviews the many ways in which 
optical fibers may be used, in conjunction with 
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optoelectronic instrumentation, to sense physical 
or chemical parameters. This is done using either 
intrinsic sensors, where the fiber itself acts as the 
sensing element, or extrinsic sensors, where the fiber 
acts in its normal communication role, to carry light 
to and from an external optical sensor. The fiber can 
enable remote and rapid sensing to be performed by 
sensing at specific points or over specific cable paths. 
This complements the capability of optical sensors 
such as cameras, which have valuable imaging capa- 
bility, as fiber cables do not need a direct “line-of- 
sight” to operate and measure in inaccessible places. 
Because there are now very many varieties of optical 
fiber sensor, it has been necessary to concentrate on 
a few important concepts and only describe a few 
practical sensor types in more detail. A far more 
detailed account of fiber sensors is given in the text- 
books mentioned in bibliography References [1,2]. 
Optical fibers are widely used in long distance 
trunk telecommunications systems (see Vol II, 
Chapter 1) and their use in shorter distance appli- 
cations is growing rapidly. For communications 
systems, it is desirable to utilize a cable having 
transmission properties that are insensitive to 
environmental changes. Intrinsic fiber sensors, in 
contrast, rely on deliberately configuring the opti- 
cal system to be sensitive to external influences on 
the fiber or cable. This involves the arrangement 
of the fiber cable, its associated light source, the 
optical receiver, and the chosen signal process- 
ing scheme to maximize (and detect) a change in 
transmission properties that is characteristic of 
the parameter to be sensed. This may be carried 
out either by using especially sensitive cables or by 
interrogating conventional fiber cables in a manner 
that highlights small changes in transmission. The 
resulting changes in transmission can then be used 
to measure features of the external environment. 
An extrinsic fiber sensor usually uses a conven- 
tional optical fiber (preferably sheathed within an 
environmentally insensitive cable), which is only 
employed as a convenient light-guiding medium 
to transport light to and from more conventional 
optical sensors at the end of, or at specific points 
on, the cable. This sensor, in response to an exter- 
nal physical parameter, modifies the light coupled 
back from the input fiber into the return fiber and 
guided back to the detector system. Such a sensor 
may still be undesirably affected by environmen- 
tally induced changes in cable properties, unless 
particular care is taken with its design. Avoiding 


unwanted cable-induced errors is a very impor- 
tant practical consideration, in both intrinsic and 
extrinsic sensors. In a good sensor system design, 
care will be taken to avoid significant sensitivity to 
undesirable losses in the optical network, such as 
could occur, for example, due to fiber bending or 
connector variations. 

The research area of fiber optic sensors is a very 
fruitful one for original thought, as there is not 
only a very large number of physical and chemical 
parameters that may be sensed, but also a multitude 
of ways in which the parameter may be arranged 
to modulate the optical transmission. However, in 
order to make a successful sensor, the normal prin- 
ciples of good instrument design must be adhered to 
li.e., low cross talk to undesirable parameters, good 
signal-to-noise ratio (SNR), and repeatable and reli- 
able operation] and the sensor must be designed to 
be cost effective for the intended application. Many 
of the successful commercial applications of such 
sensors are in niche market areas, which rely on the 
inherent advantages of optical sensors over more 
conventional electrically based sensors. The pri- 
mary technical advantages include the following: 


e Intrinsic freedom from electromagnetic inter- 
ference (EMI), lightning strike, etc. 

e Intrinsic safety in hazardous (explosive vapor) 
environments (provided optical signal power is 
low, which is almost invariably the case). 

e High electrical isolation, enabling their use 
in medical applications and for data collec- 
tion from points at high voltage. The fiber also 
gives freedom from problems of electrical short 
circuit and open circuit. 

e Excellent resistance to chemical corrosion (can 
be used in highly corrosive environments, e.g., 
saltwater, acid, and alkali). 

e Passive operation; no electronics circuitry or 
electrical power is required at the remote sens- 
ing point. 

@ May be used in high-temperature areas, where 
electronic systems would not survive. 

@ Optical fibers are smaller, lighter, and cheaper 
than electrical cables. 

@ May be used for distributed sensors of extreme 
length due to the low losses (0.2 dB km“) 
achievable in optical fibers. 

e Where optically based data transmission is 
already envisaged, sensor information can be 
carried in the same optical fibers. 
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e Optical sensing can provide a very rapid 
response in many applications as changes in 
optical transmission can be detected almost 
instantly. 


When the use of optical signal cables becomes 
more common for short distance routes, it is likely 
that optical transducers will start to find more 
general applications, even in areas where the above 
intrinsic advantages are not a major consideration. 
The ability to sense, communicate, and multiplex 
signals within an optical network is an impor- 
tant attribute, which will lead to their greater 
application. 

We shall start by considering the different types 
of sensor, and the physical means by which each 
can operate. 


11.1.1 Summary of intrinsic types 
of sensors, where the optical 
fiber is used directly as the 
sensing element 


The intrinsic optical fiber sensor takes advantage 
of measurable changes in transmission character- 
istics of the fiber itself. The principal parameters of 
interest for sensing are as follows: 


e Light generation in the fiber due to physical 
interactions (e.g., scintillation or Cerenkov 
radiation). 

e The propagation time of light in the fiber 
(proportional to the length and inversely 
proportional to the velocity of light). This can 
be measured as a temporal delay or as a phase 
change in the light. 

e The optical power transmitted by the fiber 
(either the total power or the spectral varia- 
tions in transmission). 

e The distribution of optical power between 
the various modes of propagation. (This can 
be measured from either the near-or far-field 
waveguide mode patterns at the end of the 
fiber.) 

e The state of polarization (SOP) of the transmit- 
ted energy through the fiber (or backscattered 
energy from the fiber). 

e The light scattered from within the fiber core 
material. This can include elastic scattering 
(same wavelength as incident light) or involve 


Raman scattering, Brillouin scattering, or 
other nonlinear interaction, such as optical 
gain by Raman or Brillouin processes. 


11.1.2 Summary of extrinsic types 
of sensors, where the optical 
fiber is used merely as light 
guiding medium, to address 
more conventional optical 
sensors 


The main methods of operation of extrinsic fiber 
sensors are outlined as follows: 


e Light due to a chemical or physical interac- 
tion is collected by the fiber (e.g., radiation 
due to high temperature pyrometry, opti- 
cal scintillation of a semiconductor, and 
chemiluminescence). 

e The optical power transmitted by an external 
optical modulator coupled to the fiber (either 
the total power or the spectral variations in 
transmission) is monitored. Examples include 
mechanical optical shutters, electro-optic 
switches or modulators, variable optical filters 
(such as Fabry-Perot, grating monochromator, 
temperature-dependent semiconductor). 

e The optical power reflected by an external opti- 
cal modulator coupled to the fiber (either the 
total power or the spectral variations in trans- 
mission) is monitored. Examples as above. 

e The optical power scattered elastically (no 
wavelength change) by an external medium 
coupled to the fiber is monitored. This effec- 
tively measures turbidity of the medium, in 
reflective mode. 

e The optical power scattered inelastically 
(scattered-wavelength changes) by an external 
medium coupled to the fiber is measured. This 
includes processes such as Raman scatter- 
ing, fluorescence, and photon correlation 
spectroscopy. 

e The SOP of the energy transmitted by an exter- 
nal optical modulator coupled to the fiber (or 
that reflected back into the fiber) is monitored. 


11.1.3 Evanescent field sensors 


Evanescent field sensors exhibit some of the 
characteristics of intrinsic sensors and extrinsic 
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types but do not exactly fit into either category. 
As with intrinsic sensors, the light is guided by 
the fiber (or by an optical waveguide attached to 
the fiber) in the sensing region, but, in the case of 
evanescent field sensors, a portion of the optical 
energy travels outside the physical limits of the 
waveguide material. In the lateral direction, the 
field decays rapidly as distance from the wave- 
guide increases, a behavior known as evanescent 
field decay. Because there is light energy outside 
the guide material, it is possible for this light to 
interact with (e.g., be absorbed, scattered, or 
excite fluorescence in) the surrounding material. 
Then, for example, the evanescent field sensor can 
detect the optical properties of a fluid in which 
it is immersed or solid material in which it is 
embedded, simply by measuring the light trans- 
mitted through the fiber. Unfortunately, because 
the evanescent field region is very thin (typically 
<1 pum um thick), this type of sensor is very 
prone to surface damage or contamination. Also, 
the evanescent field penetration depends strongly 
on the refractive index of the surrounding mate- 
rial (and hence on temperature) and on whether 
or not the guide is bent, so it is very sensitive to 
the operational environment. 

Instead of immersing the evanescent field sen- 
sor directly into a solution to be monitored, it can 
be coated with an active layer, such as a polymer 
or a semipermeable, glass-like, sol-gel coating. 
These coatings can be made sensitive to desired 
chemicals by incorporating (immobilizing) an 
indicator chemical into them. Provided the layer is 
semipermeable, chemicals can diffuse in from the 
surroundings and change the optical transmission 
of the indicator material. Clearly, the presence of 
this layer reduces the problems of contamination 
of the critical evanescent field region, as this now 
lies inside the solid material. 

A further variation is to coat the light guide 
with a thin metal layer, which can enhance the field 
in the evanescent region by a mechanism called 
plasmon resonance, which, as the name implies, 
involves excitation of the “electron gas” present 
within all metallic conductors. This plasmon reso- 
nance mechanism can give greatly improved sen- 
sitivity to any absorption in the evanescent field 
region and still allow indicator layers to be coated 
on as before. 

Because of the ease of contamination and the 
environmental sensitivity of evanescent sensors, 


they are usually better suited to qualitative testing 
for chemicals, rather than a means of performing 
quantitative chemical or spectral analysis of fluids 
in which they are immersed. Nonetheless, they can 
be a very sensitive means of detecting trace quanti- 
ties of chemicals, provided they are designed and 
used with care. 

The following types of evanescent field sen- 
sor (with bracketed notes to indicate where more 
details can be obtained from bibliography 1) are 
commonly used: 


e Unclad fiber, with evanescent field extend- 
ing directly into the gas or liquid to be sensed 
(bibliography [1], vol. 1, p. 607). 

e Fiber with a physically sensitive (or chemically 
reactive) cladding layer, with the evanescent 
field extending into cladding. The chemical 
must diffuse into, and physically modify, or 
react with, the layer (bibliography [1], vol. 4, 
pp. 352-354). 

e Polished half-coupler sensor. Here, the fiber 
is set in a slit in a glass block, with part of the 
cladding polished away to expose the external 
medium (or a reactive polymer coating layer) 
to the evanescent field region (bibliography [1], 
vol. 1, pp. 213-215). 

e Integrated optics sensors, with natural propa- 
gation of evanescent field above surface of 
chip. Again there is exposure of the external 
medium (or reactive polymer layer) to the 
evanescent field region (bibliography [1], vol. 1, 
chapter 9). 

e Plasmon resonance. Here, a thin metallic layer 
enhances the optical coupling of the light in 
the fiber to the evanescent field region in the 
external medium (or reactive polymer layer). 
This enhancement can be used with most of 
the above configurations (bibliography [1], 
vol. 1, pp. 203-206). 


Please note that Chapter 3, by G. Stewart, in vol. 3 
of bibliography 1, gives a detailed overview of eva- 
nescent field devices. 

Below, we shall give more detailed descriptions 
and case studies of optical sensors. The remain- 
der of the chapter is split into four sections, each 
written by a different coauthor. Section 11.2 (John 
Dakin) covers the basics of intensity-based sen- 
sors and Section 11.3 (Kazuo Hotate) treats inter- 
ferometric types. Section 11.4 (John Dakin) covers 
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how sensors may be multiplexed and Section 
11.5 (john Dakin, Bob Lieberman, and Kazuo 
Hotate) shows how sensors that operate on a fully 
distributed basis can be made. The final section, 
section 11.6 (Mike Marcus) deals with a detailed 
case study of a type of interferometric sensor that 
has many industrial uses. Because the chapter is 
composed of contributions from several authors, 
there are small differences in style of writing and 
diagrams reflecting their different contributions. 
It should be emphasized again that not all types 
of sensor can be covered in such a short chapter, 
and there has had to be much careful selection 
of which types to include and which case stud- 
ies to present in more detail. The reader requiring 
more detail is again referred to the bibliography, 
to the “Optical fiber sensors” and the “Europt(r) 
ode” international series of conferences and also 
to the many SPIE-organized (http://www.SPIE. 
org) conferences covering this area. 


Fast optical receiver 


11.2 INTENSITY-BASED OPTICAL 
FIBER SENSORS 


11.2.1 Physical sensors 


11.2.1.1 SIMPLE OPTICAL INTENSITY 
SENSORS 


We shall first describe the very simplest form of 
intensity-based sensors, where only the transmitted 
or reflected light level is measured. We shall start 
with intrinsic sensors, then discuss extrinsic types. 


11.2.1.1.1 Light generation in fiber itself 


A few mechanisms allow generation of light in a 
fiber, as a result of a direct interaction with the 
physical field to be monitored. There are two prin- 
cipal types here. One is a fiber sensor for detection 
of pulses of ionizing radiation, having extreme 
peak intensity, of the type that can occur due to 
thermo-nuclear events (see Figure 11.1). 
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Figure 11.1 An optical sensor for detection of intense ionizing radiation (relativistic charged particles) 


using the Cerenkov effect. 
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If a relativistic charged particle (one with a 
velocity in free space close to the velocity of light in 
vacuum, c) passes through a medium such as glass, 
of refractive index, n, and if its entrance velocity 
initially exceeds the velocity of light c/n, in that 
new medium, then it will lose energy rapidly in 
the form of electromagnetic radiation. The energy 
loss occurs due to the Cerenkov effect [1,2] and is 
emitted mainly in the form of blue and ultraviolet 
(UV) light, but also with some energy extending 
to longer wavelengths. If an intense pulse of such 
radiation strikes a fiber, the temporal variations in 
the pulse intensity can be monitored, by detecting 
the light at the far end of the fiber. 

A sensor that is more sensitive to lower levels 
of radiation is one that uses fiber fluorescence [3] 
resulting from the radiation, either fluorescence 
intrinsic to the silica fiber or that generated more 
efficiently by doping the fiber with fluorescent 
material. Arrays of polymer fibers, doped with 
fluorescent organic dyes, have been used as long 
linear scintillators to track the passage of ioniz- 
ing radiation in particle accelerator systems, with 
detectors at the end of each fiber to collect the 
light. 


11.2.1.1.2 Microbend sensors 


Microbend sensors [4,5,6] are intrinsic sensors, 
which take advantage of the loss in optical fibers 
when they are bent. The word microbend is used 
because gradual bends, i-e., bends of large radius 
>50mm, cause very little loss in a fiber, or they 
would not find the extensive uses that they have in 
communications, whereas microbends (or sharp 
“kinks,” of small bending radius of a few mm or 
less) can cause high losses, even if present over very 
short lengths. 

The losses in the fiber are due to conversion 
of energy in guided fiber-core modes to cladding 
modes, which are then usually lost by absorption 
or scattering in the fiber sheathing material. In 
the case of monomode fibers, the mechanism is 
relatively simple, as there is only one guided mode 
(strictly two, if the duality of polarization modes 
is taken into account) to be coupled out, and the 
extent of coupling, and hence the loss, can be 
uniquely determined by the degree of bending. 

In the case of the multimode fibers, more com- 
monly used with such sensors, the situation is much 
more complex. First, bending causes mode conver- 
sion, in particular coupling of lower order modes 


(associated with rays in the core which travel at a 
small angle to the fiber axis) to higher order ones 
(associated with rays traveling at larger angles to 
the axis) and eventually to cladding modes. This 
“chain” of events, coupling energy from low order 
modes, via higher order ones, to radiation ones, 
depends in a complex way on not only how the 
fiber is bent, but also on which modes were initially 
present in the fiber in the section immediately 
before the bend. Because of the latter aspects, it is 
extremely difficult to derive quantitative informa- 
tion from multimode microbend sensors, particu- 
larly if several are cascaded along the length of a 
single fiber, as the bend condition of each one will 
affect the response of subsequent ones. 
Multimode microbend sensors have a particu- 
larly strong response if they cause the fiber to be 
periodically bent (Figure 11.2), with a particular 
spatial period that corresponds to the “zig-zag” 
period associated with the highest order rays that 
the fiber is capable of guiding [7]. These highest 
order rays are those striking the core cladding 
interface at the normal critical angle associated 
with Snell’s law of total internal reflection. Thus, 
a strong response can be achieved by pressing 
the fiber between corrugated plates, with off- 
set corrugations (or plates covered with parallel 
metal pins), which will periodically deform the 
fiber in a wave-like manner, with the appropriate 
spatial period (usually around 1-2 mm, for typi- 
cal multimode fibers) needed to ensure strong 
mode coupling. Under these conditions, very 
high losses can be induced in the fiber, even with 
a transverse displacement of only a few microns 
amplitude. However, the response is highly non- 
linear and care must be taken to avoid the pos- 
sibility of exceeding the long-term mechanical 
bend limitations of the fiber. Clearly with soft 
polymer coatings, such as acrylate, slow mechani- 
cal creep will be a problem, leading to variations 
of sensor response with time. Attempts have been 
made to improve the mechanical stability and 
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Figure 11.2 Schematic showing how light is lost 
at periodic bends in a multimode fiber. 
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Principle of the pressure-sensing cable, 
using microbend sensor. (Courtesy Herga 
Electric, U.K.) 


Figure 11.3 The pressure-sensitive cable of 
Herga Electric, first devised by Harmer. The 
spirally wound filament causes periodic bending 
of the inner glass fiber, when the outer cable is 
deformed by lateral pressure. 


reproducibility, particularly at higher tempera- 
tures, by using metal-coated fibers. 

Probably the most practically useful, albeit 
rather qualitative, microbend sensor is the dis- 
tributed cable version, first devised [8,9] by Dr. 
Alan Harmer at Battelle labs, Geneva. This con- 
sists of an optical fiber which has a thin polymer 
fiber thread helically wound around it, before the 
combination is sheathed within an outer polymer 


Spirally wound 
Kevlar thread 
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tube (see Figure 11.3). The spatial winding period 
of the helically wound fiber is designed to corre- 
spond with the mode-coupling length. When the 
outer tube is compressed from opposite sides, the 
fiber is deformed to cause high loss. This assem- 
bly would clearly also suffer from slow mechani- 
cal creep, but it finds most use as a cable to sense, 
essentially in an on/off manner, the presence of 
lateral force or weight on the cable. Applications 
are detection of the pressure due to a human 
foot (e.g., for safety mats near machinery or for 
intruder detection on fences or perimeters) or as 
a safety device to detect, and hopefully prevent, 
trapping of a hand or head in a sliding door or 
window (e.g., lift door, automobile window, and 
sun-roof). A variety of such cable and cable-in- 
mat sensors have been sold commercially by 
Herga Electric, UK [6]. 

A more recent development in microbending 
sensor technology [10,11] has been to produce 
a sensor to detect the presence of water around 
a fiber cable, for example, for the detection of 
floodwater in cable ducts. This is important, as 
water can freeze and break fibers, or can cause 
hydrogen generation by corrosion of metals. The 
sensor cable design (Figure 11.4) has a hard inner 
cylindrical core, coated with a hydrogel layer 
(so this hydrogel layer has a hollow cylindrical 
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Figure 11.4 Illustration of a cable designed for sensing presence of water, which also responds to 
areas of high humidity [11]. The hydrogel layer absorbs water and swells, causing the glass fiber to be 
pushed outwards. The outer spiral filament constrains it at regular spatial intervals, causing it to suffer 


periodic microbending, and hence loss. 
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geometry). A light guiding fiber is held against 
the outer cylindrical surface of this hydrogel layer 
using a helical plastic fiber to keep it in place. If 
the hydrogel layer gets wet, it swells dramatically, 
forcing the optical fiber outwards, except where 
held in place by the helical fiber. This periodi- 
cally deforms the fiber to make another form of 
microbending sensor, in this case one sensitive to 
water. The diagrams in Figures 11.2 through 11.6 
illustrate the microbend sensor, plus a number of 
other loss-based fiber sensors. 


11.2.1.1.3 Other intrinsic fiber-loss 
sensors 


There are several other types of intrinsic sensor 
that are based on attenuation in optical fibers. 

An attractive sensor [18] is the radiation 
dosimeter (Figure 11.5), in which the attenua- 
tion of the fiber is increased by ionizing radia- 
tion. Silica fibers are suitable for high levels of 
radiation, as they need large doses to suffer sub- 
stantial change, whereas lead-glass fibers have a 
significant response at lower levels. In all cases, 
however, the losses exhibit a time (and hence a 
dose rate) dependence and the losses also reduce 
after thermal annealing. This annealing can be 
significant even at room temperature. 

Another commercially useful sensor [13] for 
detecting leaks of cryogenic fluids (e.g., lique- 
fied natural gas or petroleum gas) was devised 
(Figure 11.6) using a type of commercially available 
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polymer-clad silica (PCS) fiber. The type of fiber 
used consisted of a pure silica core with a lower 
refractive index optical cladding, composed of 
silicone polymer material. When cooled to a tem- 
perature well below 0°C, the polymer shrinks more 
than the silica, due to its higher index of thermal 
expansion, and its refractive index rises to exceed 
that of the silica core. This causes the attenuation 
of the fiber cable to increase dramatically, so if light 
is launched at one end, there is a reduction in the 
detected light intensity at the output end, thereby 
indicating a leak of cold fluid at some point along 
the cable. 


11.2.1.2 PROPAGATION-DELAY OR TIME- 
DELAY SENSORS 


Propagation-delay sensors (Figure 11.7) allow the 
monitoring of physical or chemical effects by 
examining the time light takes to travel through or 
return from the sensing element. 

This element is usually the fiber itself. Its 
length can change if it is mechanically strained 
(delay changes arise due to change in physical 
length or due to refractive index changes aris- 
ing from the strain) or if it changes temperature. 
If the fiber is very long, it is possible to examine 
small changes in the arrival time of a short opti- 
cal pulse that has traveled through it. However, 
because of the extremely high velocity of light, it 
is difficult to perform such a measurement with 
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Figure 11.5 The optical fiber dosimeter. lonizing radiation causes increased attenuation in fibers, 
particularly lead-glass ones, causing the detected signal to reduce. 
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Figure 11.6 The cryogenic leak detector of Pinchbeck and Kitchen. Silicone-PCS fibers suffer attenua- 
tion when strongly cooled, because the cladding refractive index increases to equal or exceed that of 


the fiber core, preventing light guidance. 
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Figure 11.7 The concept of the propagation 
delay sensor. The delay is a function of both 
temperature and strain. In intensity-based 
sensors, the time delay has to be measured by 
giving the light some form of intensity modula- 
tion (Interferometric detection of delay will be 
discussed later.) 


high precision. An alternative way is to modulate 
the light source at a very high frequency (typically 
between 500MHz and a few GHz) and observe 
phase changes in the intensity modulation enve- 
lope caused by delay changes in the optical carrier 
signal. (Of course, by far the most precise method 
of monitoring is to launch a continuous lightwave 
through a length of fiber and examine the changes 
in optical phase due to the external influence, but 
this method comes into the area of interferometric 
sensors, which will be covered in detail in Section 
11.3 of this chapter.) 


11.2.1.3 EXTRINSIC INTENSITY-BASED 
SENSORS 


11.2.1.3.1 Light-collecting sensors 


An attractive type of extrinsic intensity-based sen- 
sor is one in which the fiber merely collects light 
from an external source and guides it to a detec- 
tor. The simplest example of this is a fiber used to 
check that a light bulb (e.g., a vehicle headlight) is 


operational. Fibers have also been used to moni- 
tor the temperature of hot bodies [14] such as tur- 
bine blades in jet engines, by collecting Planck 
hot-body infrared radiation and guiding it to a 
detector (remote pyrometry). Clearly, without 
care, there is a risk of surface contamination of the 
fiber. One alternative to solve this problem [15] is 
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Figure 11.8 Fiber pyrometer of Dakin and Kahn, 
with opaque coating or closed tube over fiber 
end tip. Such a long coating or tube acts as an 
excellent black-body source, and is the way com- 
monly used to approximate to a perfect black 
body. Lower curve shows how the detected 
signal rises dramatically with temperature. 
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to deliberately cover the fiber end with an opaque 
metal coating or sealed tube, and collect the energy 
emitted from the inside surface of the layer or tube, 
which produces an excellent approximation to a 
perfect black body (Figure 11.8). 

Later generations of such sensors are now 
used in commercial instruments. With such fiber 
pyrometers, the simplest approach is to measure 
total intensity. The signal conveniently varies 
with temperature, but even more dramatically 
than would be expected from the normal fourth 
power law for the total radiated power, as the fiber 
detector combination usually only allows trans- 
mission and detection of energy from the shorter 
wavelength, higher energy, “tail” of the Planck 
radiation curve, so the measurement accuracy 
is reasonable even with this simple approach. 
However, the accuracy can be improved [16] 
by monitoring at two or more wavelengths, to 
determine the effective color temperature of the 
source. This then reduces the power-referencing 
errors that can occur when simply measuring 
intensity on a single detector. 


11.2.1.3.2 External modulator sensors 


Many forms of external modulator can cause varia- 
tion of coupling between outgoing and return opti- 
cal fibers. The simplest is just a mechanical shutter 
between the two fibers, which moves to shut off 
light. Such simple shutters can act as safety inter- 
locks, e.g., for fire doors or hazardous areas, where 
only the safe condition allows light to be transmit- 
ted. Such sensors can be connected in series, such 
that light only passes if all are in a safe condition, 
making a logical “AND” condition to ensure all are 
safe. Input light can be modulated to ensure that it is 
not simply ambient light being collected by breaks 
in the system. In contrast, electrical switches can 
always short circuit or open circuit, particularly in 
corrosive chemical environments, with conductive 
fluids, or in seawater environments. 

A liquid level sensor devised by Pitt [17] 
involved connecting a small right-angled micro- 
prism to the end of a fiber, or of a pair of parallel 
fibers, such that light was reflected back by total 
internal reflection when the tip was in air or gas 
(Figure 11.9). When wetted by the liquid, the ret- 
roreflection was greatly reduced (as only weak 
Fresnel refection now occurs), indicating the liquid 
level has reached the position where the prism lies. 


An alternative, in transmissive mode, is a liquid- 
level sensor [18], made using fibers connected via a 
small gap, that is filled when the liquid reaches it, and 
monitoring the transmission at a wavelength where 
this liquid absorbs strongly (e.g., at 980 nm or 1.4m 
to monitor water). As the transmission change due to 
absorption can be very strong, particularly for water 
at 1.4m, and particularly because a second (unab- 
sorbed) wavelength of light can be used as an inten- 
sity reference signal, this sensor is potentially more 
“fail-safe” in nature than the prism type. 

A useful high-resolution proximity/distance 
detector (Figure 11.9) can be made simply from two 
or more parallel fibers, with their end faces termi- 
nated in a common reference plane [19,20]. One 
or more incident fiber is excited with light from a 
source, and the return fiber(s) collects the scattered 
or reflected light. Light exits the incoming fiber(s) 
in an illumination cone, of divergence angle deter- 
mined by the fiber numerical aperture, and the 
receiving fiber can only receive light that is reflected 
or scattered back from a point lying within a similar 
receiving cone. Ifa reflective point or surface is too 
close, no light is received by the return fiber, as these 
cones do not intersect. As distance to the reflective 
point increases, there is first a very steep increase in 
signal as the cones start to overlap, then a fairly rapid 
decrease, this time because return signals reduce 
due to the much smaller signal that can be received 
by a tiny fiber core located a long way away from 
the reflective point. A simple method is to use a split 
(bifurcated) bundle of many optical fibers, where 
light is launched through one set and received by 
the other set. The measurement end is where all the 
launch fibers and receiving fibers are combined, in 
a randomly distributed manner, across the polished 
end of the bundle. Such simple intensity-based sen- 
sors can monitor surfaces to resolutions of a micron 
or less, depending on the core sizes of the fibers, the 
nature and consistency of the reflecting surface they 
are monitoring, and the SNR of the optoelectronic 
detection system. This latter aspect of SNR is very 
important for many intensity-based sensors and 
will now be quantified for simple systems. 


11.2.1.4 CALCULATION OF SNR OF 
INTENSITY-BASED SENSORS 


It is useful to be able to calculate the signal to noise 
ration (SNR) expected for intensity-based sen- 
sors. Most low-frequency optical receivers used in 
sensors are based on P-instrinsic diode (PIN) or 
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Figure 11.9 The optical fiber proximity detector and its modification to produce a liquid level sensor. 
If care is taken to avoid the two-way ambiguity in the detected signal from a surface, it can also be 
used to obtain approximate distance measurement, but errors clearly arise if the reflective properties 
of the surface vary. Adding a prism to the fiber ends (lower picture) produces a liquid level sensor. In 
this latter case, immersion prevents total internal reflection and hence greatly reduces retroreflection 


by the prism. 


positive-intrinsic-negative silicon diodes, followed 
by a field effect transistor (FET) transimpedance 
amplifier to convert the photocurrent to an output 
voltage. The noise in such receivers is usually lim- 
ited by the Johnson noise in the feedback resistor up 
to a point where the output voltage reaches about 
50 mV, above which the receiver becomes shot-noise 
limited. Most practical intensity sensors operate in 
this shot-noise-limited regime, so we shall assume 
that this is the performance limitation here. 

If we assume the mean optical signal power 
received at the detector is P,,, and the fractional 
change in signal due to the effects of the mea- 
sured parameter is A (A is an intensity modulation 
index): 


Then the desired modulation component in 
the optical signal due to these is A-P,,, and 
the resulting modulation component in the 
detected photocurrent is A-R, P,.,, where R, is 
the responsivity of the detector (in A‘W"). 

The mean level of the detected photocurrent, I,,.an> 
is given by R,.Puct- 

The shot noise in the photocurrent is given by the 
standard formula: I,,=(2-q-Imean'B)°°, where q is 
the electronic charge and B is the postdetector- 
filter bandwidth of the signal. 

The SNR is, therefore, given by A-R, Pq, 

2:4 nec B)™. 

Manipulating, the SNR=A-R, P4...(2:q-Ry-PeB)°>. 

Hence, SNR=A-(R,,P4_4/2-q-B)™. 
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The SNR, therefore, increases in proportion to 
the modulation index, A, induced by the measur- 
and, and also in proportion to the square root of 
detected signal power. Reducing the detection 
bandwidth, B, improves the SNR in proportion to 
the square root of the factor by which the band- 
width is reduced. 


11.2.1.5 PROBLEMS WITH SIMPLE 
INTENSITY-BASED SENSORS 
AND THE NEED FOR SENSOR 
REFERENCING 


The use of intensity-based sensors is very attractive 
because of the ease of measuring the intensity (e.g., 
from the detected photocurrent in a silicon pho- 
todiode). However, simply measuring the output 
intensity from a single fiber containing a sensor 
can lead to a number of sources of error/uncer- 
tainty in the state of the sensor. The following fac- 
tors can affect the output signal and hence cause 
errors in the observed sensor reading: 


Light source variations 

Fiber lead and connector variations 
Variations in detector response 
Detector preamplifier noise 


In practice, the noise and variations in response of 
the optical receiver are not usually a major prob- 
lem when using silicon detectors at low frequency. 
However, typical changes in light source intensity 
can be significant and changes in the transmission 
of optical fibers and connectors can be a real prob- 
lem, particularly where there is mechanical defor- 
mation. In order to resolve the above problems, it 
is usually necessary, when making a quantitative 
measurement, to reference the sensor. This involves 
providing a reference optical signal, which, ideally, 
has not been changed by the measurand in the 
same manner as the sensing signal. 

The simplest method of referencing is based on 
the provision of more than one fiber path from 
the light source, leading to two separate detectors, 
with the sensing section included in only one of 
the paths. The sensing signal is then normalized 
to the reference signal by division. More complex 
systems are possible, with routing of signals via 
separate paths using couplers [21] or fiber switches. 
Alternatively, fiber paths of different length can 
be used to separate signals from a pulsed source. 
Here the separation occurs in the time domain, 


according to the propagation delay each signal has 
experienced in its separate optical path. 

In some cases, the need to measure the actual 
intensity can be avoided if the light is intensity 
modulated in a periodic way by the sensor head, 
and, rather than measuring the intensity, the 
quantity monitored is the frequency (rate) or time 
of modulation. Measurement of modulation fre- 
quency is not only easy, but it also avoids the need 
for sensor referencing, and the accuracy of mea- 
surement of cycles in a given period increases lin- 
early with the measurement period, provided no 
modulation-cycle counts are missed, so very high 
accuracy is possible over extended periods. Very 
simple optical tachometers (Figure 11.10) can be 
designed, using mechanical parts that give a peri- 
odic amplitude variation of light by virtue of varia- 
tions in their light-blocking (e.g., a light-chopper 
wheel with radial slots) or light-reflective (e.g., a 
cylindrical metal shaft, with flats polished on it) 
properties as they rotate. 

Another attractive method uses silicon reso- 
nator technology, of the type now used in many 
modern electronic transducers. Here, the modu- 
lator element is a tiny micromachined silicon 
structure, such as a flexible cantilever or bridge 
that is driven into mechanical oscillation (in con- 
ventional sensors, by electrostatic or piezoelec- 
tric actuation) and its period is measured. Many 
physical influences (e.g., temperature, mechanical 
tension, and pressure) can be arranged to inter- 
act and change the resonant frequency of such 
a structure, e.g., by changing the tension in the 
bridge element. The optical equivalent [22,23] 
is essentially the same concept, but the element 
is driven by light exiting from an optical fiber, 
which is modulated to cause periodic heating of 
the microstructure (Figure 11.11). 

This forms a primitive heat engine, which gives 
rise to periodic thermal expansion and contrac- 
tion of one side of the tiny silicon bridge (although 
photonic interactions have other means of causing 
mechanical changes in semiconductors), which 
drives this tiny element into resonant oscillation. 
The oscillation is of very low amplitude, unless 
the light modulation rate matches its resonant 
frequency. 

The small motions of the oscillating element 
are also monitored optically by changes in reflec- 
tion amplitude (possibly involving simple opti- 
cal interferometry to enhance the effects of small 
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Figure 11.10 The optical fiber tachometer devised by M Johnson. The rate of beam interruption 
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Figure 11.11 The silicon micro-resonator sensor, which is an optical equivalent of a sensor concept 
now common using conventional semiconductor technology. In the case of the optical sensor, excita- 
tion is by intensity-modulated optical radiation from the incident fiber. Detection of the resulting 
oscillations in the tiny silicon bridge can be performed by monitoring reflected intensity. 


displacements) and the frequency of modulation is 
swept so the resonance can be detected, and even 
electronically locked onto. Such sensors appear at 
first sight to be a dream come true, seemingly solv- 
ing the problems of measuring optical amplitude, 
and providing a “frequency-out” measurement 


that has very high precision due to the high res- 
onant frequency (typically ~100kHz) of such 
microstructures. Unfortunately, there is a subtle 
problem because the incident light heats the oscil- 
lating microstructure and so changes its resonant 
frequency in a manner depending on the incident 
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optical power. This gives the sensor an unfortu- 
nate degree of undesirable dependence on incident 
light intensity that still needs to be taken care of in 
design or operation of the sensor. 


11.2.1.6 SPECTRAL FILTERING SENSORS 


A particularly attractive method of sensor refer- 
encing against intensity changes is to use spectral 
encoding in the sensor, where the sensor head 
provides an optical filtering function. Then, at the 
receiver, the relative signal strength at two or more 
optical wavelengths is monitored. If the sensor can 
filter out just one narrow spectral band, having a 
central wavelength dependent on the measurand, 
then this is even more attractive, as this wavelength 
can then be monitored almost independently of 
the optical power level or intensity. 


11.2.1.6.1 Intrinsic spectral filtering 
sensors using doped fibers 


Rare-earth-doped fibers can change their spec- 
tral attenuation significantly with temperature, as 
the occupancy of internal electronic energy levels 
changes. These fibers can be used in short lengths or 
coils, as point temperature sensors, or simply as sen- 
sors designed to warn of fire or overheating affect- 
ing a section of what may be a much longer fiber. 


11.2.1.6.2 Fiber grating sensors 


Fiber grating sensors [24-26] use optically writ- 
ten structures in the light-guiding core of the 
optical fiber, which act as very narrow band opti- 
cal filters (Figure 11.12). These have a very nar- 
row band reflection spectrum (typically between 
0.1 and 1nm bandwidth) and usually have a peak 
back-reflection of between 5% and 100%. In trans- 
mission, they have a similar bandwidth, but act as 
blocking filters over a similar narrow band. 

The grating consists of a short region of fiber 
(typically 1-20 mm length, L), where the refractive 
index of the fiber core has been modified to cause 
it to exhibit a periodic variation with length in the 
axial direction. This variation is achieved by lateral 
illumination of a fiber with two converging beams 
of UV light (Figure 11.12a), either created with 
beam-splitting optics or created by diffraction of a 
single beam in a phase mask. The beams interfere 
to give fringes, with a predesigned spatial variation 
of optical intensity, depending on the wavelength 
and angle of convergence. This bright and dark 


fringe pattern gives rise to a corresponding refrac- 
tive index variation in the photosensitive fiber core 
as a result of a photorefractive effect. This effect 
is small, yet significant in germania-doped silica 
fibers. Even a very small periodic refractive index 
variation can build up coherently to cause a very 
significant (even close to 100%) reflection of light at 
the Bragg wavelength, Aj... This is the wavelength 
at which each low-intensity wavelet, reflected from 
each minor undulation in refractive index, can add 
coherently with all the others from other parts of 
the grating, and is given by 
Maas =2- Neg A, 

where n,, is the effective refractive index of the 
fiber core and A is the grating period. 

It is this need for this phase matching, or con- 
structive addition of back-reflected wavelets, in 
order to give significant reflection, that results in 
the desired narrowband filtering effect; the longer 
the grating, the narrower the filter effect and the 
higher the grating reflection coefficient for a given 
index change. 

The in-fiber grating is an excellent sensing 
element [27] as it is tiny, and even without exter- 
nal components, and can be configured to sense 
mechanical strain influences. As with traditional 
electrical resistance strain gauges, the wavelength 
of the grating depends not only on strain, but also 
on temperature, so there is a need to measure or 
compensate for temperature. The review on grat- 
ing sensors by Kersey [28] and the textbooks 
in the bibliography cover many ways in which 
this may be done, but the simplest method is to 
have another (unstrained) grating as a reference 
thermometer. If it is merely required to monitor 
bending of a thin plate, then it is possible to bond 
identical gratings on opposite sides of the surface 
and simply monitor the differences in wavelength 
shifts observed [29]. 

From Reference [27] a typical thermal sensitiv- 
ity of a grating with 1300 nm center wavelength 
is of the order of 0.01 nm °C~, with the frac- 
tional change per unit temperature being typically 
~8.2X10-° °C at all wavelengths. The fractional 
response of center wavelength to strain is typically 
of the order of 75% of the strain. 

A major application area is in monitoring 
[30,31] of composite materials (with the grating 
embedded in the composite) or of large mechanical 
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Figure 11.12 The in-fiber Bragg grating sensor. This shows (a) how a fiber grating can be written using 
interference between converging beams of light, (b) how a grating can be bonded onto the surface 
of a structure to be monitored (embedding into composite materials is also common) to form a strain 
sensor, and (c) the response of the sensor as the structure is strained. 


or civil engineering [32] structures, with the grat- 
ings embedded in grooves or bonded to the sur- 
face (Figure 11.12b) or attached in surface patches. 
With appropriate interrogation systems, and with 
mechanical amplifiers, grating sensors are even 
being considered for optical hydrophones to detect 
weak acoustic signals in the sea. Clearly, there are 
many physical parameters that can be arranged 
to give a strain in the fiber (e.g., hydrostatic pres- 
sure or depth and magnetic field). Although they 
are only optically written, the gratings have been 
shown to withstand elevated temperatures of sev- 
eral hundred degrees Celsius, but of course, the 


wavelength varies with temperature when heated 
or cooled. 

In order to sense the grating wavelength, 
many interrogation methods [31,33-37] have 
been devised. The most common method is to 
launch light from a broadband source (LED or 
fiber superluminescent source) into the fiber lead 
to the sensor and then detect the peak reflective 
wavelength of light reflected from the in-fiber 
grating, using some form of spectrometer (Figure 
11.12a and b). The most common types used are 
fixed diffraction-grating/ charge-coupled-detector 
(CCD) spectrometers, which usually require some 
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form of fiber depolarizer to avoid errors [37] due 
to polarization effects, or scanned narrowband fil- 
ters (e.g., using Fabry-Perot, as in Reference [28] 
or acousto-optic tunable filters, as in Reference 
[36]. Systems where another grating is simply 
stretched to form a filter that can be tuned to track 
the sensor grating have been reported by Jackson 
et al. [33]. There are also simpler passive methods 
where the wavelength change is converted to an 
intensity change using some form of optical filter 
having a high slope of transmission versus wave- 
length. Examples include interference filters and 
wavelength-selective fused fiber couplers [31,38], 
of a type often used for separation of two closely 
spaced wavelengths. Ideally here, the light is split 
into two channels, one increasing in intensity, the 
other decreasing, as wavelength is increased, so 
the two complementary outputs can be ratioed to 
remove the common mode intensity changes that 
could occur due to variations in the light source, 
fiber leads or connectors. 

Apart from the methods where the grating 
merely acts as a filter, it can also be used as sensor, 
in a configuration where it forms part of an active 
fiber laser [35]. There are two basic types here, the 
first being where a conventional in-fiber grating 
(the sensor grating) acts as one mirror of an opti- 
cally pumped rare-earth-doped fiber laser and the 
other mirror is arranged to be broadband. As the 
grating is stretched (or heated), changes in its Bragg 
wavelength cause the laser output wavelength to 
change. The advantages are that the linewidth is 
much smaller, giving improved spectral and strain 
resolution, and the optical output power is much 
larger than for the alternative simpler configura- 
tion with filtering ofa broadband source. A second, 
more compact, form of the same idea is achieved 
using in-fiber distributed feedback Bragg lasers 
(DFB fiber lasers) as the sensing element. Here 
the active rare-earth-doped lasing region forms 
part of the actual Bragg grating sensing element. 
The resolution of such sensors is nothing short of 
dramatic, as they oscillate at a frequency around 
1014 Hz, yet they can be arranged to be monitored 
to 1Hz resolution, using “mixing-down” by beat- 
ing or heterodyning with a reference laser, giving 
a potential resolution of 1 part in 1014. More usu- 
ally, the beat frequency between two orthogonally 
polarized lasing modes has been monitored, as this 
is more easily used, being typically in the range 
of 0.5-2GHz, where the beats can be monitored 


directly using a fast optical detector [39,40]. These 
publications also discuss how thermal compensa- 
tion is possible using monitoring of both the actual 
laser frequencies and the beat frequency between 
two mixed lasing modes. 


11.2.1.7 EXTRINSIC SPECTRAL FILTER 
SENSORS 


Many types of extrinsic spectral filtering sensor 
have been constructed [41]. One of the first meth- 
ods [42] was to use a material that changes its spec- 
tral transmission characteristics with temperature 
(Figure 11.13). 

There are many examples of such materials (e.g., 
crystal, glass, or polymer). As one attractive group 
of materials, most semiconductors have steep trans- 
mission band edges, beyond which they suddenly 
become reasonably transparent. At the band edge 
of all common semiconductors (e.g., Si, Ge, and 
GaAs), both the transmission versus wavelength 
slope and the wavelength of 50% transmission vary 
significantly with temperature. There is also a large 
family of commercially available long-pass optical 
filter glasses (e.g., Schott and Corning glass com- 
panies) with semiconductor-like optical behavior. 
Clearly, all of these can be used to make practical 
and stable sensors, and many low-cost spectrome- 
ters are now available to interrogate them. They are 
attractive for monitoring in remote or inaccessible 
locations, or in areas of high E-M field. 

A class of spectral-filtering sensors is based on 
fiber-coupled versions of common bulk optical 
spectrometer components, such as Fabry-Perot 
filters (closely spaced mirrors), diffraction grating, 
zone-plate, or prism spectrometers or monochro- 
mators. The Fabry-Perot filter is particularly use- 
ful, in view of its small size and narrow linewidth, 
and the ability to coat fiber end surfaces to make a 
tiny version from a short fiber section with reflec- 
tive end coatings [43]. 


11.2.1.8 EXTRINSIC SENSORS USING 
MONITORING OF FLUORESCENCE 
SPECTRUM 


Apart from using simple transmission measure- 
ment, the fluorescence spectrum of some materials 
can be monitored, a method again well suited to 
the use of direct-bandgap semiconductor crystals 
[44] in the sensor probe. However, in this mode, 
there is now the possibility of using translucent 
materials, such as the types of phosphors used in 
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Figure 11.13 An optical fiber thermometer, where the thermal dependence of the band-edge posi- 
tion of a semiconductor (or special filter glass with semi-conductor-like properties) is monitored with a 


simple spectrometer. 


fluorescent lights and in TV display tubes, which, 
with the aid of a directional coupler, can be con- 
veniently monitored in back-“scatter” mode via a 
single fiber lead. This fiber-coupled configuration 
(see later Figure 11.15b) will be described again, 
when we discuss chemical sensors. 

Decay lifetime sensors can sense various param- 
eters by monitoring the time delay after excitation 
of the sensor probe material. Excitation is either 
by a short optical pulse or by a repetitive pulse 
train, or using modulated light having some other 
(e.g., sinusoidal) periodic intensity variation. The 
most common mechanism is to look, directly or 
indirectly, at the time decay of fluorescence in the 
probe (Figure 11.14). 

The fluorescence decay curve, which is usu- 
ally of an exponential shape, can be monitored, 
to determine the lifetime. Alternatively, the phase 
delay in the detected fluorescence intensity signal, 
relative to the initial intensity modulation wave- 
form (usually sinusoidal or square-wave) of the 
incident light, can be monitored. 

Several practical physical and chemical sensors 
use this mechanism, and it has proved particularly 


useful for temperature monitoring. Thermal 
changes affect the fluorescent lifetime of many 
substances (e.g., phosphors, semiconductors, laser 
crystals, or glasses), but phosphors [45] and ruby 
crystals [46] have been used most for thermal sens- 
ing (see also review by Gratton and Palmer [47]). 
Many of these materials have been used extensively 
for long periods under conditions of energetic elec- 
tron bombardment or powerful optical illumina- 
tion and have excellent long-term stability. Laser 
crystals, in particular, are not only optically and 
thermally stable, but also often (e.g., ruby) have 
excellent mechanical hardness and strength. 


11.2.2 Intensity-based chemical 
sensors 


There are two basic types of optical chemical sen- 
sor, one using direct optical spectroscopy of mate- 
rials to be detected, the other making use of a 
chemical indicator, i.e., a compound that acts as an 
intermediary, with a strong, hopefully chemical- 
species-specific, change in its optical properties 
when exposed to a target chemical or group of 
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Figure 11.14 Sensor to measure temperature, where the decay of fluorescence of a phosphor (or 
semiconductor chip), attached on the end of an optical fiber, is monitored. The lower curves show 
typical responses, following pulsed excitation from the light source. 


chemicals. Below, we shall cover these two types, 
but, in this short outline, it will not be possible to 
cover all the concepts concerned, so the interested 
reader is referred to the textbook in bibliography 
[1] (particularly vol. 4, Chapters 7 and 8) for fur- 
ther reading. For reasons of lack of space, the dis- 
cussion of chemical sensors here will be shorter 
than that for physical sensors. 


11.2.2.1 CHEMICAL SENSORS USING DIRECT 
OPTICAL SPECTROSCOPY 


Because materials can absorb or emit light only 
at wavelengths corresponding to allowed internal 
energy level changes, characteristic of particular 
elements or functional groups, spectroscopy has 
become one of the most valuable tools of the ana- 
lytical chemist. Most of the standard spectroscopic 
techniques used by analytical instrument design- 
ers can also be implemented in fiber optic form (see 
examples in Figure 11.15). 


Possible methods include measurement of 
transmission and turbidity, attenuated total reflec- 
tion, fluorescence, Raman scattering (including 
surface-enhanced Raman scattering, or SERS), to 
give just a few examples. The great advantage of 
fibers is the real-time, online measurement capa- 
bility, allowing the instrumentation to be kept in 
a benign environment, away from the probe end, 
that may be remote or inaccessible, and could have 
any variety of dangerous, corrosive, toxic, or flam- 
mable materials present. 


11.2.2.1.1 Transmission (absorption) and 
turbidity measurements 


Transmission defines the fraction of light passing 
through a component or medium. Absorption is 
the total loss of light arising from conversion of 
light to heat, whereas turbidity is where light is 
merely lost from a collimated path by scattering, 
such that it is no longer collected. 
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Figure 11.15 Optical fiber sensors for remote spectroscopy. This shows (a) transmission spectroscopy, 
where, both the source and interrogator unit can be an integral part of a commercial spectrophotom- 
eter, modified to allow fiber extension leads; (b) sensing of backscattered signals, where, a filter or 
spectrometer can be used to select out Raman or fluorescent signals from the incident light if desired; 
and (c) a modification to allow a single fiber lead to the measurement head. 


The power, P(A), transmitted through a sample 
in a small wavelength interval at a center wave- 
length A, is given by Lambert's law: 


P(A)=By(A)-exp[-cx(A)- 6} 
where P,(A) is the power entering the sample in this 


wavelength interval, a(A) is the attenuation coef- 
ficient of the material at wavelength A, and ¢ is the 


optical path length through the sample to the point 
at which P(A) is measured. Lambert’s law does not 
apply if scattering is high, such that multiple scat- 
tering occurs. 

The sample can be said to have a transmission 
T(A), at the wavelength A, where 


T (A) =exp| —a(A) 0]. 
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Alternatively, the sample can be said to have an 
absorbance A(A), where 


A(A) = logioL1/T (A) |= logiol Py (2) /P (A) }=0.4300(2)- 2. 


The factor 0.43, or log(e), has to be included to 
account for the use of logl0 for absorbance cal- 
culations, whereas natural exponents are used for 
attenuation coefficients. 

Transmission, absorption, or turbidity mea- 
surements can be achieved most easily by using 
a commercial spectrophotometer with extension 
leads. These have a unit that fits into the cell com- 
partment of a standard instrument, with a first lens 
that takes the collimated light that would normally 
pass through the sample chamber, focuses it instead 
into a large core diameter (usually >200 pm) optical 
fiber down-lead and with a second lens that recolli- 
mates light coming back from the return fiber lead 
into a low-divergence beam suitable for passage 
back into the instrument. There is also a remote 
measurement cell, connected to the remote end of 
both these fiber leads, where a first lens collimates 
light coming from the down-lead into an inter- 
rogation beam, which passes through the remote 
measurement cell, after which a second lens col- 
lects the light and refocuses it into the fiber return 
lead going to the spectrometer instrument. Such 
optical transformations lead to inevitable losses of 
optical power, of typically 10-20 dB (equivalent to 
losing 1-2 units of absorbance), but as most mod- 
ern instruments have a typical dynamic range of 
>50 dB, this is a price that many users are prepared 
to pay for a useful remote measurement capability. 

It should be noted that the optical power losses 
occur mainly due to the imperfections of the focus- 
ing and recollimation optics and due to reflection 
losses at interfaces, rather than to fiber transmis- 
sion losses. If suitably collimated beams were to be 
available in the instrument, if large core diameter 
fibers could be used to connect to and from the 
probe, and if all optics, including fiber ends, could 
be antireflection coated, there should really be very 
little loss penalty. Such losses, therefore, arise pri- 
marily because of the need for the fiber leads to 
be as flexible as possible (so hence choice of small 
diameter fibers) and the usual need to compromise 
design on grounds of cost. 

There are many other probe head designs 
that are possible. The simplest design, for use 


with measurement samples showing very strong 
absorption, is simply to have a probe which holds 
the ends of the down-lead and return fiber in axial 
alignment, facing each other across a small mea- 
surement gap, where the sample is then allowed to 
enter. Losses are low for fiber end spacing of the 
same order as the fiber diameter or less but rapidly 
increase with larger gaps. The probe is far easier to 
miniaturize and handle if the fiber down-lead and 
return lead are parallel in one cable. This can be 
achieved using a right-angled prism or other ret- 
roreflecting device to deflect the beam in the probe 
tip through the desired 180° that allows it to first 
leave the outgoing fiber, pass through a sample and 
then enter the return fiber. Use of a directional 
fiber coupler at the instrument end allows use of 
a single fiber, but then any residual retroreflection 
from the fiber end will be present as a cross talk 
signal, adding light signal components that have 
not passed through the medium. Clearly, there are 
many variants of such optical probes, some involv- 
ing more complex optics (e.g., multipass probes), 
some constructed from more exotic materials to 
withstand corrosive chemicals. 

A very simple option that has often been used 
with such single fiber probes, for monitoring the 
transmission of chemical indicators, is to dissolve 
the indicator in a polymer that is permeable to 
the chemical to be detected, and also incorporate 
strongly scattering particles in the polymer. When 
a small piece of such a polymer is formed on the 
fiber end, the particles give rise to strong backscat- 
tered light, and the return fiber guides a portion 
of this to the detection system. This backscattered 
light had of course to pass through the indicator 
polymer in its path to and from each scattering 
particle, so the returning light is subject to spec- 
tral filtering by the indicator. Although this is a 
very lossy arrangement, it is extremely cheap and 
simple and has formed the basis of many chemical 
sensors, for example ones using pH indicators [48]. 

There are now many commercial types of min- 
iature CCD spectrometers that have been specially 
designed to analyze the spectrum exiting from an 
optical fiber. These generally use the light-guiding 
core of the return fiber as the input “slit” of a dif- 
fraction grating monochromator, using a sensitive 
CCD detector array to provide a set of parallel out- 
put signals, one from each narrow-band spectral 
component of the received light. These have high 
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optical efficiency, as nearly all the energy is incident 
on the detector. For ultimate resolution, however, 
the well-known Fourier transform spectrometer 
principle can be used, where a scanned optical 
interferometer is used to analyze the spectrum of 
the received light. Each optical spectral compo- 
nent gives its own electrical frequency component 
in the detected photocurrent signal, observed as 
the interferometer is scanned (a narrowband laser 
signal input will give a pure sinusoidal output from 
the scanned interferometer). The entire spectrum 
can, therefore, be obtained by Fourier analysis of 
the temporal variations of the detected signal, to 
extract the relative magnitude of each frequency 
component in this detected signal. 


11.2.2.1.2 Chemical sensing by 
detection of fluorescence and 
Raman scattering 


It was stated above that, for transmission and tur- 
bidity measurements, there should really be only 
a small additional loss penalty due to using opti- 
cal fibers, provided full use is made of expensive 
precautions to avoid loss of light. This is unfortu- 
nately not true when fluorescence or Raman mea- 
surements are required, as light is scattered over all 
angles, and the return fiber can only ever collect a 
relatively small portion of this light. In addition, 
the processes where this inelastically scattered (i.e., 
different wavelength to that incident) light is gen- 
erated, often have a poor quantum efficiency (i.e., 
have a low ratio of total re-emitted to total incident 
photon flux). 

An arrangement similar to that of Figure 11.15b 
has been used for oxygen sensing, using Ruthenium 
dye complexes [49], where the fluorescence decay is 
quenched by the oxygen gas. 

There is no space here for a full discussion of 
these processes, but their potential value, particu- 
larly that of Raman scattering, for chemical sens- 
ing, has meant workers will continue to persevere 
to get useful performance, despite the low return 
light levels encountered with fiber-coupled sys- 
tems. Both these mechanisms involve excitation of 
a sample with light, usually at a wavelength shorter’ 


* Measures to reduce this will be discussed later, but one 
useful way to improve an OTDR system is to include 
optical fiber amplifiers to boost the launch power and 
to act as an optical preamplifier for returning signals. 


than the scattered light to be observed, and then 
the re-emitted light is collected and narrow-band 
filtered. This filtering is first to separate it from the 
incident light, but also, in the case of Raman, to 
examine it for the spectral features characteristic 
of a target compound to be examined. 

It is useful to briefly estimate the approximate 
magnitude of additional losses when using fibers 
with these processes. The loss due to launching 
of excitation light into a fiber is usually negli- 
gible with Raman, as powerful narrow-line laser 
sources are used, but ultimately the limit may 
be set by nonlinear processes or, in the case of 
large-core multimode fibers, by optical damage 
thresholds. Similar excitation can be used for 
low-level fluorescence monitoring, provided no 
photo-bleaching or other photodegradation of the 
monitored substance can occur at high illumina- 
tion intensity. The main potential loss is therefore 
that of light collection. If we assume the medium 
is excited close to the end of a fiber, only a region 
of dimensions of the same order as the fiber core 
diameter will be intensely excited. Thus, if a very 
large 200 pm core diameter fiber is used, a region 
of approximately 200 to >500 pm in length will be 
excited effectively. As a re-emitting point in the 
excited medium gets further from the fiber tip, 
the brightness of its illumination reduces and the 
effective angle subtended by the fiber core collect- 
ing the light (which can be approximately consid- 
ered to be re-emitted in all directions) gets smaller. 
The collection can, therefore, be approximated as 
that received by the fiber numerical aperture (say 
NA of 0.3) from a region around 200m long. In 
a normal Raman spectrometer, the sample can be 
excited by a narrow focused laser beam. There can 
therefore be a useful and collectable Raman emis- 
sion from a thin sample region, of length equal 
to that of the spectrometer input slit on which 
it must be focused. The length of the slit may be 
perhaps 10mm long, and light entering this may 
perhaps be collected with a wide acceptance angle 
monochromator, perhaps having an NA as high 
as 0.4. Thus, the fiber-based system may per- 
haps have a light collection reduced by a factor 
of 50x 16/9 (product of the useful excited length 
ratio and the square of the acceptance NA ratio), 
or approximately 90 times, when compared to a 
bulk optical system. A similar factor also applies 
to fluorescence detection. 
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Apart from these photometric limitations, 
Raman scattering has a particularly poor quan- 
tum efficiency, and the already weak scattered 
signals will be typically two orders of magnitude 
weaker when coupled into return fibers, with their 
poor collection efficiency for divergent light (see 
bibliography [1], Chapters 7 and 13). Despite this, 
however, several commercial fiber-coupled Raman 
systems are available. These combine the great 
advantage of Raman scattering (which, merely by 
careful spectral filtering, allows rejection of elasti- 
cally scattered light from turbid samples) with the 
ability of fibers to probe into inaccessible, remote, 
or hazardous environments. 

A few practical examples of chemical sensing 
using direct spectroscopy are given in vol III. 

There are many more means of performing 
direct spectroscopy with optical fibers, as the above 
has only presented a few. Other methods will be 
discussed in the later section on distributed and 
multiplexed sensing. As space is limited here, it is 
instructive to summarize, in the form ofa brief list 
of examples, where systems have been built for prac- 
tical applications. In many cases, active research is 
still being pursued in many of these areas: 


e Sensors examining transmission of groundwa- 
ter to track pollution. 

e Fiber-probe sensors to examine the transmis- 
sion or reflection spectrum of blood to deter- 
mine oxygenation state. 

e Fiber-probe sensors to examine bilirubin in the 
digestive system of the body. 

© Gas sensors, based on remote absorption mea- 
surements, using fiber probes. 

e Sensors examining the transmission of petro- 
chemicals to determine octane rating. These 
can be extrinsic sensors as in Figure 11.15a or 
can use evanescent field monitoring. 

e Fluorescence sensors to determine oil or other 
aromatic hydrocarbons in water. 

e Technologies using arrays of optically addressed 
“microdots,” each dot in the array having a 
different composition. For example, arrays 
of fluorescent indicator dots, each dot having 
different optical properties, can detect several 
different substances using one fiber (or can give 
multiple signals to cancel cross talk from other 
substances, using mathematical regression 
models). 


e Methods using evanescent wave coupling to the 
measured substance, for example, with a bare 
glass or silica fiber. Alternatively, a conventional 
monomode fiber with part of the cladding pol- 
ished away, or a “D-type” fiber that has a very 
thin cladding on one side, can be used. In such 
sensors, the field of the guided light extends 
beyond the vitreous waveguide into the chemi- 
cal to be measured, allowing absorption of the 
latter to be observed as fiber attenuation. 

e Technologies as above, but using surface plas- 
mon resonance to enhance the coupling to the 
evanescent field. 

e Refractive index sensors, where a fiber with 
Bragg gratings is side polished, removing 
cladding material to allow the evanescent field 
to extend outside the fiber. Then, the peak 
reflected wavelength of the grating is affected 
by the effective propagation constant of the 
fiber, which is now also a function of the 
refractive index of this surrounding medium. 
Again, octane rating of fuels is of interest here. 


The textbooks in bibliography [2] give a very com- 
prehensive overview of many of these types of opti- 
cal chemical sensors, and a few applications are 
presented in vol III. 
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We shall now discuss interferometric sensors. 
Here, the sensing action involves the interaction of 
fiber-guided light beams, where there is coherent 
addition of the electric field components of their 
electromagnetic waves. This leads to a mixing con- 
dition that can vary from constructive to destruc- 
tive interference, depending on the relative phase 
of the combining light beams. In order to obtain 
high visibility interference of interfering free-space 
beams, it is necessary to match their intensity pro- 
files, their wave front shapes (i.e., beam direction 
and divergence) and their polarization states over 
the full transverse width of the interfering beams. 
In order to observe or detect the effects of interfer- 
ence, however, the light beams must also eventu- 
ally fall onto a “square law” optical detector. All 
standard optical detectors, including the human 
eye, monitor optical power or intensity, which is 
proportional to the square of the electrical field 
component, hence the term “square law.” 
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11.3.1 Interferometers using fiber 
optic technology 


A single mode optical fiber guides not only one 
propagation mode, but also strictly two, if the two 
possible orthogonally polarized modes are con- 
sidered. The fundamental guided mode is usually 
called the HE,, mode. This fiber mode has quite a 
simple transverse power distribution, very closely 
matching the well-known Gaussian TE,, intensity 
profile, typical of that of the beam from a single 
transverse mode gas laser, with a central (axial) 
peak in the intensity. The big advantage of mono- 
mode fibers is that (apart from the possibility of the 
two different principal polarization modes) only a 
single spatial mode is allowed, the fiber acting as 
a perfect mode filter to ensure that the two over- 
lapping fields have identical spatial characteristics. 
The slight curvature of E-M field lines in the fiber, 
varying a little with transverse position, does not 
significantly reduce fringe visibility, as the other 
interfering signal (or signals) guided in the same 
fiber have a matching curvature. 

Because of this behavior, we can fabricate 
interferometers using single mode fiber to define 
the optical paths and using compact fiber coupler 
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components as beam splitters or combiners [50]. 
In Figure 11.16a and b, we show a Michelson and 
a Mach-Zehnder type interferometer, respectively, 
both implemented with single mode fibers. 

Unfortunately, there is a big disadvantage 
when light is guided by an ordinary single-mode 
telecommunications-type fiber, as, unlike the situ- 
ation for free-space, or in-air, beams, the polar- 
ization direction of the HE,, mode can easily be 
changed by environmental influences. For these 
to change the polarization of fiber-guided light, 
they need only to cause a significant asymmetri- 
cal physical distortion of the fiber. Such influences 
can occur due to, for example, fiber bends, lateral 
mechanical stresses, and transverse thermal gradi- 
ents. In the extreme case of this so-called polar- 
ization fading, the visibility of the interference can 
fade completely, as orthogonally polarized guided 
beams cannot interfere. This is simply because 
electric fields at 90° can no longer cancel the resul- 
tant intensity now being independent of optical 
phase of the combining beams. 

To compensate for possible signal fading due to 
this polarization fluctuation, it is common to use 
polarization controllers (PCs) when using ordi- 
nary single mode fibers. However, in real-world 
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Figure 11.16 Illustration of two commonly used fiber interferometers: (a) Michelson arrange- 
ment, (b) Mach-Zehnder arrangement, and (c) the typical detector response to phase difference. 
Note that the zero position on the phase-difference scale, where intensity would be at a peak, is 


unknown in this figure. 
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sensors, it is not attractive having to have to con- 
tinually adjust polarization in order to compen- 
sate for environmental effects, unless the PCs are 
themselves controlled by automated optoelectronic 
feedback systems. Although the latter is possible, 
it is still a rather complex and expensive solu- 
tion. Fortunately, it is possible to greatly reduce 
such effects with passive solutions. One method 
is to use polarization-diversity optical receivers, 
where the combined light is first split into differ- 
ently polarized components before detection on 
several separate detectors. A simpler and far more 
efficient passive solution, however, is to use polar- 
ization-maintaining fiber (P-M fiber). In this fiber, 
the direction of the linear polarization of the HE,, 
mode is constrained to lie along one of its princi- 
pal polarization axes, as the lightwave propagates 
through it. Such a fiber is made with a deliberate 
transverse asymmetry, such that either of the two 
principal polarization modes, once launched, can- 
not readily couple to the other, unless there is a 
very severe localized deformation of the fiber. An 
even more effective method is to use single polar- 
ization fibers, which are polarization-maintaining 
types where one polarization mode is made to be 
highly lossy, such that only the other mode can 
effectively propagate without high attenuation. If 
any small degree of polarization mode conversion 
then takes place in this fiber, the light in the unde- 
sired mode is rapidly lost. 

In the case of a dual-reflective path, optical fiber 
Michelson interferometer, we have another ingenious 
way to stabilize for any in-fiber polarization fluctua- 
tion. To achieve this, Faraday rotator mirrors (FRM) 
are placed at the end of the fibers. A Faraday mir- 
ror device has the property of ensuring the reflected 
lightwave has a returning SOP rotated by 90° with 
respect to the incident wave. Consequently, after a 
two-way propagation along the fiber, the SOP of the 
lightwave always remains orthogonal to the incident 
one, even though the actual SOP of each beam along 
the fiber will always fluctuate. If a wave is split, by 
a coupler, into two arms, each having Faraday mir- 
rors at their far (distal) ends, both returning beams 
or guided modes return with orthogonal polariza- 
tion states to that of the launched light, so both have 
identical polarization states. 

In a fiber interferometer, the detected output, I,, 
is given simply by 


Iq =A+BcosA0, 


where A and B are constants and A@ is the phase 
difference between the two lightwaves returning 
from the reference and the sensing arms. The out- 
put intensity changes, as a function of phase differ- 
ence, AO, are shown schematically in Figure 11.16c. 
In Figure 11.16a or b, a relatively large phase drift 
can always be caused by temperature or mechani- 
cal fluctuation, which results in the presence of 
low-frequency components, or drift, in AO. 

For many sensors, such as acoustic wave (e.g., 
fiber hydrophones) or vibration monitors, it is 
desired to monitor or measure very tiny phase 
changes, usually of a cyclic or transiently varying 
nature, caused by the external measurand, typi- 
cally having a low- to mid-AC frequency content 
in the range 10 Hz to 10 KHz. In order to measure 
these small, relatively rapidly changing signals, 
other sources of slower drift, for example, due 
to slow thermal changes or due to slowly vary- 
ing mechanical strain, must be corrected for or 
stabilized, so as to keep the interferometer at its 
most sensitive operating point. This is the maxi- 
mum slope of the sinusoidal response, termed the 
quadrature point. 

To achieve the desired stabilization, an optical 
phase shifter (PS) can be placed in the reference 
arm, as shown in Figure 11.16a and b. In these 
stabilized interferometers, the detector output 
is fed-back to the PS, so that the output signal is 
held at quadrature point A, where the slope of the 
curve is steepest. In this way, tiny alternating opti- 
cal phase signals, as small as 10~* rad or less, can 
be detected. At an optical wavelength of 1 jim, this 
represents an optical path length change of only 
10-“™, corresponding to a change in the physi- 
cal length of the sensing fiber by a tiny amount, 
corresponding to a small fraction of the diameter 
of a hydrogen atom! (Of course, individual atoms 
at the fiber ends have positional uncertainties 
greater than this, but the end position is the aver- 
age of enormous numbers of such atoms). This 
means that fiber interferometric sensors can real- 
ize extremely high sensitivity and this sensitivity 
can be further increased by using a multiturn fiber 
coil as a sensing element. 

For many applications, it is convenient to 
place an optical frequency shifter (FS) in one 
arm to deliberately induce a frequency difference 
between the two lightwaves. This is also shown in 
Figure 11.16. Under these conditions, the detected 
output, I,, can be expressed as 
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I, = A+Bcos(2nAft + AO). 


This new configuration is called a heterodyne inter- 
ferometer, a term which implies that the interfering 
lightwaves now have different frequencies. The phase 
of the detected electrical signal can now be detected 
by conventional phase or frequency demodulation 
schemes, of a type commonly used in commercial 
or domestic radio receivers. One such electrical 
frequency demodulation method is to delay one 
signal compared to the other and use an electronic 
multiplier or mixer, followed by a low-pass filter, an 
arrangement that conveniently converts the elec- 
tronic phase difference of two electronic signals to 
amplitude, with a linear-saw-tooth response char- 
acteristic. Another common method, again well 
known to radio engineers, is to use phase-lock loop 
demodulation technology. In order to distinguish 
the earlier, rather simpler, interferometer (in which 
the two lightwaves had the same frequency), from 
the heterodyne, or difference frequency one, we have 
just discussed that the earlier one has been termed a 
homodyne interferometer. 

In the interferometer shown in Figure 11.16a 
and b, the optical angle-modulation elements, ie., 
the PS or FS modulators, are placed in the refer- 
ence arm, which is sometimes located adjacent 
to or near the sensing arm to assist with thermal 
compensation. If the electronically driven PS or 
FS devices are placed in, or near, the physical mea- 
suring environment, they might pick up electrical 
interference, which would reduce the advantage 
that an optical fiber sensor is normally not affected 
by, nor induces electromagnetic noise. This would 
remove the inherent electromagnetic compatibil- 
ity of the sensor. To improve the electromagnetic 


Tunable laser 


behavior, a configuration shown in Figure 11.17 
has been proposed, where, rather than using a sep- 
arate modulator element, a frequency modulation 
is created in the laser light source, which then, due 
to different optical time delays, creates a phase dif- 
ference AO,, between lightwaves that have traveled 
over different length optical paths. 
This then gives a phase difference expressed by 


where AL is the optical path length difference 
between the two arms. The lasing frequency of 
a semiconductor diode laser can conveniently 
be changed merely by modulating the drive 
or injection current, providing a very simple 
current-controlled optical oscillator. By using this 
property, we can compensate for the effects of slow 
thermally or mechanically induced drift in the 
interferometer, without the need for any separate 
PS or FS. Using this configuration, there is then no 
need for any electronic or electrical components 
in the sensing environment, giving excellent free- 
dom from electrical interference, etc. When the 
laser frequency is modulated by a sinusoidal wave 
of frequency wp, we induce a set of components at 
frequencies m Xp in the detector output, where m 
is an integer. Each of these components has a phase 
corresponding to the optical phase difference to 
be measured. This configuration shows, in many 
ways, a similar output to that of the heterodyne 
interferometer, but because no separate modula- 
tor is used, it is called the phase-induced carrier 
method. 

Because of the over-riding need to reduce 
polarization fluctuation and phase drift in highly 
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Figure 11.17 Configuration of a passive remote Mach-Zehnder interferometer. This allows remote 
interrogation by sweeping the laser source, avoiding the need for modulators with electrical drive 
signals to be included in the remote interferometer section. 
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sensitive fiber optic interferometer sensors, many 
different methods have been proposed and devel- 
oped by researchers. The textbooks and review 
papers in the bibliography present greater details of 
many of these schemes, but below we will present 
a case study of the optical fiber gyroscope to intro- 
duce some of the methods that can be employed. The 
gyroscope is a very special case, where it is essential 
to achieve a very high degree of steady-state sta- 
bility. Fortunately, many other types of interfero- 
metric fiber sensor are designed to measure only 
dynamic changes, such as might arise from acoustic 
signals (e.g., the fiber hydrophone) or from other 
mechanical vibrations (e.g., the fiber vibrometer 
or accelerometer), so these do not suffer from quite 
so many undesired sensitivities to slow drift from 
environmental aspects as the fiber gyroscope can. 
The discussion below illustrates that despite many 
fairly complex difficulties, a host of potential prob- 
lems can be overcome to make a cost-effective prac- 
tical sensor, although a great deal of background 
research was necessary to reach this point and the 
gyro still needs careful design. 


11.3.2 High-sensitivity sensing with 
fiber interferometers 


11.3.2.1 THE FIBER OPTIC GYROSCOPE 


The fiber optic gyroscope [51-53], which we shall 
call “FOG” for short, was one of the earliest types 
of interferometric fiber sensor, and is the one which 
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has perhaps received the most research funding 
and scientific attention. 

A FOG detects rotation relative to an inertial 
frame. The basic operating principle of this sensor 
is based on a concept known as the “Sagnac effect” 
[54], which originally used two optical beams, 
each directed in opposite directions around loops 
using mirrors, before being caused to interfere on 
a detector. The basic configuration of the all-fiber 
version is shown in Figure 11.18. 

It can be considered that two lightwaves, prop- 
agating in opposite directions in the same closed 
optical fiber coil, exhibit a traveling-time differ- 
ence, which is proportional to the rotation rate of 
the optical path with respect to the inertial frame. 
This time difference results in a phase difference, 
0, between the beams at the output of the loop, 
given by 


= AnLa Q 


0 ; 
cA 


where L, a, c, A, and Q are, respectively, the length 
of the fiber coil, its radius, the speed of light in vac- 
uum, the optical wavelength and the coil rotation 
rate. The magnitude of the Sagnac phase shift is gen- 
erally extremely low for most typical rotation rates, 
particularly for those typical of vehicle navigation, 
where directional changes are usually quite slow. To 
overcome this limitation, a very long (and hence nec- 
essarily low loss) fiber coil needs to be used for the 
sensing coil. For aircraft navigation, a rotation rate 
resolution of only 0.01° h™ is required. Even when 
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Figure 11.18 Basic configuration for a FOG. This shows the phase modulation elements used to bias 
the device contained on an integrated optics chip (I-FOG chip) that also contains the polarizer and a 


beam-splitting element. 


374 Optical fiber sensors 


using a sensing fiber as long as 1km, this typically 
corresponds to an induced phase difference as small 
as 1 prad. To measure such small and slow-changing 
phase changes, a successful FOG requires extremely 
careful control of the many subtle noise and drift 
factors that can otherwise occur. Fortunately, after 
extensive research over many years, these noise fac- 
tors have been studied in detail, and many excellent 
countermeasures to reduce them have already been 
invented. 

As a result of this success, FOGs are now fea- 
sible for many new applications of gyroscopes, 
such as car navigation, antenna/camera stabilizers, 
radio-controlled vehicles, unmanned track naviga- 
tion, and so on. These new applications could only 
be realized because of unique advantages of the 
FOGs compared to traditional mechanical gyros, 
such as short warm-up time (no need to build up 
mechanical speed, as in normal types), less mainte- 
nance and low cost. FOGs have also now been used 
in traditional application fields, such as airplane 
navigation, rocket launching and ship navigation. 
A part of the navigation system of the Boeing 777 
now uses FOGs. 

To measure tiny phase differences, of the order of 
micro-radians, between the two lightwaves travel- 
ing in the fiber coil of Figure 11.18, a special process- 
ing scheme had to be established. When the system 
is in the rest state, the signals interfering at the 
detector in Figure 11.18 would normally give rise 
to a maximum in the sinusoidal response (dashed 
curve). At this peak, the gradient is of course zero, so 
there is no initial response to small phase changes, 
so hence zero sensitivity for a small rotation rate. In 
order to solve this problem, a phase biasing scheme 
is needed, in order to drive the state of the inter- 
ferometer to a region of the phase response curve 
where the gradient is nonzero. To achieve this, a 
phase modulator, driven by a signal of sinusoidal or 
square waveform, is applied at, or near, the end of 
the sensing fiber coil. Due to the timing difference 
of the modulation between the clockwise (CW) and 
the counter-clockwise (CCW) waves, the two light- 
waves now have a periodic phase difference when 
they impinge on the detector, and give a “mixed” or 
detected output signal having the same frequency as 
the applied phase-modulation waveform. 

By detecting the electronic output synchro- 
nously (ie., by multiplying it with an electronic 
reference signal at the same frequency as the 
applied phase modulation, and then low-pass 


filtering), the signal shown by the solid curve can 
be obtained. Now it can be seen that the detected 
output changes, as desired, with input rotation. 
The polarizer shown in Figure 11.18 is required to 
reduce drift effects due to the polarization fluctua- 
tion in the sensing fiber coil. Essentially, it ensures 
that light travels in each direction in the loop in 
different directions, but in the same polarization 
state, although, of course, each individual beam 
still exhibits polarization changes as it propagates 
round the loop. The system shown in Figure 11.18 
is called the “minimum reciprocal configuration” 
for the FOG. 

Even when the minimum configuration is used, 
many noise, drift, and signal-fading factors can 
still exist. The first major problem to be solved is 
the need to avoid polarization fading. When the 
state of the polarization fluctuates in the fiber, 
the light power received at the detector changes 
and this can result in significant reduction of 
the SNR—in the worst case, the signal can even 
totally disappear! A way to avoid this problem, 
using only a passive component, is to insert a fiber 
depolarizer at some point in (most conveniently 
at the end of) the sensing fiber coil. The depolar- 
izer is fabricated simply using a short birefringent 
polarization-maintaining fiber, in which the two 
orthogonally polarized propagation modes have a 
difference in velocity. When the usual broadband 
LED (or a superluminescent fiber) wide spectrum 
(low coherence) light source is used to excite the 
FOG, the variations in differential mode delay 
result in a different output polarization state from 
the polarization-maintaining fiber section for each 
wavelength component. This effectively reduces 
the degree of polarization by “scrambling” it, to 
give a different polarization at each wavelength, 
and to render it effectively unpolarized when the 
effect is averaged over the full bandwidth of the 
source (even though each individual wavelength 
component is still strongly polarized). Hence, 
using this “depolarizer,” the polarization fading 
problem can be overcome, as some wavelengths 
will still interfere without fading. Unfortunately, 
an undesirable polarization component is induced, 
which is perpendicular to the polarization axis of 
the polarizer. Due to the finite extinction ratio of 
practical polarizer elements, this configuration is 
not suitable for realizing FOGs of very high resolu- 
tion. This configuration is therefore only used for 
low-cost, moderate-grade gyros. 
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To increase the optical efficiency, and hence the 
sensitivity, a polarization-maintaining fiber coil is 
usually used. With such a coil, undesirable cou- 
pling to the orthogonal polarization component 
is greatly reduced. Moreover, even if a small cou- 
pling were to occur, the two polarization modes 
in the fiber have different propagation velocities, 
so the undesired component has a large opti- 
cal delay compared with the desired component. 
Therefore, when using a low coherent source, such 
as an ELED (edge-emitting LED), the undesirable 
component cannot interfere with the desired one. 
This configuration is used for intermediate grade 
gyros. To improve the sensitivity more, a LiNbO, 
integrated optical circuit modulator can be intro- 
duced as shown in Figure 11.19. 

This integrated optical circuit is called the FOG 
chip, in which one coupling branch, two-phase 
modulators, and a polarizer are all integrated 
together on a common planar substrate. This con- 
figuration is used for high-grade FOGs. When a 
proton-exchanged LiNbO, planar waveguide is 
used, it also acts as a polarizer with quite a high 
(>60 dB) extinction ratio. In this system of Figure 
11.19, a sophisticated signal-processing scheme 
has also been included, which is explained below. 


A very small Faraday rotation effect can occur 
in the fiber-sensing coil, due to the Earth’s mag- 
netic field, and this results in another error fac- 
tor in the FOG. The Earth’s magnetic field lines 
are essentially parallel over the small dimensions 
of the coil, so the Faraday effect induced in one 
half of the coil would ideally be canceled by that 
induced in the other half. However, when birefrin- 
gence exists in the fiber coil due, for example, to 
bending or other mechanical stress, the FOG can 
suffer an error or drift. One way to reduce it is 
again to use polarization-maintaining fiber for the 
sensing coil. A polarization-maintaining fiber with 
high birefringence can effectively reduce the drift 
because it prevents the lightwave suffering polar- 
ization rotation due to the Faraday effect. Another 
way to reduce the drift is to place a depolarizer at 
each end of an ordinary single-mode fiber coil. 
Recently, a general formula to describe the Faraday 
effect induced drift has been derived, in which the 
mechanism of the reduction of this drift has been 
theoretically derived. 

If the temperature distribution in the coil 
changes with time, in a manner asymmetric with 
respect to the center of the fiber length, the CW 
and the CCW lightwave will experience a very 


I Y Induced phase 
0 t f difference 
2m ry 
Y To kK 
I-FOG chip Phase modulator 
PM fiber 
coupler 
[°=2.".°*."."] Z 
ad —4 iS [oe] Fi 
\ 
| 
Detector Polarizer 
Y \ 
A/D Bias modulation 
y | <. ' 
Digital signal processing i 
- Demodulation LI D/A 9 f 
- Signal processing > Ye . 
- Waveform generation Digital serrodyne waveform 


y 


Output 


Figure 11.19 Schematic of a more sophisticated gyroscope arrangement with digital serrodyne 


modulation. 


376 Optical fiber sensors 


slightly different line-integral of temperature 
along the coil. Transient thermal changes of very 
small magnitude can give rise to a significantly 
different phase change in each direction in the 
coil, resulting in output drift. A temperature rate 
of change of only 0.01°C s7 can induce a very 
substantial drift of the order of 10° hin an FOG 
with a badly designed coil! Special coil wind- 
ing technologies have been invented, in which 
every section of fiber in the coil lies adjacent, and 
closely thermally coupled to a similar section of 
coil that is located at a symmetrical position with 
respect to the fiber center (i.e., both sections lie at 
equal in-fiber distance from, but at opposite sides 
of, the coil center). This makes the overall tem- 
perature distribution symmetrical with respect 
to the center. To make a high-grade model, with 
0.01° h* sensitivity, such techniques have been 
applied. 

For high-grade applications, such as aircraft 
navigation, a wide dynamic range of about seven 
decades, and a good linearity of about 10 ppm are 
required. To realize these, a special signal-process- 
ing scheme, involving closed-loop operation, has 
been proposed and used. 

The schematic configuration of this has already 
been shown in Figure 11.19 and we shall now 
describe the signal-processing scheme. The phase 
modulation waveform shown in the figure is used 
to intentionally give a more complex phase-differ- 
ence modulation between the CW and the CCW 
lightwaves. This phase difference occurs because 
of the different effective position of the modula- 
tor in each path and the phase modulation wave- 
form used is called the digital serrodyne waveform. 
The modulation waveform can be thought of as 
having a “saw-tooth” envelope, but with steps, as 
can conveniently be generated by a D/A converter. 
Each phase step is adjusted to correspond to one 
roundtrip traveling time, t, of the lightwave in the 
fiber coil. The amplitude is kept at 2a. After the CW 
and the CCW waves have both suffered the same 
time delay of t, these two waves arrive with a phase 
difference at the detector, this difference being 
equal to the phase step ¢ in the serrodyne wave- 
form. The Sagnac phase is then compensated, using 
this phase difference as input to a feedback control 
loop. Under the condition where the amplitude of 
the waveform is set to 21, the phase difference 
is proportional to the frequency of the serrodyne 
waveform. Consequently, the input rotation rate is 


converted to a modulation frequency, which can 
then be measured with a frequency counter, to pro- 
vide a wide dynamic range output from the sensor, 
which also has good linearity. 

Applications of such gyros are expanding rap- 
idly in various fields. The required sensitivity and 
dynamic range can vary greatly according to the 
application. FOGs have already been developed, 
in moderate, intermediate, and high-grade forms, 
with progressive increases in cost and complex- 
ity. For moderate-grade applications, the analogue 
output of the detector is directly measured, this is 
called open-loop operation. For high-grade appli- 
cations, the closed-loop operation method should 
be used. 

Japanese gyro makers have created new appli- 
cation fields for industrial and consumer applica- 
tions, for example, car navigation systems, and 
control systems for cleaning-robots, forklifts, 
agricultural machines and unmanned dump 
trucks suitable for hazardous environments. A 
camera stabilizer, to provide stable TV pictures 
from a helicopter, has also been developed with 
FOG technology for the sensor. Radio controlled 
helicopters with I-FOGs have also been produced 
for agricultural applications, such as planting 
seeds and spraying chemicals. A North-finder 
“optical fiber compass,” using an open-loop FOG 
has been developed. The National Aerospace 
Development Agency of Japan used an inertial 
sensor package with FOGs, for the first time, in 
their rockets, for micro-gravity-mission experi- 
ments. The FOG was selected for this mission as a 
silent gyro. The first launching took place in 1991 
and was the first application of the FOG in space. 
The Institute of Space and Astronautical Science, 
Japan, has developed a rocket having an inertial 
navigation system (INS) with closed loop FOGs. 
The first successful flight of this M-V rocket was 
on 12 February 1997, using an additional radio- 
wave guidance technique. The M-V-1 launched a 
Satellite MUSES-B, with a mission to construct 
a VLBI (very long base-line interferometer) for 
radio astronomy, when in radio contact with 
other antennas on the Earth. Also, in this satel- 
lite, an open-loop FOG, having a 0.05° h” bias 
stability, was used for rate control. 

It should be pointed out that the Boeing 777 
uses a inertial navigation system that combines the 
use of six more conventional ring-laser gyros, of 
0.01° h™ grade, with four more recently developed 
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all-polarization-maintaining-fiber open-loop 
FOGs, having 0.5° h™ capability. 

Applications requiring a sensitivity even greater 
than 0.001° h* exist, such as for space applications 
and ship navigation. Potential applications include 
deep-space and precision spacecraft navigation, 
and space pointing, and stabilization. However, for 
such higher-grade applications, the ELED that is 
commonly used to excite simpler FOGs has insuffi- 
cient power and lacks wavelength stability. Because 
of this, laser-pumped superluminescent Er-doped 
fiber sources have been developed, and, using such a 
source, higher power and extremely high wavelength 
stability of typically a few ppm °C™ can be obtained. 
A rotation resolution better than 0.001° h7 has 
already been demonstrated by several companies, but 
requires carefully temperature-stabilized conditions. 


11.3.2.2 FIBER OPTIC HYDROPHONES 


We shall now discuss fiber sensors for detection 
of acoustic signals in water, as required for many 
military (detection of marine vehicles) and civil 
applications (seismic oil exploration). 

When an acoustic wave impinges on a fiber 
in water, the sound pressure induces change in 
its density and length, which results in a phase 
change of the lightwave propagating in the fiber. 
The change is of course very small in quiet seas, but 
it can be detected using interferometric configura- 
tions with a very long length (100-400 m) of sens- 
ing fiber. Using the compensation schemes for slow 
drift of temperature and mechanical strain, which 
were described above, a highly sensitive acoustic 
wave sensor in water, called a hydrophone, has 
been developed. As we discussed above, signals, as 
small as 10-°rad or less, can be detected, represent- 
ing an optical path length change of only 10m at 
1pm. This corresponds to a change in the physical 
length of the sensing fiber, which is only a small 
fraction of the diameter of a hydrogen atom! It is 
clear this will give tremendous acoustic sensitiv- 
ity. Best sensitivity is achieved using mechanical 
amplifiers, such as mechanically compliant (e.g., 
made of easily deformable, perhaps even air-filled, 
materials) rods as coil formers. The fiber is wound 
around these, such that when acoustic waves inter- 
act, a compressive change in diameter of the soft 
rod is transferred to length changes in the fiber, 
giving a much greater phase change for the same 
acoustic influence, than would occur for bare fiber. 


By using a wide variety of sensor multiplexing 
techniques, such as time-division multiplexing 
(TDM), large arrays of such hydrophones have 
also been realized. These will be discussed in more 
detail later in Sections 11.4 and 11.5 of this chapter, 
and the textbooks and the review paper by Kersey 
in the bibliography adds yet more detail. 


11.3.2.3 THE FIBER OPTIC CURRENT SENSOR 


High-voltage power systems often deal with enor- 
mous voltages (as high as 750kV or more!) and 
very large currents, often several kA. In this area, 
optical fiber current measurement schemes [6] can 
provide excellent electrical insulation and almost 
total immunity to EMI. Optical fiber current sen- 
sors (OFCS) satisfying such requirements are soon 
expected to take the place of traditional electrical 
current transformers (CT). After many years of 
careful research, rigorous field tests of the OFCS 
are finally showing good performance, and the 
research now seems to be in the final engineering 
stages. 

The OFCS measures current indirectly, by mea- 
suring the rotation of the SOP induced by it by the 
total magnetic field component along the axis of 
the sensing fiber. Figure 11.20a shows a typical 
configuration of a polarimetric OFCS. The sens- 
ing fiber coil is wound around a current-carrying 
conductor, and linearly polarized light is launched 
into the coil. The SOP rotation is due to the Faraday 
effect of the magnetic field, in the direction of the 
fiber axis, the magnetic field being induced by the 
current-carrying conductor inside the fiber coil. 
The plane of polarization of the propagating HE,, 
mode in the fiber is rotated through an angle, @, 
given by 


ov] Hal, 


where ¢, V, H and dl are, respectively, the Faraday 
rotation angle, the Verdet constant, the axial mag- 
netic field component and the length along the 
fiber. Because the sensing fiber coil is formed into 
closed path, Ampere’s law gives the relation 


k=QHal, 


where I, is the current in the electrical conductor 
passing through the fiber sensing coil. Suppose the 
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Figure 11.20 Two types of optical fiber current sensors: (a) using polarimetric interferometer and (b) 


using a Sagnac interferometer. 


fiber has a number of complete turns, n, then, from 
the two equations above, the relation 


d=nVI, 


is obtained. These form the well-known fundamen- 
tal equations for the OFCS. These relationships are, 
however, only valid when the sensing fiber coil has 
no linear birefringence. If significant linear bire- 
fringence is present, the polarization state in the 
fiber is disturbed, and the output of the OFCS is 
changed, giving measurement errors. To overcome 
such problems, and reduce the linear birefringence 
of the fiber, several solutions have already been 
developed. Twisting a single mode fiber during 
the winding process is one method, and thermally 
annealing the fiber coil, to relieve strain induced 
by the winding process, is another. Additionally, 
a low birefringence fiber using flint glass has also 
been developed, which also has a stronger Verdet 
constant. This has been demonstrated to give a 
high performance, suitable for meeting practical 
engineering requirements [55]. 

The configuration shown in Figure 11.20a is 
only a polarimetric interferometer, i., mixing 
of polarization components is enabled in a polar- 
ization analyzer. This should ideally have its axis 
inclined at 45° to the axis of each interfering beam. 
Another configuration for the current detection 
has been developed, in which a Sagnac interferom- 
eter arrangement, similar to that used for the fiber 
optic gyro, is adopted. The configuration is sche- 
matically shown in Figure 11.20b. In this system, 
the SOP in the sensing fiber coil is arranged to be 


circular by using quarter wave elements at both 
ends of the coil. In this configuration, the circu- 
lar SOP is maintained throughout the propagation 
in the coil, but, via the Faraday effect, a phase dif- 
ference is induced between the CW and the CCW 
traveling in the fiber coil. The phase difference is 
read out from the interference signal at the detec- 
tor, using the same type of signal processing as the 
FOG. Hence, all the sophisticated and compact 
optical and signal processing modules, already 
developed for use with the gyro can also be used in 
this version of the current sensor. 

We shall now conclude our discussions on the 
different types of single-point (discrete) sensors. 
The above discussions have made occasional ref- 
erence to multiplexed or distributed sensors, but 
the following two sections will now concentrate on 
these aspects more fully. 


11.4 MULTIPLEXED OPTICAL FIBER 
SENSOR SYSTEMS 


11.4.1 Introduction 


This section shows how sensors may be multi- 
plexed [56] and the following one will deal with 
truly distributed [57] optical fiber sensors. It is 
convenient to define multiplexed sensors as those 
designed to collect data from a number of discrete 
sensing points, or sensing regions, and distributed 
sensor types as those that operate on a continu- 
ous length of fiber, and are capable of determin- 
ing the variations of a desired parameter along 
the length of the fiber as a continuous function of 
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distance. Figure 11.21 depicts the basic concepts 
of distributed and multiplexed sensors. The upper 
curve, Figure 11.21la, shows a schematic of an in- 
line example of both types (fan-out topologies and 
many more arrangements are possible for multi- 
plexed sensors) and the lower curve, Figure 11.21b, 
shows the type of response to monitor the values of 
parameters, that each type can provide, as a func- 
tion of physical location. 

Extending the capability of a single measure- 
ment terminal, to address a multiplexed array of 
many passive sensing heads, rather than a single 
one, not only makes the use of a more complex 
monitoring station more economically viable, but 
it can also lead to a more accurate and reliable 
comparison (see Figure 11.21b) of values of the 
measured parameter, because the same interroga- 
tor is used to read each one. The distributed sensor 


(a) Schematic illustration of discrete, multiplexed and distributed sensors 


In 


Discrete sensor 


Out 


In ee 
Multiplexed sensor 


Out 7" 


Tings 


Distributed 
sensor 


Out 


(b) Advantages of distributed sensors 


Value 


Discrete 
sensor 
Position 


Value 


Multiplexed 
sensor 


Position 


Feature which could 

be missed by discrete 
Me ox multiplexed sensor. 

(for example, hot spot) 


Value 


Distributed 
sensor 


Position 


Figure 11.21 Schematic illustration showing (a) an 
in-line configuration of multiplexed and distrib- 
uted sensors and (b) the different types of spatial 
coverage offered by each type of sensor. 


(bottom curve, Figure 11.21b), however, goes a 
stage further, allowing the full distribution of the 
measured quantity to be determined with no gaps 
in coverage. In both multiplexed and distributed 
cases, one or more passive sensors, or sensing-fiber 
sections, with controlled environmental param- 
eters, can be used as a calibration aid for the 
interrogator. 

As with multiplexing in communications, the 
ultimate information gathering capacity is funda- 
mentally limited by the available bandwidth and 
by the SNR of the detected optical signals, and 
similar care must be taken to avoid undesirable 
cross talk between signals from apparently inde- 
pendent information sources; represented in this 
case by the individual sensor elements. 

In describing multiplexed sensors, we might 
perhaps start with a historical note. One of the 
earliest schemes for sensor multiplexing in opti- 
cal fibers was reported by Nelson et al. [58], where 
a fiber optic branch-tapped network was used, in 
conjunction with an optical time-domain reflec- 
tometer (OTDR). This system, which will be 
described later, is capable of receiving and inde- 
pendently monitoring the separate returns from a 
series of reflective sensors. 

The following sections will describe many more 
methods, outlining the most significant develop- 
ments in the technology, with the sensors being 
grouped or classified according to the method used 
to address the various sensing elements. 


11.4.2 Spatial multiplexing (separate 
fiber paths) 


Starting with the simplest low technology 
approach, the technique of using separate fibers 
to communicate with each separate sensing ele- 
ment (Figure 11.22), although trivial in scientific 
terms, is virtually guaranteed to avoid one of the 
pitfalls of multiplexing, that of cross talk between 
sensors. 

As it is the easiest method to implement, it was 
one of the first to have been used for practical appli- 
cations. Perhaps the most unusual and dramatic 
use so far has been the application of 152 separate 
graded index fibers, each 0.6km long, for nuclear 
weapon diagnostics [59]. For this particular appli- 
cation, light was generated at the sensor head end, 
so the light source and outgoing fiber paths shown 
in Figure 11.22 were not required, and the multiple 
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Figure 11.22 Spatial multiplexing using separate fiber paths: with (a) fixed paths and (b) with fiber 
switches to enable one source and detector to be used. 


fiber channels were used to guide light from the 
event. Each individual receiving fiber guided light 
to produce a “pixel” of an image on a fluorescent 
phosphor imaging screen, presumably located in a 
monitoring area at a safe distance! 

For cost-sensitive applications, where a slow 
update is acceptable, it is convenient to incorporate 
a fiber switch into a single optoelectronic processor 
unit, with a single light source and detector to per- 
mit it to “poll,” i.e., sequentially connect to, each of 
the sensing heads in turn (Figure 11.22b). 

A multiplexed system for monitoring fluores- 
cent dye concentration in water, via separate 600m 
lengths of optical fiber to each sensing head, has 
been reported [60]. In this system, a mechanically 
scanned mirror system was used to inject light 
from an exciting laser sequentially into each sepa- 
rate sensor head via separate fibers, the returning 
fluorescent light traveling through separate fibers 
to acommon photomultiplier detector. This system 
was capable of monitoring fluorescent dye concen- 
tration down to levels of 10~° by weight in water. 

Although usually used to address independent 
sensors, multiple fibers have been used to permit 
precision digital sensing in systems where each 
“bit” of binary-coded information is carried by 
a separate fiber. This technique has been used in 
simple position-encoding schemes, using binary- 
coded discs of the type used in conventional optical 
shaft-angle encoders [61]. A more complex system 
has been described for the “digital” measurement 
of temperature, using a series of isothermal (i.e., 


all in close thermal contact) birefringent crys- 
tals, each sensing crystal having a different length 
according to the significance of the desired order 
of the “bit” of binary-coded information [62]. The 
temperature-dependent birefringence of each is 
separately monitored via a polarization analyzer, 
which converts these changes in the birefringence 
into changes in the amplitude of the transmitted 
light, each with a different sinusoidal response to 
temperature. The separate outputs were combined 
to produce the equivalent of a binary word, which 
defined the temperature of the crystals. 


11.4.3 Time-division multiplexing 


The concept of TDM, in combination with high- 
speed digital technology, has revolutionized mod- 
ern-day communications. It is hardly surprising 
that it is also an attractive choice for multiplex- 
ing sensors! The time differences between signal 
returns from each sensor, necessary for a TDM 
system, are usually achieved by arranging differ- 
ing total optical propagation delays for the signals 
from each sensing element, using extra coils of 
fiber, whenever longer delays are needed. In addi- 
tion, in order to distinguish, and separate, the sig- 
nals from each sensor, it is necessary to modulate 
the light source with a temporally varying signal. 
The simplest form of encoding, for ease of both 
modulation and demodulation, is to use a repeti- 
tive pulse, having a short duration so that the set of 
returning differentially delayed pulses from each 
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sensor does not coincide at the detector. Also, it 
must have a pulse repetition rate that is low enough 
to allow each reflected pulse “echo” to return back 
from the most distant element of the sensing array 
before the first of the subsequent set of reflected 
pulses returns from the nearest sensing element. 

One of the first multiplexing methods suggested 
for use with hydrophone sensor arrays was of this 
type [63]. One of their proposed methods is shown 
in Figure 11.23a. This used two parallel fiber-optic, 
cross-coupled, highways, the first to distribute a 
transmitted optical pulse and tap a portion of it 
into each transmissive (loss-modulation type) sen- 
sor, and a second highway to collect signals from 
each individual sensor and guide the set of return- 
ing pulses to the detector. 

The second of their proposed arrangements 
used a single tapped highway, with reflective sen- 
sors, each having a measurand-dependent reflec- 
tion, connected to this highway with directional 
fiber couplers (Figure 11.23b). This array was 
interrogated using a conventional OTDR arrange- 
ment, with a semiconductor laser source and an 
avalanche photodiode (APD) detector. The basic 
method of optical time-domain reflectometry, 
which we mentioned briefly in the chapter intro- 
duction, was devised by Barnoski and Jensen [64] 
and researched further by Personick [65]. The con- 
cept is depicted in Figure 11.24, in a configuration 
commonly used to monitor losses in optical fibers 
and reflections from discontinuities. It is based on 
an optical radar (or LIDAR) concept, where a short 
pulse of light is launched into a fiber waveguide 
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and variations of backscattered light signal with 
time are monitored. 

Usually, light from a pulsed semiconductor laser 
(or Q-switched fiber laser light source) is launched 
into a section of fiber via a directional coupler, 
which serves also to direct a portion of the back- 
scattered light fraction, returning from the fiber 
on test, to a high-speed positive-intrinsic-negative 
field effect transistor or APD detector. The time of 
flight determines the distance, and the intensity 
variation normally indicates properties of the fiber 
under test (see lower curve of Figure 11.24). It has 
become a standard test-gear instrument for use by 
optical fiber engineers and researchers, to exam- 
ine the continuity and attenuation of optical fibers, 
and observe reflections from fiber breaks, connec- 
tors, or other discontinuities. 

If used to interrogate multiplexed sensors, it 
is possible to measure variations in either the 
reflected power from, or the transmission loss in, 
each sensor element (distributed sensing will be 
discussed in more detail later). Desforges et al. 
[66] have also reported experimental results with 
reflective sensors using an OTDR, but in their case 
the sensors were located at regions where the opti- 
cal fibers are deliberately bent, to cause noninva- 
sive coupling of light into and out of a continuous 
fiber to the reflective sensors. 

The OTDR may also be used to monitor dis- 
crete loss-modulation sensors, of the “microbend” 
type we discussed earlier, situated along the length 
of a continuous optical fiber. This technique has 
been proposed to monitor strain or deformation 
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Figure 11.23 TDM of fiber sensors: (a) ladder network with transmissive sensors and (b) branched 


network with reflective sensors. 
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Figure 11.24 Basic concept of the OTDR, showing (a) optical arrangement and (b) typical OTDR 


returns from a fiber. 


in civil engineering structures [6,67,68] and for 
nondestructive testing of engineering composite 
materials [69]. However, care must be taken with 
such multiplexed arrangements if quantitative 
results are required, because such microbend sen- 
sors cause severe mode conversion in multimode 
fibers, and the loss of each such sensor is strongly 
dependent on the mode excitation of the fiber 
immediately before the micro-bent section. Thus, 
when multimode fiber is used to interconnect and 
form an array of closely located sensors, there will, 
in general, be significant interelement cross talk in 
the response. 

The main disadvantage of a simple pulsed 
source for addressing multiplexed arrays is the 
poor optical efficiency due to the low excitation 
duty cycle. This problem is exacerbated, either by 
the peak-power limitations of semiconductor light 
sources, or by the onset of nonlinear optical pro- 
cesses in monomode fibers. A similar problem has, 
however, already been previously met, and dealt 


with, in radar systems, where it was also desired 
to distinguish differentially delayed signal returns 
without need for enormous transmitted power 
levels.” The radar technique most relevant to time- 
division optical sensor multiplexing is that where 
the transmitted radio signal has a pulse-code 
intensity modulation envelope that has a 50% duty 
cycle and has an autocorrelation function cor- 
responding to that of a periodic pulse train [70]. 
Simple cross-correlation of the detected signal, 
with differently phased versions of the code origi- 
nally transmitted, can then be used to separate the 
pulses corresponding to the individual sensors, 
just as if a single pulse of much higher peak power 
were to be used. 


* Anti-Stokes Raman light, at wavelengths shorter than 
the incident light, although normally very weak, can 
still have a useful intensity if it is measured at a wave- 
length very close to the incident wavelength. It then 
often has less interference from fluorescence light. 
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An alternative means of reducing the peak 
power requirements for the optical source (again 
one commonly used in radar systems) is to pro- 
duce a temporally extended “chirp” signal from an 
electrical pulse, using a dispersive surface acoustic 
wave (SAW) filter, and to use this signal to modu- 
late the intensity (amplitude) of the light source 
[63]. The detected return signal is then subjected 
to an inverse transformation, using a second SAW 
filter, thereby reconstituting a replica of the origi- 
nally transmitted pulse from each reflected signal; 
the net result from a sensor array, with delays of 
different lengths, being a time-division multi- 
plexed stream of pulses, similar to those which 
would be obtained without the SAW filters. 

As mentioned earlier, the TDM technique is also 
attractive when used with interferometric sensors. 
If an optical heterodyne method is used, where the 
returning signal is mixed with a reference signal of 
shifted frequency, the amplitude of the returning 
signal will, provided it remains constant, have no 
effect on the phase of the resulting beat signal. Such 
sensors, therefore, will require no amplitude refer- 
encing, as the output of the sensor is represented 
by the phase of the beat signal. A multiplexing 
scheme of this nature was first reported for hydro- 
phone applications by Dakin et al. [71]. This partic- 
ular implementation (Figure 11.25a) involved the 
launching of a consecutive pair of optical pulses, 
each having slightly different frequency, into a lin- 
ear array of interferometric intrinsic fiber strain 
sensors, each joined by partially reflective splices. 
The intrinsic sensors are coils of monomode fiber, 
usually potted into polymer cylinders, to enhance 
their response to acoustic pressure waves. The ini- 
tial optical pulses transmitted were consecutive, 
ie., did not overlap in time, but the differential 
delay of returning pulses from adjacent splices 
gave rise to coincidence of the second pulse, from 
the nearer splice of a sensor element, with the first 
pulse, from the subsequent splice (Figure 11.25b). 

Thus, with suitable pulse separation and dura- 
tion, a time-division multiplexed stream of hetero- 
dyne beat signals was obtained from the receiving 
detector, each heterodyne pulse corresponding to 
an element of the array, and carrying phase modu- 
lation proportional to the changes in optical path 
length. These path length changes were a direct 
measure of the strains arising from insonification 
of the corresponding fiber hydrophone element. 
Phase demodulation of each time-demultiplexed 


channel yielded the acoustic signals, free from 
any dependence on the amplitudes of the reflected 
light signals. In a patented modification, the same 
research group [72] first showed that it is pos- 
sible to compensate the unbalanced interferom- 
eter arrangement using an optical loop of fiber 
as a “pre-delay,” thus balancing optical paths and 
greatly reducing the effects of undesirable phase 
noise that arise from frequency fluctuations of the 
source laser (see later Section 11.4.7 on frequency- 
modulated carrier wave (FMCW) methods). 

The attraction of such coherent heterodyne 
TDM approaches is the great sensitivity to even 
tiny phase changes that can be achieved using the 
coherent detection process and, secondly, the excel- 
lent dynamic range that is possible using electronic 
phase demodulation of the intermediate frequency 
signal. The improvement in the detection process 
is greatest if a returning reflected signal is mixed 
with a strong local oscillator signal, derived at the 
monitoring station from the initial source [73]. The 
advantage of differential sensing of the distance 
between adjacent reflective splices, achieved by the 
method shown in Figure 11.25, may theoretically 
be retained if a three-wave mixing process is per- 
formed [74]. Using this latter method, the beat sig- 
nal between two weak received signals should be 
recoverable, after mixing with a strong local oscil- 
lator signal on the detector. 

Based on many of the concepts arising from 
those early publications, the field of marine acous- 
tic sensing has been one of the major practical 
success stories in the field of optical fiber sensors. 
The applications are in two main areas, firstly for 
naval surveillance applications (all the usual ones 
of towed arrays, vehicle arrays and fixed sea bed 
arrays) and secondly for seismic surveys, where 
intense sound sources, often explosive ones, are 
used to investigate subsea rock strata for oil- 
bearing features. It now appears very likely that 
optical fiber hydrophone sensor arrays, having 
a conveniently passive all-fiber “wet-end,” will 
completely take over from older technology using 
piezo-electric sensors. The latter require electrical 
preamplifiers, complex electrical wiring to elec- 
tronic multiplexers, and a sophisticated electron- 
ics communications system. Electrical systems 
are difficult to design and reliably maintain in a 
corrosive and conductive salt water environment. 
An excellent review of the subject has been given 
by Kersey in bibliography [7] (this chapter). An 
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Figure 11.25 (a) Diagram of optical fiber hydrophone array. (b) Timing diagram for a four-element 
array with: (1) the input to the Bragg cell; (2) the reflection from the end of (i) the down-lead, (ii) the 
first sensor, (iii) the second sensor, (iv) the third sensor and (v) the fourth sensor and (3) the output 


from the photodiode. 


advanced seabed sensor array system and a work- 
ing sea test are described in papers by Nash et al. 
[75] and Cranch et al. [76]. 


11.4.4 Wavelength-division 
multiplexing 


The use of wavelength-division multiplexing 
(WDM), unlike the alternatives of TDM and sub- 
carrier frequency-division multiplexing (FDM) 
techniques, has the advantage that there is no 


theoretical loss penalty when compared with the 
single-fiber-per-sensor approach. The method 
involves guiding optical power to each sensor, 
and back to a corresponding sensor, via a route 
dependent on the wavelength designated for 
the interrogation of that particular channel (see 
Figure 11.26 for a very basic schematic). The path 
of light to each sensor is directed using WDM 
coupling components, similar to those used in 
communications systems. These components are 
in theory lossless, but will, in general, introduce 
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Figure 11.26 Schematic diagram of a wavelength-division multiplexed arrangement for remote detec- 
tion of the state of three amplitude-modulation sensor heads. 


a small loss in practice (typically 1-3 dB for each 
pass), imposing, therefore, a very small penalty in 
an otherwise lossless multiplexing method. 

Although the spectral width of the fiber trans- 
mission “window” is potentially enormous, when 
compared with the very much lower informa- 
tion rate theoretically required for telemetry, the 
potential of multichannel WDM was originally 
rather limited by practical difficulties in achiev- 
ing sufficient selectivity in the WDM filters. This 
has changed dramatically with the excellent wave- 
length routing components that have now been 
developed for multichannel WDM telecommuni- 
cations systems. Using broadband LED or super- 
luminescent fiber sources, or multichannel laser 
sources, the practical number of independent 
channels in the fiber window is now potentially 
very large. It is possible to use “spectral slicing” 
techniques with either LED [77] or superlumi- 
nescent fiber sources [78]. This latter technique 
involves filtering out two or more separate narrow 
channels, from within the broad spectral linewidth 
of the LED (in this case for distributed sensing, 
but the basic concept is the same). The separation 
can be performed using narrow-band filters—a 
method that inevitably decreases the average sig- 
nal in each channel by a factor at least in propor- 
tion to the number of channels required. 

Another method of achieving a large number of 
channels is to use a number of independent nar- 
row-linewidth laser sources, which removes this 


inherent loss penalty, as modern WDM combin- 
ers can, combine channels with zero loss in theory, 
and in practice they have close to 100% efficiency. 
Generally, the limit with lasers is set by practical 
considerations, such as the selectivity of channel 
filters, the availability, selection, and stabilization 
of laser sources, and the cost of a multilaser system 
of this nature. Again very dramatic developments 
in the light-sources (e.g., DFB fiber lasers) passive 
components (WDM splitters, add-drop multiplex- 
ers, etc.) and many other sub-systems have been 
made to facilitate use of WDM in telecommunica- 
tions applications, which paves the way for such 
methods in sensors. 

The WDM approach was first used to permit the 
addressing of separate “bits” in a 10-bit digital sen- 
sor, designed to measure a single physical param- 
eter, in this case the angular position of a shaft in 
a fiber optic shaft angle encoder [79]. A broadband 
white light source in the transmitter/receiver ter- 
minal launched light into the outgoing fiber to the 
sensor head (shaft angle encoder), and two diffrac- 
tion gratings were used, the first one in the encoder, 
first to separate the white light from the outgoing 
fiber lead into 10 wavelength channels, and then 
to recombine the reflected code-carrying chan- 
nels after intensity modulation of each “bit” by the 
coded disc. A second similar diffraction grating 
was used again in the decoder, to direct each of the 
returning signals, according to their wavelength, 
onto a separate detector of an array. The digital 
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encoder disc was a “Gray-coded” disc, as com- 
monly used in conventional optical encoders, but 
in this version a different color was used to inter- 
rogate each separate coded band on the encoder 
disc, hence allowing transmission through a single 
optical-fiber cable (or two-way send/return cable) 
by wavelength multiplexing. 

The multichannel WDM approach has also 
been proposed to provide additional picture ele- 
ments for remote imaging [80]. In this case, a 
diffraction grating system was again used in the 
sensor head to enable the spatial position of each 
pixel of a one-dimensional white-light image to 
be uniquely coded, according to its optical wave- 
length, before transmission to a remote monitor 
station. At this point, the image is recreated, using 
a second diffraction grating to perform the inverse 
transformation. This way, one dimension of the 
two-dimensional image is divided according to 
wavelength. The other dimension can be divided 
easily using a one-dimensional array of separate 
fibers. 

Now that WDM systems have become so widely 
accepted for fiber-optic communications systems, 
the practical capability of WDM systems to address 
large arrays sensors is becoming ever greater and yet 
more economically viable. It is applicable to many 
types of optical sensor, and there are now many key 
components for WDM routing, including multiline 
laser sources, and low-loss multichannel filters, grat- 
ing filters, add-drop filters, to name but a few. An 
example of a multielement hydrophone array, using 
a combination of both TDM and WDM methods, 
has been presented by Vohra et al. [81]. 


11.4.5 Multiplexing of in-fiber Bragg 
gratings using TDM and WDM 
methods 


In view of the importance of Bragg grating sen- 
sors, it is appropriate to discuss how these sen- 
sors can be multiplexed. In view of their narrow 
reflective spectrum, they readily lend themselves 
to WDM methods. Second, the ability to write 
them at any point in a fiber, with associated posi- 
tion-dependent variations in the two-way optical 
delay from a pulsed light source to the sensors, 
according to their position, allows TDM to also 
be used. Space here does not permit a full discus- 
sion of what is now a major research area, but the 


excellent review by Kersey et al. in the bibliogra- 
phy [7] covers this area very extensively. 

Some of the methods used to address multiple 
gratings, all situated in a single fiber cable, are 
listed below, many of which were of course men- 
tioned in the earlier section describing how indi- 
vidual gratings may be addressed: 


e Use of broadband source launched into fiber and 
a spectrometer (CCD spectrometer or Fourier- 
transform spectrometer) to interrogate the 
reflected spectrum (system as in Figure 11.12, 
but with multiple in-line Bragg gratings in the 
fiber). 

e Use of a scanned narrow-band optical filter 
(e.g., conventional bulk-optic Fabry-Perot, all- 
fiber Fabry-Perot, acousto-optic tunable filter, 
and scanned Bragg grating) in conjunction 
with a broadband source launched into fiber, 
and a detector to receive reflected light from 
arrays of in-fiber Bragg gratings. The filter may 
either be scanned to measure the peak reflec- 
tivity of each grating in turn or with the aid of 
a feedback loop, may be locked on to track each 
grating in turn. 

e Use of a mechanically-scanned (fiber is 
stretched with a lead-zinc-titanate (PZT) all- 
fiber Michelson interferometer, followed by a 
single optical detector, to interrogate reflected 
signals from arrays of in-fiber Bragg gratings. 
Fourier transformation of the detected output 
signal is performed, in the same manner as 
used in the well-known Fourier transform 
spectrophotometer using a bulk- optics inter- 
ferometer [82]. 

e Rare-earth-doped fiber laser, operating mul- 
tiwavelength, using the Bragg gratings as end 
mirrors. 


Both the review by Kersey (bibliography [7]) and 
another by Dakin and Volanthen [83] cover many 
of the pitfalls that have to be avoided to prevent 
undesirable measurement errors. Examples of 
potential problems include the following: 


11.4.6 Subcarrier FDM 


In point sensors, which are connected by separate 
fiber paths to a common detector, the technique of 
FDM of the intensity modulation waveform can 
be used, modulating the electrical signal to drive 
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a light source. Such a modulation may be used to 
facilitate separation of the outputs from individ- 
ual channels in the detected signal. The simplest 
way of using the technique is the relatively trivial 
method of transmitting signals from separate light 
sources, each modulated by electrical signals of 
different frequency, via separate fibers to each of 
the sensor heads. The outputs may then be com- 
bined (or added) into a common output fiber and 
detected on a common detector, yet still be separa- 
ble by frequency-selective electrical channel filter- 
ing. However, a more elegant FDM approach, using 
a single LED source, has been devised by workers 
at the University of Strathclyde in Glasgow [84]. 
This method, using a simple transmissive system, 
is shown in Figure 11.27a. The three subcarrier 
modulation signals add at the detector output, as 
shown in Figure 11.27b, with the resultant of their 
vector addition being dependent on their relative 
phase angles. These phase angles are a function 
firstly of the original phases of the transmitted 
envelope modulation and secondly of the different 
delays they experience in transmission through 
the optical network containing the point sensors. 
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The term FDM is generally used in optical com- 
munications systems to imply systems in which 
an optical carrier is amplitude modulated by a 
composite electronic signal, consisting of a sum 
of modulated subcarrier signals with different fre- 
quency channel allocations. In order to provide a 
more precise terminology, the prefix “subcarrier” 
has been inserted in the title of this section, but 
for the remainder of this review, the abbreviation 
FDM will, for convenience, be used to describe 
such systems. 

These signals may be separated, first by employ- 
ing multichannel phase-sensitive detectors on the 
detected optical signal (using the original applied 
modulation signals as electronic reference signals) 
and then by solving simple simultaneous equations 
(linear regression) on the resulting scalar quanti- 
ties. This process allows removal of any cross talk 
terms from light that has passed through the sen- 
sor heads of each other channel. 

The above system has the advantage over TDM 
methods of having 100% duty cycle and using 
much simpler and slower electronics. For low- 
frequency sensors, it should be easy to construct 
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Figure 11.27 Multiplexing using frequency-division-multiplexing of subcarriers (intensity modulation 
signals), showing: (a) schematic diagram of arrangement for three amplitude-modulation sensor heads 
and (b) vector diagram for the phase-sensitive summation of subcarrier signals on an optical detector. 
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with a low-noise bandwidth in each of the phase- 
sensitive detection stages. However, although the 
electronics does not require a high-frequency 
response, care must be taken to ensure stability of 
electronic and optical delays, as serious cross talk 
could otherwise result. 


11.4.7 The FMCW multiplexing 
scheme 


This multiplexing method, like many of the preced- 
ing techniques, was adapted from the radar field 
and has much in common with the FDM method 
discussed in the previous section. However, in 
this case, the RF subcarriers signals only actually 
appear at the optical detector, as a result of “beat- 
ing” or “mixing” of optical waves, where difference 
frequency (heterodyne) signals are generated due 
to the “square-law” characteristics of optical detec- 
tion. The transmitted signal in the FACW method 
is an optical carrier wave, the frequency of which 
increases (or decreases) linearly for a period, T, after 
which time it flies back to its initial frequency, before 


repeating the process. When now the source is con- 
nected to an interferometer with a differential path 
delay, the return signals on the detector differ by a 
frequency proportional to firstly the optical path 
difference they have experienced and secondly the 
frequency slew rate of the source (see Figure 11.28). 

Such a system was first proposed in single sen- 
sor form by Uttam and Culshaw [85], with a later 
description from Giles et al. [86]. If the source is 
connected via a series of interferometers, each hav- 
ing different delay paths [87], a series of RF car- 
rier signals are produced at the detector by the 
heterodyne mixing process. The various sensor 
output signals may be isolated by electronically 
filtering out the corresponding frequencies. The 
sensor output signals may be represented by the 
phase, frequency or amplitude of the recovered RF 
signals, depending on the scale of the path length 
change (small path length changes are more read- 
ily discernable as phase changes, large path length 
changes more conveniently as frequency changes) 
or the transmission changes occurring in the 
sensor. 
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(c) Multiplexed FMCW arrangement (Al-Chalabi et al. 1985) 
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Figure 11.28 The FMCW method of sensing and multiplexing, showing: (a) a schematic diagram of the 
FMCW Michelson sensor, (b) variation of frequency with time for optical and electronic signals and (c) 
an arrangement for multiplexed sensors using FMCW. 
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The technique of FMCW for sensor multi- 
plexing suffers from several practical difficulties, 
when used with a large number of sensors. The 
main problems arise from the large number of 
optical paths, which can produce a background of 
unwanted signals, from both the expected sensor 
paths and additional stray paths due to multiple 
reflections [88]. Also, when it is desired to extract 
precise phase information from the method, 
problems may arise from nonlinearities in the 
laser frequency modulation ramp. Possible “pull- 
ing” of the laser frequency by reinjection of retro- 
reflected light can be avoided by use of an isolator, 
but a semiconductor laser usually has intrinsic 
nonlinearities in its frequency chirp response due 
to the complex internal optical and thermal laser 
time constants. 

Other problems arise because the technique 
necessarily requires an unbalanced interfer- 
ometer for it to operate, so will eventually have 
reduced fringe contrast as the path differences 
increase, due to the finite coherence length of the 
source. It also necessarily suffers signal degrada- 
tion from phase-noise, due to random frequency 
fluctuations of the laser source from the ideal 
saw-tooth, as, unlike certain other methods [72] 
the phase noise cannot be passively compensated 
for by using balanced optical paths. Limits on the 
permissible path imbalances greatly reduce the 
range of heterodyne frequencies that may be allo- 
cated for multiplexed sensors, a problem that may 
be exacerbated by nonlinearities in the frequency 
ramp and undesirable amplitude modulation of 
the source. However, despite some disadvantages, 
the FMCW technique is conceptually elegant and 
has the major advantage of being able to monitor 
steady-state changes of the length of an interfero- 
metric sensor, simply from the absolute magni- 
tude of the heterodyne frequency. 

More recently, a variation of FMCW has been 
used for addressing hydrophone arrays. Here, it is 
used to generate a heterodyne carrier, and has been 
renamed the “phase-generated carrier” method. 
Rather than modulating the source directly, a fiber- 
compatible PS, such as a mechanically strained 
(piezo-driven) coil, or an integrated optic FS can 
be used. These methods are discussed in detail in 
bibliography [1], vol 4, and in the excellent review 
by Kersey. 

It should be pointed out that the FMCW 
method is not limited to situations in which the 


optical carrier is modulated. It has been shown 
by Mallalieu et al. [89] that FMCW may also be 
operated using a frequency-modulated subcarrier 
wave, rather than modulating the optical wave. 
This removes the need for both coherent sources 
and monomode fiber, and simplifies the optical 
system for sensors that do not require the use of 
optical interferometry, but because of the much 
lower frequency slew rates, will not normally 
achieve the phenomenal sensitivity to minute 
length changes that optical interferometry offers. 
It has excellent attractions for high-resolution 
interrogation of fibers, and improvements in the 
modulation of the source will be discussed again 
later in this chapter. 


11.4.8 Coherence multiplexing 


The technique of coherence multiplexing is an 
optical method, which is similar in concept to 
a technique used in spread-spectrum radio fre- 
quency communications. A signal, with a super- 
imposed random (or pseudorandom) modulation, 
may be demodulated to recover information by 
correlating the received signal with a similarly 
encoded random (or pseudorandom), but delayed 
reference signal. In sensing systems, a broadband 
source of short coherence length may be used as a 
transmitted signal, as such a source will, in gen- 
eral, be subject to naturally occurring random 
phase or frequency excursions. If such a signal is 
guided via two equal, or near-equal, monomode 
fiber paths, then the signals suffer nearly equal 
delays, and have a strong correlation, provided the 
path difference is small compared with the coher- 
ence length of the source. Under these conditions, 
high-contrast interference fringes can be observed 
if the output signals are mixed on a square-law 
detector. If, however, the paths differ by very much 
more than the coherence length of the source, the 
fringe contrast becomes close to zero. 

The arrangement in Figure 11.29, first pro- 
posed by Brooks et al. [90], shows how several 
remote Mach-Zehnder interferometers may be 
independently addressed using the coherence 
multiplexing technique, provided the optical path 
length differences, /,—1), 1,-1,, and 1,—1,, are all 
much greater than the coherence length of the 
source. The sensing method is based on observ- 
ing the fringe shifts, which occur as the path 
differences /,—J, and 1,—I, in the sensors 1 and 


390 Optical fiber sensors 


Down lead 


Light source 


Detector 1 


Detector 2 


Receiver 


Sensor 2 


Sensor 1 


Return lead 


Figure 11.29 Schematic arrangement for coherence multiplexing, shown for the simplest case of 
separation of signals from only two remote Mach-Zehnder interferometric sensors. 


2, are changed by small increments, A, and A,, 
respectively. The changes are observed by inter- 
ferometric comparison with the corresponding 
fixed lengths J,—/, and J,—[, in the receiver inter- 
ferometers. Only close-matched paths give visible 
interference fringes. 

As a means of separately interrogating the out- 
puts from a small number of remote sensors, the 
method has attractions. However, as the number 
of sensors increases, the number of possible opti- 
cal paths in the network, from source to detec- 
tor, increases very dramatically and the use of a 
very short coherence length source, such as an 
LED, soon becomes necessary. This then presents 
extreme difficulty in achieving the very close path 
length equalization required, in order to ensure 
adequate fringe contrast, and results in a rather 
small dynamic range of measurement before 
fringe visibility is lost. Further practical disad- 
vantages of the system are the need for each sen- 
sor to have a unique path length difference (and 
hence, without careful design, a different sensi- 
tivity) and the need for adequate stabilization of 
the receiver interferometers, to avoid undesirable 
errors due to drift or vibration in these reference 
interferometers. 

In conclusion, therefore, it appears likely that, 
although elegant in conception, the method is 
less likely to find practical application in systems 
with more than perhaps 10 sensors, unless other 


multiplexing methods, such as WDM and TDM 
are also used. 

A method for grating interrogation using 
coherence effects has been presented by Dakin 
et al. [91]. Here, a number of in-line Bragg-grating- 
pair sensors are interrogated, in turn, using a 
scanned Michelson interferometer. Fringes are 
only observed when the path length of the interro- 
gating Michelson interferometer matches the opti- 
cal spacing of the Bragg gratings, and the rate of 
fringe crossing gives a measure of the wavelength 
of the gratings. 

In the final section of this chapter, M Marcus 
will also give a detailed case study of work at 
Kodak on a white-light interferometer system for 
decoding multiple light paths in manufactured 
optical products. These have been used for assess- 
ing the inter-element spacing and aspects such as 
focal lengths of production cameras. 


11.4.9 Conventional sensors with an 
acoustic-sensing “fiberdyne” 
highway 


As a final sensor multiplexing method, a hybrid 
technology will be described, which is really a 
noninvasive data collection highway. Its sen- 
sor use would be for collecting data from arrays 
of conventional electrical sensors, using a 
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continuous optical fiber sensing highway. This 
highway is essentially a long, continuous, acous- 
tic sensor, based on interferometric techniques, 
to which signals from electrical sensors could 
be coupled using piezoelectric transduction ele- 
ments, held in close mechanical contact with the 
fiber or cable. In order to distinguish the sepa- 
rate signals applied to the common highway, it is 
possible to allocate electrical subcarrier signals, 
of different frequency for each sensor, and allow 
the sensor output signals to modulate (intensity 
or angle modulation) these electrical carriers and 
impart information before they are applied to the 
piezoelectric transducers. 

The original method, referred to as a “fiber- 
dyne” system, was first devised and built by 
workers at UCL, London [92]. The first version 
(Figure 11.30a) used a Mach-Zehnder twin-fiber 
interferometer arrangement, in which one fiber 
acted as a passive reference arm and the second 
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was used as the strain-sensitive highway, which 
was subjected to an acoustic influence from the 
piezoelectric transducers. The method was ini- 
tially proposed as a wideband fiber highway and 
had a high sensitivity, but, without use of a PC, 
would be subject to polarization fading when 
used with a normal monomode fiber. 

A later version, from the same research team, 
used a single multimode fiber and relied on varia- 
tion of the “speckle” interference pattern emerging 
from the fiber, as the PZT transducer caused mode 
conversion in it (Figure 11.30b). This method had 
the advantage of not requiring a separate reference 
fiber, but, as a result of the complex beating pro- 
cesses between many fiber modes, and the need for 
differential modulation of mode delays, it exhib- 
ited a somewhat lower sensitivity. It had a greatly 
reduced optical energy efficiency, as only a frac- 
tion of the speckle pattern was actually incident on 
the detector (otherwise only the total transmitted 
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Figure 11.30 “Fiberdyne” highway systems: (a) original fiberdyne concept using monomode fiber 
(Mach-Zehnder configuration), (b) multimode fiberdyne highway (speckle modulation) and (c) polari- 


metric fiberdyne highway (polarization modulation). 


392 Optical fiber sensors 


power would be detected, and phase modulation 
would not change this) and there was a strong 
possibility that either multimode fading or polar- 
ization fading could occur. The fading can only 
be prevented by arranging for diversity of phase, 
polarization and spatial arrangement of detect- 
ing the speckle. This requires multiple detectors 
and possibly active electronics to select nonfading 
channels. Despite these problems associated with 
this first implementation [93,94], a sensor telem- 
etry highway was constructed and tested. In this, 
a number of stations could be fed in, using a set of 
channels, with relatively low radio frequencies for 
the carrier signals (= 1 MHz). The channels had an 
even lower baseband-channel frequency (informa- 
tion rate) of=10kHz, which although low would 
be more than sufficient for many data collection 
requirements from sensors. This signal to highway 
coupling was carried out acoustically, as before, 
but, in these later systems, the acoustic waves were 
transmitted via the plastic protective sheathing of 
a fiber-optic highway cable, the reduction in cou- 
pling efficiency giving rise to a reduced informa- 
tion bandwidth. 

Animproved method has been devised (Figure 
11.30c), using a polarization-maintaining fiber, 
again with lateral acoustic excitation. If suitably 
aligned, this mechanical excitation has the effect 
of modulating the phase of one polarization 
mode, relative to that of the orthogonally polar- 
ized mode, resulting in the modulation of the 
polarization of the transmitted light [95]. This 
method is more efficient in its use of the received 
optical energy, as there are only two polarization 
modes in the fiber, and, when these interfere on 
the detector, no optical energy is wasted and fad- 
ing can occur provided the correct phase bias is 
maintained between the two received polariza- 
tion modes (this phase bias can be changed by 
stretchers to set the quiescent operating condi- 
tion at the quadrature, or maximum-slope point 
of the sinusoidal response of the polarimetric 
interferometer). 


11.5 DISTRIBUTED OPTICAL FIBER 
SENSORS 


We shall now discuss fully distributed sensing 
methods. Systems that permit the monitoring of 
not only the magnitude of a physical parameter or 


measurand, but also its variation along the length 
of a continuous uninterrupted optical fiber are par- 
ticularly attractive, for both economic and logisti- 
cal reasons. Distributed sensors not only allow a 
simple easily deployable sensing cable, where the 
communications link and sensor are one entity, but 
they also permit an easy and reliable comparison 
of a parameter at different points (the same inter- 
rogator takes measurements at different points) 
and the sensor cable measures at every point along 
the length with no “dead spots.” This latter aspect 
is particularly important for safety related sensors 
(e.g., fire detectors or detectors for hazardous leaks 
of chemicals) and for sensors to detect intrusion 
through perimeters. 

The simplest form of distributed sensor, the 
optical time domain reflectometer (OTDR) we dis- 
cussed briefly earlier, has been in commercial use 
for many years as a measurement method for tele- 
communications, and is commercially available 
from many manufacturers. However, in spite of the 
relatively early development of the OTDR concept, 
it was several years before its use for the distributed 
measurement of parameters external to the opti- 
cal fiber was envisaged [96]. Rogers suggested, for 
the first time, a method for the measurement of the 
spatial distribution of magnetic and electric fields, 
pressure, and temperature, using OTDR concepts 
in conjunction with polarized light sources and 
polarization-sensitive receivers. More details of 
such sensors are given below. 


11.5.1 Backscattered sensors using 
the OTDR concept (general 
introduction) 


All the sensors discussed in this section make 
use of radar-type (more accurately, LIDAR-type) 
Rayleigh backscattering (or backward-traveling 
light from other inelastic light generating pro- 
cesses, such as Raman, Brillouin or fluorescence) 
to make truly continuous measurements on 
unbroken optical fibers or fiber cables. 

The basic method of optical time-domain 
reflectometry was, as described earlier (Section 
11.4.3, Figure 11.24) the first type of distributed 
optical fiber sensor. In view of its importance 
here, we shall firstly briefly review it and then give 
more details of the principle of operation. A pulsed 
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semiconductor laser is coupled into a section of 
fiber through a directional coupler, which serves 
also to couple the backscattered light fraction, 
captured and returned via the fiber on test, to the 
high-speed optical receiver. 

In normal telecommunications fibers, the 
Rayleigh component of the scattered light repre- 
sents well over 98% of the returning signal (except 
during the short time intervals when more intense 
specularly reflected pulses return from discrete 
discontinuities, such as connectors, air-spaced 
splices or distal fiber ends). 

For uniform fibers, the detected temporally 
varying Rayleigh scattered power I(¢) varies as the 
product of at least five important physical factors, 
all of which can either be chosen by appropriate 
design or component selection. These are first the 
launched energy, E,, second the scattering atten- 
uation coefficient, «,, in the fiber, third the frac- 
tion, S, of scattered light that is captured by the 
fiber in the return direction, fourth the inverse- 
4th power A~* of the optical wavelength, 4, and 
finally the two-way optical attenuation factor, exp 
(Ja(x) dx), that occurs during propagation from 
the source to the scattering point and back to the 
detector: 


I(t)= 5 Eves exp( fowxrax), 


There are also a few other factors that are more 
difficult to change significantly, such as vg, the 
velocity of light in the guide, which has to be 
close to that of silica, as the doping levels are low, 
and the fixed numerical factor of 0.5. The fac- 
tor S has a close to quadratic dependence on the 
numerical aperture of the fiber, and hence on the 
core/cladding index difference, so is higher for 
high numerical aperture (NA) multimode fibers. 
However, the significantly lower attenuation 
coefficient, a(x), in monomode fibers can become 
a more important advantage when very long 
lengths of fiber are probed, despite their much 
lower S value. The SNR of an OTDR reduces very 
rapidly as the distance resolution improves, as 
firstly much shorter pulses are needed (reducing 
the launched energy, E,) and secondly a high- 
speed receiver has a wider noise bandwidth and 


usually a higher noise spectral density too, both 
conspiring to give far worse noise performance 
with short pulse excitation and fast detection’. 
Because the scattering coefficient in high-quality 
fibers does not usually vary significantly along 
the length, the method has proved extremely use- 
ful for measuring spatial variations of fiber atten- 
uation. However, if the geometry or numerical 
aperture of the fiber varies significantly, changes 
in the guidance properties of the fiber (primarily 
in the modal “V” number, that changes the num- 
ber of allowable modes) will cause additional 
variations in the backscattered signature [97,98]. 
These workers showed, however, that variations 
may be at least partially compensated for, pro- 
vided two separate OTDR signatures are taken 
from opposite ends of the same fiber. Clearly, this 
requires two instruments or a loop sensor with a 
two-way fiber switch. 

In the following subsections, several sensing 
methods using OTDR, and variations based on 
various inelastic scattering and fluorescent meth- 
ods, will be described. 


11.5.2 Monitoring of variations in 
attenuation using OTDR 


The original objective of the OTDR method was 
to examine attenuation variations in manu- 
factured and installed lengths of optical fiber. 
One of the first suggestions that it may be pos- 
sible to construct distributed sensors, using the 
attenuation characteristics observed was made 
by Theocharous [99], who proposed a method for 
the measurement of temperature. If an OTDR 
return signal is differentiated with respect to 
time and normalized by division by the instan- 
taneous value of the signal, a measure of the 
fiber attenuation is obtained. If, therefore, a fiber 
with temperature-dependent attenuation is used, 
variations of temperature along the length may 
be monitored. 


* It should be noted here that the radar field [70] is one 
from which numerous techniques, applicable to mul- 
tiplexed and distributed optical sensors, have been 
drawn. This source of inspiration for optical sensor 
ideas will be referred to again late" 
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The technique appears at first sight to be 
highly attractive, but a simple semiqualitative 
analysis suggests that the number of distance- 
resolution elements is likely to be low. To measure 
the average temperature within each distance- 
resolution element, an accurate loss measure- 
ment is necessary, in order to determine the 
smaller temperature-dependent changes in loss. 
Because the OTDR return is usually rather noisy, 
a temperature-dependent loss of at least 1 dB in 
the resolution element (whatever its length) will 
probably be necessary, in order to accurately 
detect small changes, say of 1% (or 0.01 dB) in 
this loss. Thus, the round-trip attenuation in this 
example will be a loss of 2dB for each resolution 
element, times the number of such elements, 
giving a likely limit to the number of measured 
resolution elements of perhaps ten, particularly 
bearing in mind the poorer SNR that will return 
from the later sections of fiber. 

A comprehensive study of rare-earth-doped 
fibers, showing how they could be used to deter- 
mine the line-averaged temperature over their 
length, was carried out by Quoi et al. [100]. Many 
different rare-earth fibers were considered, and the 
interesting concept of detecting light at two wave- 
lengths, one where attenuation increased with 
temperature, and another where it decreased, was 
introduced as a means of compensating (by ratio- 
ing the two detected signals) for losses in bends, 
splices or connectors. 

Because of the attenuation problem over lon- 
ger lengths that was discussed above, distributed 
sensing methods monitoring attenuation are most 
likely to have practical application for sensing using 
fibers or cables which have a low intrinsic loss, but 
which then suffer significantly increased attenua- 
tion at just one point or small region due to some 
event it is desired to detect or locate. Examples of 
this are largely safety or security related, such as 
the detection (and location) of fire using a cable 
that might have high loss if it becomes hot. Several 
published applications for other applications will 
now be discussed below. 

An early means of using OTDR monitored 
attenuation variations for distributed radiation 
dosimetry was presented by Gaebler and Braunig 
[12]. In this application, a short section of a fiber, 
which was exposed to ionizing radiation, suf- 
fered excess attenuation, enabling simultaneous 


detection and location of the radiation exposure 
(see Figure 11.31). If used as a sensor for the moni- 
toring of exposure over a short section of a more 
extensive overall length, the quiescent attenuation 
before radiation is merely that of a normal low-loss 
communication fiber, enabling the increased atten- 
uation of the short exposed section to be measured 
with reasonable accuracy. The method would be 
attractive for detecting leaks of highly radioactive 
material, but silica fibers are rather insensitive, and 
a large part of the radiation-induced loss recovers 
quickly with time. Lead-glass fibers are more sensi- 
tive, but have higher intrinsic losses. 

Several other loss-modulation effects in fibers, 
which were discussed in the first part of this chapter, 
are also amenable to OTDR interrogation. The first 
of these is the plastic-clad silica fiber system (see ear- 
lier Figure 11.6) of Pinchbeck and Kitchen [13] where 
the fiber showed increased attenuation when cooled, 
due to increase in the refractive index of the cladding 
polymer. Clearly, the OTDR offers the possibility of 
leak location, whereas that shown in Figure 11.6 can 
only offer detection of leaks, as it only permits simple 
detection of fiber loss (Figure 11.32). 

The second we shall mention is the commer- 
cially available “Herga” fiber cable (Figure 11.3), 
which exhibits high microbending loss when sub- 
jected to lateral pressure, as a result of its novel 
spiral plastic sheathing arrangement. Figure 11.33 
shows a schematic of an OTDR-based distributed 
sensor, using a pressure sensing cable of this type. 
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Figure 11.31 Schematic of a distributed radiation 
dosimeter, using radiation-sensitive fiber, with an 
OTDR to detect regions of increased loss. There 

is potential for leak-detection system for radioac- 
tive materials. 
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Figure 11.32 Schematic of a distributed leak- 
detection system for cryogenic liquids, using 
PCS fiber, with an OTDR to detect regions of 
increased loss (see also Figure 11.6). 


Sensors such as these have been proposed by Alan 
Harmer, of Battelle research, for use as a distributed 
sensor to detect intruders, by monitoring losses from 
pressure of the intruder’s foot on the cable. 

The same system layout of Figure 11.33 can 
clearly be used with the water ingress sensor 
described earlier ([10,11], see also “Microbend sen- 
sors” section and Figure 11.4), which is based on 
the same microbending concept, and indeed this 
sensor type was developed by the authors using 
an OTDR instrument to detect the changes. One 
useful application is to detect leakage of water in 
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Figure 11.33 Schematic of a distributed detec- 
tion system for lateral pressure, arising, for 
example, from an intruder treading on cable. 
Sensor uses pressure-sensitive cable of the type 
shown in Figure 11.3, with an OTDR to detect 
regions of increased loss. 


telecommunications ducts, as this can cause dam- 
age to optical cables, particularly if it generates 
hydrogen, or if it freezes and breaks the fibers. 

Another useful intrinsic loss mechanism that 
can be used is that of evanescent field absorption 
in the fiber cladding (Figure 11.34). Although, 
in principle, a bare unclad fiber could be used to 
detect absorbing material directly, this would 
easily be contaminated. It is sensible therefore to 
use a polymer cladding with suitable properties. 
Many polymer cladding materials will adsorb oil 
or other liquids and change their absorption [101] 
but if chemical indicators are available, and can be 
incorporated in the polymer, they can offer a more 
selective response to a desired target species. Blyler 
et al. [102] have made a sensor for measurement of 
ammonia using reactive cladding material. 

It should be noted that it is not just absorption 
that may be used to form the basis of the sensor. 
The use of fluorescent coatings will be discussed 
later, after discussion of the use of other types of 
scattering processes in sensors. 


11.5.3 Variations in Rayleigh 
backscatter characteristics 


As already discussed, the use of OTDR to moni- 
tor fiber attenuation depends on the constancy of 
the Rayleigh backscattering coefficient along the 
length of the fiber. However, this may vary signifi- 
cantly in two basic ways, even in fibers of uniform 
geometry and composition. The first form of vari- 
ability occurs in monomode fibers, using polarized 
illumination and polarization-sensitive detection. 
An arrangement as in Figure 11.24 is used, with 
a polarized light source, and now a polarization 
analyzer is used at the detection end. This diagnos- 
tic method, known as polarization optical time- 
domain reflectometry (POTDR), relies on the high 
degree of preservation of polarization exhibited 
by Rayleigh and Rayleigh-Gans scattered light in 
silica fibers, leaving the polarization changes that 
occur due to two-way propagation in the fiber itself 
to be observable. 

The POTDR method was first suggested by 
Rogers [96], who pointed out its potential for dis- 
tributed measurements of magnetic field (via 
Faraday rotation), electric field (via the Kerr qua- 
dratic electro-optic effect), lateral pressure (via 
the elasto-optic effect) and temperature (via the 
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Figure 11.34 Schematic of a distributed chemical sensor system, using fiber with a reactive cladding 
that changes loss after exposure to the chemical. The structure of the fiber, the mode field pattern 
and the refractive index profile are shown in (a). Possibilities include permeation of a polymer clad- 
ding, by, for example, oil leaks, affecting either the refractive index or absorption properties, or 

use of a cladding with a chemical indicator designed to be selective to a desired target chemical or 
chemical species. The OTDR response when detecting leaks is shown in (b). 


temperature dependence of the elasto-optic effect). 
The first experimental measurements were reported 
by Hartog et al. [103], who used the technique for 
a distributed measurement of the intrinsic bire- 
fringence of a monomode fiber, and Kim and Choi 
[104] who measured the birefringence induced by 
the bending of a wound fiber. Ross [105] carried 
out the first measurement of a variable external 
field, using the Faraday rotation of polarization as 


an indication of the magnetic field environment of 
the fiber. A comprehensive theoretical treatment of 
the POTDR method has been presented by Rogers 
[106]. 

On first consideration, the POTDR technique 
appears to be attractive for the measurement of 
a large number of parameters. However, its main 
drawback, as with many other potentially useful 
sensing methods, is the variety of parameters to 
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which it can respond, the sensitivity to strain and 
vibration being particularly troublesome when it 
is desired to measure other things. In addition, 
POTDR requires the use of monomode fibers, which 
can, when used with narrow linewidth laser sources, 
have particular problems due to coherent addition 
from multiple Rayleigh backscattering centers [107]. 

Moving on now from considering variations in 
the polarization of backscattered light, the scalar 
magnitude of the Rayleigh scattering coefficient 
may vary with temperature in some types of fiber. 
If so, a simple OTDR arrangement, now preferably 
without polarization sensitivity, can be used to 
perform distributed temperature measurements. 
Unfortunately, in normal vitreous fibers, of silica 
or multicomponent oxide glasses, the majority 
of backscattered energy arises from Rayleigh or 
Rayleigh-Gans scattering from frozen-in refrac- 
tive index variations. These are tiny regions where 
high-temperature thermally induced density 
changes were “frozen” into the glass structure as 
it is cooled from the melt. As a result, once the 
glass has cooled, the scattered light intensity var- 
ies very little with the temperature of the glass. In 
many normal room-temperature liquids, however, 
the scattering arises from real-time thermody- 
namic fluctuations in the refractive index, which 
are dependent on the ambient temperature and, 
therefore, the scattering will now show a signifi- 
cant temperature coefficient. 

This effect has been exploited for distributed 
sensing of temperature, in the system described by 
Hartog and Payne [108]. Unfortunately, the use of 
liquid-filled fibers presents difficulties, which, at 
best, complicate the method and, under some cir- 
cumstances, restrict its use: 


1. The use of liquid-filled fibers is inconvenient 
because of the need for expansion reservoirs. 

2. There are temperature range restrictions 
imposed by the freezing and boiling points of 
the liquid. 

3. If the central region of a long length of such a 
fiber is rapidly cooled, it is possible to create 
voids by rapid thermal contraction, before 
pressure differentials can refill the tube against 
the resistance of viscous forces. 

4. Impurities or dust particles may increase the 
scattering cross-section or increase the localized 
loss of the fiber, giving the appearance of false 
temperature variations in the OTDR return. 


5. The numerical aperture of the liquid-filled fiber 
shows significant temperature dependence, 
generally reducing as the temperature is raised, 
allowing hot zones in particular, to affect the 
calibration of results further along the fiber. 

(In the absence of mode-conversion, this effect 
may be reduced by the use of mode filters). 


However, despite the above potential drawbacks, 
which are not uncommon with initial ground- 
breaking developments, the system worked well 
over moderate temperature ranges and was capa- 
ble of monitoring the temperature distribution to 
=0.2°C resolution, over several hundred meters 
of fiber, with a distance resolution of the order of 
2m. This was a remarkable advance, considering 
that this was the first experimentally demonstrated 
temperature profile measuring method for use 
with optical fibers. 


11.5.4 Distributed anti-Stokes Raman 
thermometry (DART) 


If the spectral variation of backscattering from a 
germania-doped silica fiber is examined (Figure 
11.35), it may be seen that there is a strong cen- 
tral line, primarily due to Rayleigh (or Rayleigh- 
Gans) scattering, but which also contains a weaker 
(spectrally unresolved in Figure 11.12) contribu- 
tion from Brillouin scattering. At each side of the 
central line, however, there are side-lobes due to 
Raman scattering. These may be used to detect 
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Figure 11.35 Typical spectrum of Raman back- 
scattered light in germania-doped silica fiber 
(measured by N Ross, then at CERL). 
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temperature profiles in conventional vitreous com- 
munications fibers using modified Raman OTDR 
techniques [109,110]. 

A schematic of a basic Raman temperature sen- 
sor system is shown in Figure 11.36. The system is 
essentially a dual-wavelength OTDR system, where 
a WDM filter is used to select out the two Raman 
bands and direct the signals to two detectors. The 
elastic Rayleigh scattering of the strong incident 
light signal (central peak in Figure 11.35) has to be 
removed in the filtering process. A signal processor 
averages the time-varying returns, and then takes 
the two separate OTDR results from each detec- 
tor channel, divides the two signals coming from 
each selected point along the fiber to determine the 
Raman ratio, and hence determine the tempera- 
ture at each point. 

From standard texts on Raman scattering, the 
temperature-dependent ratio, R(®), of the anti- 
Stokes (higher-frequency band) and Stokes (lower 
frequency band) scattered intensity, at wave- 
lengths, 4, and X,, respectively, and assuming equal 
frequency separation from the central excitation 
laser line, is given by the relationship: 
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R(0)=(A,/A,) exp(—hev/KT), 


where h is Planck’s constant, c is the velocity of 
light in vacuo, K is Boltzmann’s constant, 0 is the 
absolute temperature and v is the frequency of the 
incident light. 

Therefore, in addition to the distance infor- 
mation provided by the time delay of returning 
signals, a measurement of the ratio of Stokes and 
anti-Stokes backscattered light in a fiber can, in 
principle, provide an absolute indication of the 
temperature of the medium, irrespective of the 
light intensity, the launch conditions, the fiber 
geometry and even the composition of the fiber. 
In practice, however, a small correction will usu- 
ally need to be made for the difference in the fiber 
attenuation between the Stokes and anti-Stokes 
wavelengths, and if convenient, it may be desired to 
use the dual-end measurement method discussed 
earlier to compensate for fiber nonuniformities. 

The Raman technique appears to have only one 
significant practical drawback: that of a very weak 
return signal, the anti-Stokes Raman-scattered sig- 
nal being between 20 and 30dB weaker than the 
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Figure 11.36 Basic arrangement of distributed temperature sensor using Raman backscatter. Range is 
determined by two-way time of flight of light and temperature by the calculation of the ratio of Raman 


Stokes and anti-Stokes signals at each range. 
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Figure 11.37 Early Raman temperature profile result, from York Ltd, United Kingdom. 


Rayleigh signal, which itselfis already typically 50 dB 
weaker than the incident light. In order to avoid an 
excessive signal averaging time, measurements have 
been taken using relatively high launched powers 
from pulsed lasers, and extensive signal process- 
ing is performed to average the signals. In the first 
experimental demonstration of the method [109], 
a pulsed argon-ion laser was used in conjunction 
with an early telecommunications-grade 50/125 tm 
GRIN fiber. Dakin et al. [110] described subsequent 
results from a considerably more compact system, 
based on a far more practical and convenient semi- 
conductor laser source. A result from an early com- 
mercial prototype constructed by the group of A 
Hartog (previously at York Ltd) is shown in Figure 
11.37. Here the result has been normalized to remove 
the effects of fiber attenuation, and clearly shows the 
hot and cold regions of fiber. 

Since those early results, the Raman OTDR 
system has reached commercial maturity, and is 
another success story in the field of fiber sensors. 
The system has been manufactured for several 
years by York sensors, by the Hartog group, which 
is now part of the large Schlumberger company, 
and in Japan Hitachi cable have demonstrated 
engineered systems. Most systems have the capa- 
bility to address multiple channels via optical fiber 
switches, and can operate over many km of fiber. 
Significant signal gains have been made using 
Q-switched fiber lasers to launch more light, but 
there are launch-power limits in monomode fibers 
due to the onset of stimulated Raman processes. 
Major applications have included fire detection, 
oil-well logging, chemical process plant and fur- 
nace measurements and in-line monitoring of high 


voltage cables and other electrical plant, although 
there clearly many more potential applications may 
become economically viable as costs become lower. 


11.5.5 Time-domain fluorescence 
monitoring 


The re-emission spectrum of most fluorescent 
materials generally exhibits a significant tempera- 
ture variation. Thus, if an optical arrangement 
similar to that used in Figure 11.36 for Raman 
OTDR is constructed, with a laser exciting source 
as before, and with the detector filters now selected 
to examine regions of the fluorescent decay spec- 
trum having the maximum possible differential 
temperature variation, a distributed temperature 
sensor should be possible. The potential attraction 
of the method, first proposed by Dakin [111], is that 
the fluorescent quantum efficiency may be many 
orders of magnitude higher than that for Raman 
scattering and higher doping levels may greatly 
enhance the signals in short distributed sensor sys- 
tems. However, there remains a problem with the 
availability of suitable fibers. 

Silica-based optical fibers, with rare-earth dop- 
ants giving high fluorescent efficiency, have been 
prepared [112], but sensors using these give very 
poor distance resolution due to the long fluores- 
cent lifetimes. It is possible to reduce fluorescent 
lifetime by using materials with increased cou- 
pling to nonradiative processes, but this unfor- 
tunately also reduces the fluorescent efficiency. 
Polymer fibers may perhaps offer more promise 
in short distance systems, as these may be doped 
with organic dye materials with an excellent 
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combination of high quantum efficiency (~50% 
or better) and fluorescent lifetimes of the order of 
only a few nanoseconds. 

Dakin and Pratt [113] made a theoretical com- 
parison between distributed temperature sensors 
based on the techniques of temperature-dependent 
absorption, scattering, Raman scattering and fluo- 
rescence. It was predicted that, although doping 
with strongly fluorescent materials will necessar- 
ily increase the loss in an optical fiber, this should, 
for short distance operation, be more than offset by 
the much higher fluorescent light levels theoreti- 
cally attainable. To make such sensors in practice, 
it will be necessary to obtain fibers having short- 
lifetime fluorescent dopants showing the desired 
thermal variation. 

Lieberman et al. [114] have reported a distrib- 
uted chemical sensor for oxygen, using a fluores- 
cent polymer cladding coating on a fiber that has 
its fluorescence efficiency quenched by oxygen. A 
low-index siloxane cladding, doped with a 9, 10-di- 
phenylanthracene dye, was used. The evanescent 
field of the incident radiation coupled into the clad- 
ding dye and excited the fluorescence, and the fluo- 
rescence was coupled back, somewhat more weakly 
of course, by an inverse coupling process, to excite 
forward and backward guided modes of the fiber. 
Although a distributed sensor, in that continuous 
measurements over an extended length were taken, 
the method does not yet appear to have been used 
to determine the variation in oxygen concentra- 
tion over the fiber length. Clearly, to do this, a dye 
with a short fluorescent lifetime would be neces- 
sary, and, as with Raman OTDR, significant signal 
averaging would probably be necessary to detect 
the weakly coupled signals. 


11.5.6 Distributed sensors using 
spontaneous backscattered 
Brillouin light 


The Raman DART system described above has 
evolved to reach commercial maturity relatively 
quickly, primarily because of its basic optical sim- 
plicity, its simple telecommunications components 
requirements, and because of its similarity to the 
already-mature conventional OTDR method. 
However, Raman signals are extremely weak com- 
pared to Rayleigh elastically scattered light, so rel- 
atively long electronic integration times (typically 


tens of seconds) are needed to achieve the neces- 
sary SNR to realize temperature resolutions of 
1°C or better, particularly if it is intended to probe 
lengths of fiber above 5km or so. 

The total light level associated with spontane- 
ous Brillouin scattering is typically two orders of 
magnitude stronger than that of Raman light, the 
only disadvantage being the very close frequency 
spacing of this light to the incident excitation line. 
Because Brillouin light occurs due to scattering 
from relatively low-energy acoustic phonons, the 
photon energy change is very small. An alterna- 
tive simple classical physics viewpoint, which also 
predicts the correct optical frequency shift, is to 
consider that the scattering arises from thermody- 
namically induced moving acoustic waves in the 
core of the glass, so they become Doppler shifted. 
The resulting frequency shift in the light is there- 
fore much smaller than that for Raman scattering, 
being typically only of the order of 12 GHz, so very 
narrow band interferometric filters are necessary 
to perform optical separation. Suitable narrow- 
band filters are interferometric filters. These may 
be of the Fabry-Perot type, all-fiber types, based 
on the Mach-Zehnder or Michelson configura- 
tions, or equivalent interferometers in integrated 
optics form. 

An alternative is to use coherent detection, het- 
erodyning the Brillouin light with an optical local 
oscillator, suitably shifted from the incident laser 
to give a conveniently low beat frequency to pro- 
cess electronically. A local oscillator with a very 
stable frequency offset can be derived by frequency 
shifting a portion of light from the main excita- 
tion laser (pump laser). The signal at the detector 
is then a low frequency beat signal that exhibits 
a large fractional change for small changes in the 
Brillouin backscatter frequency. Once this fre- 
quency separation has been performed, a number 
of very useful sensor types can be constructed. The 
method of Kurashima et al. [115,116] allowed accu- 
rate determinations of distribution of mechanical 
strain in fibers from the Brillouin frequency shift. 
The heterodyning process makes it relatively easy 
to measure the Brillouin frequency shift, v,, which 
is a function of the fiber core refractive index, n, 
the acoustic velocity, v,, in the fiber core and the 
incident optical wavelength, A: 


Vp=DV,A. 
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Figure 11.38 The distributed temperature sensing system of Wait and Newson, using measurement of 
the Landau-Placzek scattering ratio (the ratio of total Rayleigh scattered light to total Brillouin light). 


Vz is typically of the order of 12 GHz, and, accord- 
ing to various papers by Horiguchi et al. [117- 
120], varies with both temperature (typically of 
the order of 9.4x 10> fractional change per degree 
kelvin) and strain (~4.6€ change, where € is the 
tensile longitudinal strain). 

In addition to the frequency shift, a conve- 
nient temperature-dependent intensity rela- 
tionship that can be used [121] is the ratio of 
Rayleigh scattered intensity to total Brillouin 
intensity, or the Landau-Placzek ratio R,_». 
For a single component glass, the ratio, R,_p, of 
Rayleigh scattered light to total Brillouin scat- 
tered light is given by 


Rup= T;/T (pov /Br -1), 


where T, is the fictive temperature of the glass, 
T is the absolute temperature, py is the density, v 
is the acoustic velocity and B, is the isothermal 
compressibility of the melt at the fictive tempera- 
ture. Clearly, unlike the Raman ratio discussed 
before, this new ratio will depend strongly on the 
fiber core material properties, so renewal ofa fiber 
by one of different type or composition is more 
likely to create the need for system re-calibration. 
However, because the SNR can be much higher 
in Brillouin systems, there is now scope for mea- 
surement over a much longer range. A simple 
arrangement for measuring temperature using 
the Landau—Placzek ratio method [121] is shown 
in Figure 11.38. 


Microwave-frequency heterodyning can be 
used to detect the frequency shift of the Brillouin 
light. The frequency shift, which is generally a 
stronger function of strain than temperature, 
can be monitored, and the intensity signals can 
be used as a more temperature-dependent quan- 
tity, giving the possibility of distributed sensing 
of both strain and temperature. Maughan et al. 
[122] carried out simultaneous measurements of 
both these quantities over 30km of fiber. It was 
possible to measure distributed temperature pro- 
files over a length of 57 km. A strain resolution of 
100 re was achieved, and a temperature resolu- 
tion of 4K. 

More methods using spontaneous Brillouin scat- 
tering with optical frequency domain processing 
will be discussed below. The alternative nonlinear 
process of stimulated Brillouin scattering will be 
discussed later, when systems involving interaction 
of counter-propagating light beams are described. 


11.5.7 The optical frequency domain 
reflectometry technique 


This distributed sensing method is essentially simi- 
lar to the FMCW technique we discussed earlier 
as a multiplexing method. An FMCW system can 
be operated in OTDR-like backscattering mode in 
a continuous length of monomode fiber [123,124]. 
Now, a portion of the launched light signal, derived 
from the optical source with a fiber splitter, is added 
as a local oscillator, to coherently detect returning 
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signals, and a beat frequency is obtained which 
increases in frequency as a function of the distance 
to the point in the fiber from which the light was 
backscattered. Ifthe detected beat signal is displayed 
on a conventional electronic spectrum analyzer, the 
detected power within each small frequency inter- 
val represents the scattered light received from the 
section of fiber that is situated at a distance corre- 
sponding to the frequency offset observed. As the 
frequency slew rate of current-ramp-driven semi- 
conductor laser diodes may be very high (100 GHz 
sis easily achievable) and the frequency resolution 
of commercial electronic spectrum analyzers is a 
few Hz or less, the technique can have a far supe- 
rior distance resolution capability than conven- 
tional OTDR methods. Kingsley and Davies [124] 
even suggested use of the technique for distributed 
measurements over the very small scales involved 
in integrated optical waveguide circuitry, where 
resolutions as low as a cm, or often much less, are 
required. 

A major potential problem with optical fre- 
quency domain reflectometry (OFDR) is the coher- 
ence function of the source, which will modulate 
the received spectrum and therefore distort any 
spatial variation of scattering that it is desired to 
observe. Another approach to the problem uses, as 
with the FMCW method discussed earlier, a fre- 
quency-modulated subcarrier to amplitude modu- 
late the source [125]. This removes the problems 
due to source coherence, but presents a reduced 
resolution due to the lower frequency slew rate 
possible with electronic subcarrier systems. 

The FMCW technique still shows limitations 
when it is desired to provide high sampling rate 
information, or to achieve a high spatial resolution 
of about cm order. The method we will describe 
next is an improvement on this basic method. 


11.5.8 Application of the “synthesis 
of optical coherence function” 
method for high resolution 
distributed sensing, using 
elastic (Rayleigh) scattering 


An alternative technique to synthesize interference 
characteristics arbitrarily, called the “synthesis of 
optical coherence function,” or SOCF for short, 
has been proposed and extensively developed by K 


Hotate of RCAST, Tokyo [126-128]. In this tech- 
nique, the frequency of a laser light source, con- 
nected to an unbalanced fiber interferometer, is 
modulated, using an appropriate electrical bias 
waveform, and the phase of a lightwave propagat- 
ing in just one arm of the interferometer is also 
modulated synchronously with a similar wave- 
form. With this method, any arbitrary shape of 
coherence function can be synthesized, hence set- 
ting the amplitude of the interference fringe in the 
interferometer to be a function of the path length 
difference between a lightwave returning from 
the remote sensing point and that returning from 
another defined (reference) point. 

In recent years, to assist with installation and 
maintenance of optical fiber subscriber networks, 
very precise measurements of the spatial distribu- 
tion of reflective discontinuities (e.g. fiber breaks 
or connectors) are required. For applications such 
as monitoring repeater stations, it is desired to mea- 
sure at a distance of several km, with typically a few 
centimeters or less spatial resolution, a specification 
far too difficult to achieve with conventional time- 
domain methods. The standard FMCW sensing 
method described above, with its high spatial reso- 
lution and wide dynamic range, could be a possible 
candidate, but its measurement time must be made 
much shorter than at present possible, as otherwise 
the optical phase noise arising from environmental 
fluctuation is a problem, particularly over a long fiber. 

As amore viable alternative, a solution has been 
developed using the synthesis of optical coher- 
ence function method [126,127,129]. Figure 11.39 
shows the experimental arrangement. A switch 
generates a series of wide optical pulses. The opti- 
cal pulse forms a window to select and determine 
the desired “range gate,” or the boundaries of the 
chosen test region along the fiber under interro- 
gation. This is because a pulse from the reference 
path can only overlap with a pulse from the path of 
the tested fiber when the two paths have nearly the 
same lengths. A coherence function having delta- 
function-like peaks can be synthesized using the 
FM waveform shown in Figure 11.39. 

The coherence function is synthesized, in such a 
way that, firstly, only one coherence peak can cor- 
respond to the region under test, and, secondly, so 
that all other undesired periodical coherence peaks 
are masked by the time window. The desired peak 
position is then achieved by scanning, to modify the 
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Figure 11.39 Schematic of distributed sensing using the “synthesis of optical coherence function” 


method conceived and developed by K. Hotate. 


phase modulation waveform. Consequently, the dis- 
tribution of reflectivity within the narrow selected 
time/range window can be measured with excellent 
temporal, and hence spatial resolution. In order to 
measure in a new range gate or window, the refer- 
ence delay is changed to select, and determine, this 
new region. An example of the reflectivity distribu- 
tion obtained by this system is shown in Figure 11.40. 

Two reflections from optical connectors, of 
magnitude around —30dB, can be seen clearly, 
whereas a strong reflection from the far end of the 
fiber is completely suppressed by the pulse win- 
dow. Even when measuring through 5km of opti- 
cal fiber, the spatial resolution can be as good as 
6cm, using a range-gate region of 8 m. Clearly, this 
range gate can be moved to select and measure in 
any chosen region of fiber. 


11.5.9 The transmissive FMCW 
method for disturbance 
location 


The FMCW methods may be used to locate dis- 
crete points where mode coupling in a fiber has 
occurred, provided the fiber is capable of support- 
ing two modes (e.g., different polarization modes 
in a high birefringence fiber) having significantly 
different phase velocities. External disturbances, 
which cause cross-coupling from the initially 
excited single mode, will mix or beat on a suit- 
able detection arrangement situated at the far end 
of the fiber. The beat signal will have a frequency 
dependent on the distance from the source, at 
which the coupling to the second mode has taken 
place. 


404 Optical fiber sensors 


5m 20m 
Oy 
(I+ 


Optical fiber 
—20 


Connector Connector 


Measuring range (8 m) | 


~30 >< 6 cm: resolution 


Reflectivity (dB) 


—60 


~70 1 
- -4 -3 -2 -1 0 1 2 38 


Distance (m) 


Figure 11.40 Trace showing backscatter signals 
from a pair of 5-km distant connectors, in a 
simple fiber network shown at top of figure. This 
was taken using the “synthesis of optical coher- 
ence function” method of K Hotate. 


This approach, first suggested by Franks et al. 
[130], is depicted in Figure 11.41. This particular 
implementation used a birefringent fiber, with all 
the transmitted signal energy being launched into 
only one of the two principal polarization modes 
of the fiber. The disturbance to be monitored was 
a lateral pressure on the fiber cable, which caused 
coupling of light into the orthogonal polarization 
mode. 

A convenient attribute of the technique is that 
the relatively close velocity matching between the 
polarization modes, even when using the so-called 
high-birefringence fiber, allows FMCW techniques 


Polarisation 
mode axes 


Lateral pressure 
causing disturbance 


to be operated over lengths very much longer than 
the instantaneous coherence length of the source. 
Two potential difficulties exist with the method, 
however. Firstly, mechanical strains of certain 
critical magnitudes may cause coupling of power 
from one polarization mode to the other and then 
completely back again, resulting in no net beat sig- 
nal. Secondly, disturbances that occur exactly in 
the direction along a fiber polarization axis will 
cause no mode coupling. Otherwise, except for 
these somewhat unlikely conditions, the technique 
appears a simple and elegant method of locating 
the position of lateral disturbances on a continu- 
ous fiber, but is probably unsuited to measuring 
the magnitude of the disturbance. 


11.5.10 Distributed sensing using a 
Sagnac loop interferometer 


The use of Sagnac fiber-loop interferometers for 
detection of rotation has been discussed earlier. 
However, they can also be used to locate the longi- 
tudinal position of a time-varying disturbance act- 
ing at a noncentral location on their sensing loops 
[131,132]. 

The ability of this interferometer to locate the 
position of a time-varying disturbance relies on 
the counter-propagating nature of the light and 
the break in symmetry that the disturbance cre- 
ates when perturbing the loop (see Figure 11.42, 
but please initially disregard the branching into 
the delay loop and to detector 2). 

When a time-varying strain, e(f), acts on a 
Sagnac loop of optical fiber, at a distance, z, from 
the center of the length of fiber forming the loop, 
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Figure 11.41 Transmissive FMCW disturbance location sensor [130]. This takes advantage of the dif- 
ferent propagation speeds of the two principal polarization modes in a birefringent fiber, in order to 
provide the time delays needed for generation of the FMCW beat signals at the detector. There is no 
beat signal until pressure causes coupling of energy from one of these fiber modes to the other one. 
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Figure 11.42 First use of a Sagnac interferometer for disturbance location. The lower part shows how 
a separate delay loop can be added, to form a balanced Mach-Zehnder interferometer when light 
taken from the Sagnac loop is added to that from the delay loop. 


it perturbs the phase, p(t) of the guided light. Due 
to the noncentral location of the disturbance, light 
travelling in one direction is phase-modulated 
before light traveling in the other. This results in 
a net phase difference, Ad(t, z), between the two 
returning counter-propagating wave trains, when 
they return to the detector and interfere, at the 
output of the loop. It was shown by Dakin et al. 
[131,132] that, assuming a small slew rate, dc/dt, 
for the optical phase, then Ag(t, z) is given by 


A(t, Z) < ae) 


8 


> 


where V, is the group velocity of the guided light. 
This, however, presents a problem, as we have 
a sensor response dependent on two unknowns, 
firstly the rate of change of the phase perturbation 
and, secondly, its position, relative to the center of 
the fiber forming the sensor loop. The initial solu- 
tion, in the references above, was to separately mea- 
sure the value of dd/dt. This was achieved using 
the delay loop in Figure 11.42, that the reader was 
asked to initially disregard when describing the 
Sagnac loop response. This additional loop enables 
a fraction of the light which had travelled in one 
direction around the Sagnac loop to be mixed with 
a suitably delayed portion of light derived from the 
same original source. This second delay loop was 


chosen to be of similar length as the Sagnac loop to 
that point, so that a second interferometer, in this 
case a balanced-path fiber Mach-Zehnder could 
be formed, the output of which was detected to 
derive a measure of the actual value dependent on 
the phase change ¢. Differentiation of the derived 
value of # (or, if the phase changes are large and 
fast enough, simple detection of the frequency of 
fringe crossing by connecting the detector to a 
frequency counter or rate meter) yields a value for 
dc/dt. This new disturbance location method was 
applied to detect, and locate, fast thermal changes 
acting at different points of a 200m length of fiber. 

Following this first Sagnac disturbance loca- 
tion method, various new architectures using 
twin Sagnac configurations have been suggested, 
initially by Udd [133], followed by versions by 
Spammer et al. [134-136], Ronnekleiv et al. [137] 
and Fang [138]. All these avoided the need to accu- 
rately balance two independent optical paths. Such 
arrangements effectively used two Sagnac loops, 
configured to share a common fiber sensing sec- 
tion, but it was arranged, via several different opti- 
cal routing means, that the optical path lengths 
of these two loops had different effective centers. 
These allowed two separate detector output val- 
ues to be derived, now giving two response equa- 
tions, and hence allowing calculation of the both 
of the two unknown quantities discussed earlier. 
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The attraction of such configurations is that now 
the two counterpropagating paths in each Sagnac 
loop have the inherent advantage of such interfer- 
ometers, in that they are both intrinsically path 
balanced. In addition, the reference point for the 
center of the loop, where there is zero sensitivity, 
can conveniently be moved to lie outside the fiber 
sensing section if desired. 

These new configurations were operated with 
either (wavelength multiplexed) twin-source con- 
figurations or with other more intrinsically lossy 
arrangements using directional 3dB couplers and 
twin detectors. (The minimum theoretical loss 
of a dual-Sagnac system using 3dB couplers is 
18dB in each Sagnac loop.) A recently reported 
method (Figure 11.43) by Russell and Dakin [78] 
has improved designs further, and for the first time 
allowed the use of a single source and single detec- 
tor, using appropriate WDM routing components 
to “slice” the light from the source into two wave- 
length bands, and route each band of light around 
different Sagnac loops, again with different effec- 
tive optical path center positions, and the desired 
shared fiber sensing section. 

Describing Figure 11.43, a broadband (low 
coherence) Er**-doped fiber superluminescent 
source was spectrally sliced into two wavelength 
bands using wavelength-division multiplexers 
(WDMs). These routed the light along two, essen- 
tially independent, Sagnac interferometer loops, 
one for each wavelength. The first Sagnac loop was 
defined by the bi-directional path ABCEFHI, read 
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clockwise around the sensor. This circuit includes 
a fiber delay coil, C, a sensing fiber length, E, and 
a piezoelectric phase modulator, F. Similarly, the 
second Sagnac interferometer was defined by rout- 
ing light bi-directionally around a path labeled 
ABDEGHI. This again includes a piezoelectric 
modulator, D, a common sensing fiber length, 
E, and a fiber delay loop, G. The effective centers 
of each of the sensor loops are offset (in opposite 
directions) by half of the path length in the delay 
coils labeled C and G. This ensured that each 
Sagnac gives a different response to a common 
perturbation, despite sharing a common fiber sec- 
tion, allowing simultaneous evaluation of the posi- 
tion of the disturbance, its amplitude and its rate 
of change. 

Each Sagnac was phase-biased [139] with sinu- 
soidal strain signals of different frequencies (f, and 
f,), each being one of a set of natural eigenfrequen- 
cies of the fiber loop. The bias frequencies (f, and f,) 
were chosen such that the magnitude of their dif- 
ference frequency |f,—f,| was above the frequency 
range of the expected disturbance signals (ie., the 
base-bandwidth of the output of the sensor). This 
phase bias allows both of the interferometers to 
share a common optical detector, as it provides 
amplitude-modulated carriers of a different fre- 
quency for each signal generated by each Sagnac. 

This system used data-acquisition hardware to 
sample the sensor outputs in real-time and a soft- 
ware system which could be instructed to either (1) 
locate the three largest disturbances observed in 
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Figure 11.43 Twin-Sagnac-loop disturbance location system, using “spectral slicing” of a broadband 
rare-earth-doped fiber light source to obtain two wavelength channels, and WDM routing compo- 
nents to construct two optical loops with effectively different centers. Using modulation signals of dif- 
ferent frequencies, at M, and M,, the beat signals on one detector can be separated electronically. 
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the frequency domain or (2) locate a disturbance 
of selected character, ie., of a specific frequency 
or amplitude. The positions and the frequencies 
of each of the disturbances were then calculated 
and displayed. Using the system, it was possible 
to monitor sinusoidally varying phase distur- 
bances of only 0.025 radian phase change ampli- 
tude (~1.3° optical phase change, or equivalent to 
a few nm fiber stretching) acting on a 40km long 
sensing loop and locate them with 100m position 
resolution. 


11.5.11 Distributed sensing using a 
counter-propagating optical 
pump pulse 


If an optical signal from a steady-state, or CW 
source is transmitted through a fiber to a detec- 
tion system, the power level received will be 
dependent on the total attenuation in the fiber. If, 
however, an intense optical pulse is now launched 
into the optical fiber, in the opposite direction 
(see Figure 11.44), the intensity of the transmitted 
CW lightwave will now be affected by any optical 
gain processes. Such effects can arise from several 
possible nonlinear interactions with the pump. 
The first report of such a system was presented 
by Farries and Rogers [140]. They used a pulse, from 
a NdYAG-pumped dye laser at 617nm, to provide 
Raman gain in a continuous length of monomode 
fiber. The CW beam was a 633nm signal from a 


helium-neon laser source. The Raman gain is very 
sensitive to polarization and therefore the arrange- 
ment is capable of detection and location of lateral 
stresses in a fiber of low intrinsic birefringence, as 
these cause polarization mode conversion, hence 
modifying the degree of Raman gain. 

The technique was a major advance, as it was 
the first of a class of sensors, but suffered, in its 
early form, from a number of practical disadvan- 
tages. As often the case for a first laboratory form, 
it used rather large, and hence inconvenient, laser 
sources, and it is likely the results would have been 
critically dependent on the pump power level of 
the dye laser source. In addition, it would also be 
likely to suffer badly from undesirable polarization 
and other variations due to environmental factors 
such as bends in the fiber. 

More recently, sensors using stimulated 
Brillouin scattering have been reported, one of 
the most significant being the Brillouin optical 
time-domain analysis (BOTDA) system devised 
by Horiguchi [117,118], which has since formed the 
basis of a number of systems for sensing strain in 
very long fibers, using more complex versions of 
the simple arrangement of Figure 11.44. This sen- 
sor takes advantage of gain from the stimulated 
Brillouin scattering process. As with the spontane- 
ous Brillouin scattering process mentioned earlier, 
the center line of the Brillouin gain curve is off- 
set from the pump laser signal. Using a separate 
tunable laser (or more conveniently by deriving a 
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Figure 11.44 General concept of a distributed sensor using counterpropagating optical waves. The 
high intensity pump signal causes nonlinear optical gain processes to occur, which changes the inten- 


sity of the CW probe signal with time. 
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frequency-shifted signal from the one laser source) 
pump and probe beams can be arranged to be 
suitably offset in frequency, to not only ensure 
the desired Brillouin gain occurs, but also, by fre- 
quency sweeping, actually measure the frequency 
of peak gain. Taking advantage of the time-depen- 
dent nature of the signals when the pump is a short 
pulse, the distribution of Brillouin gain shift ver- 
sus distance can be derived by a combination of 
sweeping the laser frequency offset and observing 
the temporal patterns. 


11.5.12 Use of the “synthesis of the 
optical coherence function” 
method for distributed 
sensing by monitoring the 
gain process associated 
with stimulated Brillouin 
scattering (K Hotate, RCAST, 
University of Tokyo) 


Distributed strain sensing based on Brillouin scat- 
tering, as discussed earlier, is a promising technique 
for “smart materials” and “smart structures” appli- 
cations, and sensor systems using Brillouin OTDR 
are already on the market. However, the technique 
developed so far has a spatial resolution limit of 
several meters. In smart structures applications, 
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for example, in an aircraft’s wing, this limit would 
be a major disadvantage. The resolution limit of 
the conventional technique is a consequence of its 
pulse-based nature. An optical pulse for generat- 
ing Brillouin gain has to be longer than the damp- 
ing time of the acoustic wave. With shorter pulses, 
the Brillouin gain spectrum (BGS) broadens out, 
making it more difficult to determine precisely the 
frequency of the spectral peak. Because the typi- 
cal value of the damping time is 25 ns, the practical 
limit for spatial resolution turns out to be typically 
about 1 m. 

To circumvent the resolution limit, a new tech- 
nique has been developed [128]. It is based on the 
control of the interference between the pump and 
probe lightwaves that excite stimulated Brillouin 
scattering (SBS). SBS requires interference between 
two counter-propagating lightwaves. In the tech- 
nique, we control their coherence, so as to local- 
ize the SBS at a specific position in an optical fiber, 
where their correlation is high. 

Figure 11.45 shows the proposed system for mea- 
suring the distribution of the BGS along an optical 
fiber. The light from a 1.55y1m frequency tunable 
distributed-feedback laser diode (DFB-LD) is split 
by a coupler, to provide light sources for the pump 
and probe lightwaves. Light from one output of the 
coupler is intensity modulated (chopped) at a radio 
frequency, using a LiNbO, electro-optic modulator 
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Figure 11.45 Schematic of system for high resolution sensing, using stimulated Brillouin backscatter in 
conjunction with the “synthesis of optical coherence function” method of K Hotate. 
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(EOM), with an electrical drive signal. The output 
is amplified by an erbium-doped-fiber amplifier, 
and then launched into the fiber under test to serve 
as the optical pump (this is shifted in frequency by 
only a very small amount by its intensity modula- 
tion). The other output is intensity modulated, at 
a much higher microwave frequency, v, by a sec- 
ond LiNbO, modulator, so that AM sidebands are 
generated around the incident lightwave of a fre- 
quency v,. The lower sideband, at v,—v, is used to 
serve as the probe beam and the unwanted residual 
sideband and carrier are eliminated by an opti- 
cal filter. This probe beam propagates in the fiber 
under test, in the opposite direction to the pump, 
until it reaches the detector. 

An important point is that the pump and the 
probe undergo identical frequency modulation, as 
they both arose from the laser diode source, and 
the system takes advantage of its capability to be 
frequency modulated directly, by changing its bias 
current. As a result, SBS occurs exclusively at the 
position of peak correlation, the only point where 
the two lightwaves are well correlated. The corre- 
lation peak can be conveniently shifted along the 
fiber, simply by changing the FM frequency f,, of 
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the laser diode source. The increase in the probe 
power, resulting from Brillouin gain in the fiber, is 
detected synchronously using a lock-in amplifier. 
This was referenced using the electrical drive sig- 
nal used to chop the pump intensity. We obtain the 
BGS by varying the frequency of the microwave 
signal used to generate the frequency shift in the 
probe signal. By repeating the BGS measurement 
over the appropriate range of different frequency 
drives, f,,, to the FM, the BGS is obtained as a func- 
tion of position along the fiber. 

Application of the correlation-based Brillouin 
sensor has been investigated [128] for the mea- 
surement of strain distribution in small-scale (few 
cm) material samples. The sample material, cho- 
sen to demonstrate this in the laboratory, was a 
cylindrical acrylate-ring coil former, which could 
be stressed by side pressure to deform it. A single 
sensing coil of dispersion-shifted fiber was first 
wound around it, then bonded to it with epoxy 
cement (Figure 11.46a). The coil former could be 
deformed to an approximately oval cross-section 
to test the sensing system. 

The frequency shift of the modulator was set to 
3.2 GHz, corresponding to a 1 cm spatial resolution 
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Figure 11.46 (a) Photograph of an acrylate ring sample with a single turn of fiber bonded to it. (b) The 
measured strain patterns using stimulated Brillouin backscatter in conjunction with the “synthesis of 


optical coherence function” method. 
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and the strain distributions for various levels, d, of 
ring deformations were measured. Figure 11.46b 
shows that, for small ring deformations, the exper- 
imental results agreed well with the theoretical 
value (solid lines), but deviation from the theo- 
retically expected value became greater at high 
deformation values. This is because the spatial res- 
olution became insufficient to accurately trace out 
the step strain gradient at high deformation values. 
In this scheme [128] a reasonably high measure- 
ment sampling rate, of several tens of Hz, was also 
been realized. 


11.5.13 Distributed sensing to 
determine profiles of strain 
and temperature in long 
in-fiber Bragg gratings 


As a final note on distributed sensing, it should 
be noted that there are several methods capable of 
resolving the effective peak reflective wavelength 
along a fiber Bragg grating, as a continuous func- 
tion of position. Methods to produce very long fiber 
Bragg gratings have been discovered some years ago 
[141] and gratings of up to 1m in length are now 
almost routinely manufactured, primarily intended 
for dispersion compensation in optical telecom- 
munications systems. Methods to determine the 
variation of the peak reflective wavelength along 
the length of such gratings are available, making 
them highly attractive starting components for dis- 
tributed sensors to measure in-fiber strain and/or 
temperature. Very high spatial resolution (typically 
0.1mm) over distances up to 1m is possible, allow- 
ing of the order of 104 sensing elements! The meth- 
ods devised to address such gratings are now quite 
numerous and some are fairly complex, so only a 
brief outline will be given here. 

Early methods made assumptions about the 
nature of the index modulation in the grating, 
in order to derive optical delay (two-way time of 
flight) information from the wavelength compo- 
nents reflected [142] or to get wavelength from the 
delay [143]. Various means, such as tunable lasers 
have been used to determine the wavelength infor- 
mation, and interferometric phase determination 
has been used to detect the time delay. 

One attractive method of defining, more pre- 
cisely, the distance to the measurement point in the 
grating is to use low coherence interferometry. This 


operates in a similar way to that of coherence mul- 
tiplexing and to the distributed sensing methods 
described above. Volanthen et al. [144,145] used this 
method, where the long Bragg grating to be interro- 
gated (initially a chirped one, 50 mm long was used) 
was in one arm ofa Michelson interferometer, while 
the other arm contained a fixed-wavelength much- 
shorter Bragg grating. The effective distance to the 
grating in this second fiber arm could be changed 
in length, using a fiber stretcher, to determine the 
position of good fringe visibility in the long Bragg 
grating in the other arm. The attraction of this 
method was that no prior knowledge of the grat- 
ing wavelength versus distance profile was needed, 
and, unlike the earlier ones, the method could work 
with gratings that did not have a monotonic varia- 
tion of wavelength with distance. 

Some recent systems have used commercial 
coherence domain reflectometers, which conve- 
niently contain an in-built swept Michelson inter- 
ferometer, to determine the point in the long grating 
[146,147]. This grating can then be monitored at any 
point, at any one moment in time, with an external 
acousto-optic tunable filter, which allows the scan- 
ning of interrogation wavelength. In these systems, 
an external optical amplifier was used to compen- 
sate for fiber system losses by boosting the signal 
strength from the ELED source in the instrument. 
Such sensors may have useful applications for mon- 
itoring strain distributions in complex mechanical 
components during factory testing, but are likely to 
remain rather costly and complex for many vehicu- 
lar applications, such as aerospace monitoring. 

In vol II chapters 2,7,20, the more like recent 
technologies and applications for distributed opti- 
cal fiber sensors will be expanded or showing the 
great potential for these technologies. 

In the final section of this chapter, we discuss the 
development of applications of optical coherence 
tomography (OCT) and low coherence interferom- 
etry in both medical and industrial applications. 


11.6 APPLICATIONS OF OCT 
AND LOW-COHERENCE 
INTERFEROMETRY 


As discussed in the OCT section of the fiber optic 
sensors for medical applications chapter, OCT has 
become an established medical imaging technique. 
By far its greatest commercial success has been in 
the field of ophthalmology. Today it is widely used, 
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for example, to obtain high-resolution images of 
both the anterior and posterior segments of the eye 
as well as measuring the depth of eye. Today OCT is 
being used as a test for glaucoma and to monitor its 
progression [148] and to provide detailed examina- 
tion of the retina, which can, for example, provide 
a straightforward method of assessing macular 
degeneration [149], diabetic macula edema [150] 
and axonal integrity in multiple sclerosis, [151]. 

Besides ophthalmology, OCT has been clini- 
cally demonstrated in a variety of other medical 
and surgical specialties, including gastroenterol- 
ogy, dermatology, cardiology, oncology and den- 
tistry among others. Today OCT is being used 
throughout the world to assess coronary artery 
disease and is being used to image and assess arte- 
rial plaque, stents and scaffolds [152]. Quantitative 
screening for skin cancer is starting to occur using 
polarization sensitive OCT [153]. Recently OCT 
has been used intraoperatively during breast sur- 
gery to characterize the lymph nodes and to locate 
tumor margins [154]. OCT is also being investi- 
gated to assess the progress of dental caries and 
other structural changes in structure of teeth by 
observing changes in the scattering properties of 
the teeth over time [155]. Figure 11.47a shows a 
camera image of a sectioned tooth having dental 
caries in the region outlined by the black box and 
Figure 11.47b shows the corresponding OCT false 
color image obtained by scanning across the top of 
the tooth. The white region in Figure 11.1b corre- 
sponds to the decay region in Figure 11.47a. Dental 
caries is shown to increase the scattering of a tooth 
and to penetrate to a much greater depth than ina 
normal tooth. 

OCT and low coherence interferometry are 
becoming well accepted as a nondestructive test- 
ing tool in a diverse set of industrial applications 
including tubing inner diameter, outer diameter 
and wall thickness measurements, single and mul- 
tilayer material thickness and index of refraction 
measurements, medical device inspection includ- 
ing balloons and catheters [156] and in the field 
of ophthalmic metrology to characterize contact 
and intraocular lenses [157,158]. Materials can 
range from semiconductor wafers, glasses and 
polymers. Fiber-optic probe-based OCT and low- 
coherence interferometry (LCI) systems are readily 
adaptable to industrial environments and can be 
made to access and scan interiors of hard to reach 
surface and can operate in a number of hostile 


environments including radioactive, cryogenic or 
very hot. They are being used for both in-line and 
off-line process control applications. LCI with high 
speed data acquisition, analysis and submicron 
resolution with process feedback is being used to 
control manufacturing processes. LCI systems are 
also being used to determine stack layer thickness 
for cell phone displays and other large LCD and 
OLED displays [159] and have been used to evalu- 
ate the layer structure and wedge angle distribu- 
tion in head-up automotive windshields [160,161]. 
The following section shows details of a high reso- 
lution low coherence interferometer that is used in 
process monitoring, inspection, and quality con- 
trol applications in various fields. 


11.6.1 Instrument design 


The sensors described here are all variants of an 
interferometric sensor, based on a dual Michelson 
arrangement, having sub-micron distance accu- 
racy for measuring optical distance. Several ver- 
sions have been developed, for use in a variety of 
in-factory quality control and process monitoring 
applications. The basic system [162,163] includes 
both coherent and low-coherence light sources 
and employs the well-known optical configura- 
tion of the optical autocorrelator. Light from the 
low-coherence light source is guided to the sam- 
ple under test via a single-mode fiber. Light beam 
components, which are partially reflected from 
each of the optical interfaces in the sample, mix in 
the interferometer section, to give a fringe pattern 
that is dependent on the type of source used and on 
the various optical path differences. The detected 
response, as the interrogating interferometer is 
scanned in length, is used to determine sample- 
dependent parameters, such as optical distances. 
When reflection occurs from input and exit sur- 
faces, the optical thickness of sample can also be 
measured. 

The instrument described is a dual Michelson 
arrangement in an autocorrelation mode as 
described in references [163-165] and a schematic 
of the instrument is shown in Figure 11.48. With 
reference to Figure 11.48, low coherence light from 
a superluminescent light emitting diode (SLED), 
having a center wavelength of ~1310 nm and a 
bandwidth of ~ 50 nm, passes through Port 1 and 
Port 2 of an optical circulator into a sample fiber. 
The light from the sample fiber passes through an 
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Figure 11.47 Images of a sectioned tooth having dental caries. (a) Camera image of the tooth, and 
(b) OCT image obtained by scanning across the top of the tooth. 


optical probe and is focused onto the sample under 
test. Light reflecting off of each optical interfaces of 
the sample passes back through the optical probe, 
back through the sample fiber, back through 
Port 2, and passes through Port 3 of the circula- 
tor into an all-fiber dual Michelson interferometer. 
Before entering the interferometer, the light from 
a 1552nm laser diode is combined with the low- 
coherence light that was reflected back from the 
sample via wavelength division multiplexer WDM 


1. The combined laser and low coherence light 
passes through a 50/50 fiber coupler and is split 
into two beams. Each beam passes through a PZT 
fiber stretcher, and is reflected back through the 
fiber stretcher by a FRM. 

During operation, voltage waveforms are 
applied to the pair of PZT fiber stretchers, 180° 
out of phase with each other. The voltage alter- 
nately changes the path lengths of the pair of fiber 
stretchers in a push-pull configuration. The 2 
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Figure 11.48 Schematic of a high resolution dual low coherence interferometer operating in the auto- 


correlation mode. 


beams of light at each wavelength, reflected from 
the Faraday Mirrors, interfere with each other as 
they are recombined at the 50/50 coupler. The com- 
bined interfering reflected light is passed through a 
second wavelength division multiplexer (WDM 2) 
where it is separated into individual interfer- 
ing beams, at 1552 and 1310nm, respectively. The 
beams are then sent to respective photodiodes, 
which convert the interference signals to voltage 
levels as a function of time. 

The 1310 nm interfering light passes through 
a 1310nm photodiode and is preprocessed 
before A-D conversion. It is common practice 
to add balanced detectors and log amplifiers 
to increase the sensitivity of the measurement. 
The 1552 laser interferometer signal is sent to a 
zero-crossing detector, which is used to provide 
a uniform distance scale clock for acquiring the 
low coherence 1310 nm interference signal. The 
laser clock signal is used to trigger the A-D con- 
version of the low coherence interference signal, 
ensuring that data is collected at constant dis- 
tance intervals. Further details of operation and 
performance of the interferometer can be found 
in Reference [165]. 

Application-specific components of the system 
include the optical probes, optical switches, sam- 
ple and probe transport mechanisms and the peak 


processing algorithms [166,167] used to provide 
sample information and user interfaces. Example 
applications include measurement of thickness, 
and thickness profiles, of polymer films [168], 
measurement of optical retardation in films [169], 
measurement of liquid thickness distributions 
on coating hoppers [163], the length calibration 
of optical cells, assessment of the focus of digital 
camera imagers [170,171], surface profile mea- 
surements of films, wafers and imagers, head-up 
display (HUD) windshield assessment [160,161] 
and cell phone display integrity [159]. There are 
clearly many other potential uses of this versatile 
telemetry method. 

For the low-coherence light source, interfer- 
ence with visible fringes only occurs when the path 
lengths of the two arms in the interferometer are 
equal to within a few coherence lengths. In order 
for any interference to occur, light must of course 
be reflected back into the interferometer from the 
sample. This will occur due to Fresnel reflection at 
each optical interface in the sample. The distance 
between adjacent interference peaks is a measure 
of the optical thickness (group index of refraction, 
n, times the true physical thickness) of the sample 
material. In air layers, the distance between the two 
adjacent surfaces is approximately the thickness of 
the layer, as n,;, is very close to unity. Because the 


air 
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instrument uses a stabilized laser light source to 
provide constant distance interval measurements, 
the instrument measures optical path distance, 
defined, in regions of media other than vacuum, as 
n times the physical thickness. 

As mentioned above, the measurement configu- 
ration of the interferometer is that of the optical 
autocorrelation mode, in which light reflecting 
from the sample is input to both arms of the 
Michelson interferometer. In the autocorrelation 
mode, light beams reflecting from the sample are 
made to interfere in a way that both arms of the 
interferometer see reflections from all of the opti- 
cal interfaces in the sample (as an example the 
front and back surfaces of a film). As the path 
lengths of the coiled fibers in the interferometer are 
changed, a series of visible clusters of interference 
peaks are observed, indicating the optical path dif- 
ferences between adjacent optical interfaces. The 
zero-path-difference, or self-correlation condition 
occurs at the point when the two path lengths of 
the Michelson interferometer become equal; in 
which case, all optical interfaces in the sample give 
strong visible interference fringes. The measured 
distance between the largest peak, corresponding 
to this zero path length difference, and the first 
set of adjacent peaks, is a measure of the shortest 
optical path difference in the sample. One major 
advantage of using the autocorrelation configura- 
tion is that the interferometer can be located at any 
distance away from the sample interface, provided 
the fiber losses permit it. A system with the mea- 
surement head located as far as 10km from the 
interferometer has been demonstrated. It is also 
a common practice to have a reference reflection 
built into the probe so that the order of peaks in 
the observed interferogram are in the same order 
as the optical interfaces in the sample. 

The system processor uses a peak-location 
analysis technique, to find the true center of the 
envelope of a cluster of interference fringes. This 
envelope is, to a first approximation, a cosine func- 
tion, having a Gaussian intensity-modulation enve- 
lope. Various peak location algorithms to find the 
true location of the interferogram peaks have been 
developed [171] including moment calculations, 
Gaussian-peak analysis and Fourier-phase-slope 
analysis [167]. When using a sampling distance 
interval of 4/4, they provide measurement repeat- 
ability better than 10 nm, with samples having suf- 
ficient separation between adjacent peaks. Once 


the peak locations are calculated, the appropriate 
distances relevant to the measurements being per- 
formed must be computed. Distance calculations 
are noted and compared to defined acceptance 
ranges and thresholds. 

We now review a sampling of applications of 
current interest to a variety of diverse fields. 


11.6.2 Center thickness, sagittal 
depth, and refractive index 
measurements of contact 
lenses in solution 


The refractive index of a lens is an important 
parameter in lens design. When designing a lens to 
have a specific refractive power, the index of refrac- 
tion of the material affects the thickness and cur- 
vature of the lens. Lens thickness is an extremely 
important parameter in the design of contact 
lenses and intraocular lenses. In soft contact lenses 
the index of refraction also changes as a function of 
hydration and it is important to measure them in 
their hydrated form. Recently LCI and OCT have 
been applied to the measurement of the index of 
refraction of soft contact lenses in solution. 

The LCI measurement geometry is shown 
in Figure 11.49 for a contact lens mounted in 
a cuvette filled with saline solution. The opti- 
cal probe is located above the cuvette and the 
measurement location is centered on the lens. 
The physical distance from the inner wall of the 
cuvette top to the inner wall of the cuvette bottom 
is d,. The saline solution has an index of refrac- 
tion n,. The LCI measurements are performed 
with the lens being centered over the optical 
probe of the LCI. The lens has physical thickness 
t, and the optical thickness n,t, is measured where 
n, is the refractive index of the lens. The physi- 
cal sagittal depth or sag of the lens is defined as S$ 
and the optical distance n,S is measured. The top 
gap optical distance n,G is also measured during 
this step. Once the measurements are performed 
as shown in Figure 11.3, the lens can be removed 
from the cuvette without moving the cuvette so 
that n,d, can be measured at the same location. 
The solution is then removed from the cuvette 
without moving the cuvette to measure n,d, at the 
same location and calculating d, using the known 
refractive index of air. Once d, is measured then 
n, can be calculated along with S and G. The 
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Figure 11.49 Geometry for measurement of a contact lens mounted in a cuvette containing saline 


solution and the LCI parameters being measured. 


thickness of the lens ¢, is then calculated from the 
relationship d,-S—G and the index of refraction 
of the lens can then be calculated from the ratio of 
the measured optical thickness of the lens to the 
calculated thickness. 

Figure 11.50 shows a sample LCI scan of the lens 
shown in Figure 11.49 measured at the center of the 
lens. Further details of the measurement procedure to 
measure index of refraction and center thickness are 
described in reference [157] along with the appropriate 
relationships. The developed procedure allows us to 
obtain measurement repeatability for group refractive 
index less than 1x10 for materials with thicknesses 
on the order of 100 pm, when measured in liquid. The 
measurement repeatability further improves for mea- 
surements in air, or for thicker materials. Combining 
this measurement with a digital camera to determine 
the diameter of the lens allows one to also measure the 
base curve of the lens [158]. 
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11.6.3 Automotive HUD windshield 
wedge angle and thickness 
assessment 


Automotive windshields are comprised of laminated 
safety glass made from two layers of glass that are 
bonded together with one or more layers of polyvi- 
nyl butyral (PVB). Most windshields are not simple 
flat structures, but include slight curvatures in both 
the horizontal and vertical dimensions. Many wind- 
shields today include an acoustic layer to help elimi- 
nate outside noise. HUD technology has recently 
become popular in automobiles as it creates inherent 
driver safety advantages by displaying critical infor- 
mation directly in the driver’s line of sight, reduc- 
ing eyes off road and accommodation time. This 
is accomplished using a system of relay optics and 
windshield reflection to generate a virtual image that 
appears to hover over the hood near the bumper. The 


os ala 
14000 14720 


ae 
13000 


1 Lath i ' ita ve 
8000 9000 10000 11000 12000 


Optical thickness (1m) 


Figure 11.50 LCI scan signal of the lens shown in Figure 11.49 measured at the axial location 


shown in the figure. 
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windshield is an integral optical component of the 
HUD system, but in a uniform thickness windshield, 
the windshield-air interface causes a double image 
ghost effect as a result of refractive index change, 
reducing HUD image clarity. It is common practice 
to add a small constant wedge angle to totally elimi- 
nate the ghosting effect. However, this only works 
for the standard height driver and the appropriate 
wedge angle should change as a function of height of 
the driver due to changes in viewing angles between 
different drivers [161]. 

HUD windshields can be scanned over the sur- 
face of the windshield with a handheld optical probe 
[160] attached to the autocorrelator instrument 
described above an example of which is shown in 
Figure 11.51. Figure 11.5la shows how the hand- 
held fixture is used to measure the windshield of a 
2007 Chevrolet Corvette, equipped with a factory 
installed HUD system. The yellow line is the single 


(a) (b) 


Layer thickness (nm) 


mode optical fiber cable that connects the optical 
probe to the low coherence interferometer. Figure 
11.51b shows the thickness profile of each layer 
within the windshield. During the measurement, 
the probe is moved at an approximately constant 
velocity for the bottom to the top of the windshield, 
therefore the distance scale in the graph is approxi- 
mate. It is observed that the PVB layer gets thicker 
as the probe moves from the bottom to the top of 
the windshield while the thickness of the glass layers 
remains constant. The change in the thickness of the 
PVB layer indicates presence of the wedge angle as 
required for HUD. 

Figure 11.52 shows an interferometer single 
depth scan at a single measurement location of a 
five-layer acoustic windshield. The horizontal axis 
corresponds to the travel of the reference mirror 
of the interferometer. The peaks in the signal cor- 
respond to the interferometric signal registered by 
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Figure 11.51 (a) Image of a 2007 Corvette with a windshield scan in progress using a hand-held 
probe. (b) Thickness profile of each layer as a function of distance from the bottom of the windshield. 
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Figure 11.52 (a) Interferometer point scan for a five-layer acoustic windshield. (b) Windshield struc- 
ture along with index of refraction of each of the materials. 
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the detector as a function of interferometer scan 
depth. The signal has been demodulated and com- 
pressed before it is plotted. The distance between 
the adjacent peaks is equal to the optical thickness 
of the corresponding layer. The five-layer wind- 
shield contains an acoustic layer and an additional 
PVB layer, as compared to the three-layer wind- 
shield shown in Figure 11.51. Figure 11.52b shows 
the cross-section of the windshield layers along 
with their corresponding refractive indices. The 
interfaces are numbered, with the number 1 inter- 
face corresponding to the left-most signal peak on 
the interferometric graph of Figure 11.52a, while 
peak 6 corresponds to the right-most signal peak 
on Figure 11.52a. 

Figure 11.53 shows layer thickness distribu- 
tion of a five-layer windshield having a constant 
wedge angle over a portion of the windshield, 
located between 225 and 525mm from the bot- 
tom edge, where the thickness of the PVB and 
acoustic layers change linearly. This region of the 
windshield encompasses the viewing location of 
the HUD image for the nominal driver. During 
measurement the probe fixture was moved at a rate 
of 5 mm/s, resulting in 0.1mm spacing between 
successive data points. The wedge angle of the 
combined acoustic and PVB layers was measured 
to be 0.588 mrad, as calculated from the slope of 
the best fit line for all the data points. The standard 
deviation of the wedge angle was calculated from 
the local slopes using sets of five adjacent locations 
to be 0.011 mrad. 


11.6.4 Measurement of cell phone 
displays 


The most common touchscreen technology used 
in cell phones is capacitive, an example of the layer 
structure is shown in Figure 11.54a. It is based on 
detecting the change in the capacitance between 
two arrays of electrodes, when the human finger 
approaches the surface of the screen. The touch 
module consists of two layers of electrodes encom- 
passing an insulating layer (e.g., layer of glass), 
which acts as a spacer for the capacitor. The mod- 
ule is placed on top of an LCD or OLED stack, and 
the overall system is protected by a cover glass (e.g., 
Gorilla glass). In order to attach different compo- 
nents of the touchscreen together, manufacturers 
use a layer of adhesive. Unlike the thickness uni- 
formity of the glass substrates, the uniformity of 
the adhesive layer is difficult to control. In the past, 
the nonuniformity of the layers was acceptable to 
touch-screen manufacturers. However, as the tech- 
nology of touchscreens moves ahead at a high pace, 
manufacturers are experiencing more and more 
pressure to ensure that the layer stacks are uniform 
in order to improve image quality as well as the lon- 
gevity and durability of their touchscreens [159]. 
Figure 11.54b shows the interferometric signal 
acquired for a smartphone touchscreen. The mul- 
titude of peaks indicate numerous layers present 
under the surface—one can see the top cover glass, 
the substrates containing indium tin oxide (ITO) 
electrodes, the spacer between the electrodes, and 
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Figure 11.53 Acoustic windshield quantitative partial scan of wedged layers and their total thickness 
as a function of height from the bottom of the windshield. 
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Figure 11.54 (a) Example touchscreen layer structure. (b) Interferometer point thickness scan of a 
touchscreen. (c) Layers observed in the structure shown in the interferometer trace of (b) and their 


respective measured thicknesses. 


the adhesive layer between the touch module and 
the display. Below the adhesive layer (toward the 
right portion of the graph) are the polarizers and 
the layer containing thin film transistors (TFT). 
Some of the layers, such as the ITO and TFT, are 
much thinner than what the low-coherence inter- 
ferometer is able to measure. In this case, the two 
interfaces of such thin layer appear asa single peak, 
and the measured thickness of the adjacent layers 
can therefore be slightly larger than the actual 
material. The numbered thickness regions are 
defined in Figure 11.54b along with their measured 
thicknesses. The same instrument can be adapted 
to generate a visual representation of cross-sec- 
tional images during a scan along a line of a screen 
stack as described with respect to Figure 11.55. 
Figure 11.55 shows the cross-sectional OCT 
images obtained for different model smartphones 
including Samsung Galaxy S3, $4 and S5, as well as 
two different generations of ipad. These cross-sections 
were obtained while moving the optical probe in 
the middle of a screen, along the long dimension. 
The devices were lying flat, with the screens facing 
upward. The graphs match this orientation. The lines 
represent the interfaces between the inner layers, with 
the top line corresponding to the outer surface of the 
cover glass. The brightness of the lines corresponds 
to the reflectivity of the interfaces—the brighter the 
line, the more reflective is the surface. One can imme- 
diately spot two principal differences between the 
scans. First, the scans for ipads clearly show inconsis- 
tent adhesive layer thickness. Second, the thickness of 
the cover glass becomes thinner for more recent ver- 
sions of the devices. It is consistent with the manu- 
facturing trends for the display cover glass. Note, 


that because the ipad cross-sections were mapped 
by assuming that the top-most surface is flat, the lay- 
ers beyond the adhesive layer appear to be deformed. 
due the adhesive layer inconsistency. However, most 
likely both the touchscreen and the image display 
layers experience some sort of the deformation. The 
cross-sectional image can be reacquired by mapping 
the top surface with respect to a known flat surface in 
order to map out the deformations exactly. 


11.6.5 Imager flatness assessment 
and simultaneous thickness, 
surface and index profiling 


Surface profiling is an important aspect of many 
industrial processes. We describe here its applica- 
tion to the measurement of flatness of large imag- 
ers and wafers after being glued into a package. 
Here, a thick optical flat is placed above an imager 
that has been glued into a package or a large wafer. 
It is coupled to an XY scanning frame as shown 
in Figure 11.56a. The bottom surface of the opti- 
cal flat becomes the reference surface in order to 
determine the flatness of the imager. Figure 11.56b 
shows the surface profile results, at a measure- 
ment rate of 200Hz, with the instrument set to 
take y-axis steps of 0.25mm, as it is scanned at a 
rate of 50mm s21 along the x-axis. The data are 
shown relative to the best reference plane and 
are inverted. It can be seen that this imager has 
a slightly convex surface, being bowed upwards 
by about 6 jm in the center. Profiles can also be 
obtained before, during and after curing of the 
imager into the package. 
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Figure 11.55 Cross-sectional images of line scans across different cell phones. 


We now discuss how we have extended the 
concept, to perform very rapid profiling of sev- 
eral simultaneous properties of films, as they pass 
rapidly through the measurement cell shown in 
Figure 11.57. This optical cell design is used for 
simultaneous thickness, surface and (refractive) 
index profiling (SSTIP) of polymer films, glasses 
and semiconductor wafer materials [164]. In the 
cell shown in Figure 11.57 the sample is placed 
between a pair of thick optical flats and can be 
held flat. An XY scanning frame is located above 
the optical probe and the distance between the 
stationary optical flats as a function of position 
d, can be measured and assumed to be invariant. 
Variations in the top gap distance d, between the 
top glass optical flat and the sample as a function 
of position defines the top surface profile of the 
film sample. Likewise, variations in the bottom 


gap distance d, between the bottom glass optical 
flat and the sample as a function of position defines 
the bottom surface profile of the film sample. The 
optical thickness of the film nt is also measured as 
a function of position. As in the contact lens case 
the thickness of the film as a function of position is 
calculated as t=d,—d,—d, as a function of position 
and the film index of refraction is calculated as the 
measured optical thickness divided by the thick- 
ness calculated at each measured location. 


11.6.6 On-line coating thickness 
measurements 


LCI and OCT can be used to assess coating hop- 
per processes. They can be used to assess coating 
uniformity, and coating dynamics. They have been 
utilized to assess the time it takes a production 
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Figure 11.56 (a) Photograph of a testing arrangement for testing imager flatness. (b) 3-D map of 


imager surface for the imager shown in (a). 
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d,, is the distance from upper glass 
SS) plate to the top film surface 


nt is the product of the film thickness 
and the index of refraction 


d1 is the distance from the bottom film 
surface to the lower glass plate 
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Figure 11.57 Measurement arrangement to simultaneously measure surface profiles, film thickness 


and refractive index of film materials. 


process to settle down and reach steady state after 
the production fluids are first introduced into the 
coating stations. Figure 11.58 shows a photograph 
of an eight adjustable optical probe mount fixture 
with two optical probes mounted on the probe 
mounts and installed on a production multilayer 
coating station. During operation the line of opti- 
cal probes can be scanned across the width of the 
hopper to assess uniformity of all of the layers 
simultaneously. 


Figure 11.59 shows data from a hopper purg- 
ing experiment in which water was replaced with a 
coating solution. It is important to determine when 
the coating solution stabilizes to its desired final 
concentration and flow rate. The optical thickness 
of the coating solution flowing down each of the 
slots is measured as a function of time and the stabi- 
lized thickness is noted. In no cases did it take more 
than 10L of flow to reach the equilibrium thickness. 
Thus the 10-L point was used as the normalization 
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Figure 11.58 Photograph showing multiple probes for on-line assessment of liquid layer thickness 


during operation of a coating plant. 
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Figure 11.59 Graphic results of a hopper purging study. 


point for the data. The plots show the normalized 
measured thickness or concentration as a function 
of total flow through the slots. Most of the slots and 
runs show very similar trends. When we lowered 


the flow rate from 4 to 2 L/min the fluid took longer 
to stabilize at its final equilibrium point. 

This section has provided an overview of some 
industrial applications for OCT and LCI. 
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11.7 CONCLUSIONS 


This chapter has given an overview of the field of 
fiber sensors, starting with simple intensity types, 
then more complex interferometric ones. Designs 
for multiplexed and fully distributed sensors have 
been dealt with in considerable depth. The chap- 
ter has concluded with an overview of various 
applications for low coherence interferometry and 
OCT which are being used in industrial process 
monitoring and control applications. Owing to the 
breadth of the subject, even this large chapter could 
not cover many concepts, so the authors apologize 
to any researchers if their work has been omitted. 
A comprehensive bibliography is given below in 
order to allow the reader to glean further details 
of this subject. 

It has been shown that there are very many 
ways of sensing using optical fibers. Many sensors 
reported in scientific papers will retain only aca- 
demic interest, but a few technologies are already 
making significant inroads into industrial, medi- 
cal and military applications. The general trend, 
as with any relatively new technology, is first to 
fill niche markets, before costs can eventually be 
reduced to meet mass markets. Although more 
complex in nature, the use of techniques for 
achieving multiplexed and distributed sensors 
is increasing rapidly. It is difficult to general- 
ize on preferred approaches, but the TDM and 
WDM methods of multiplexing, and OTDR- 
type methods (including Raman, Brillouin and 
fluorescent variants) for distributed sensing have 
many advantages. The achievement of more sen- 
sors per monitoring station, and the correspond- 
ingly greater ease of making comparisons of the 
measurand value at each sensor head, is a factor 
likely to increase the practical use of optical sen- 
sor systems. The major area requiring evolution 
of technology is the extension of these methods 
to address (and separate effectively!) several 
different physical parameters simultaneously 
on a common optical highway in an economic 
manner. 

It will be interesting to observe which methods 
endure the passage of time to be developed into 
cost-effective and reliable system instrumentation. 
The factors in favor of optical sensor highways are 
the ever-decreasing cost of fiber cable and the dra- 
matic improvements of cost and performance of 
optical components. 
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12.1 INTRODUCTION—SENSING BY 


LASER RADAR (LIDAR) 


12.1.1 Laser radar and laser 
properties 


The myriad of modern applications for lasers 
includes the remote sensing and investigation 
of distant objects. When this work is conducted 
outdoors, at ranges of a few tens of metres to 
many hundreds of kilometres, the subject is usu- 
ally called “laser radar”. It must be admitted that 
this is something of a misnomer—the term radar 
itself derives from “radio detection and ranging”, 
which was of course developed over 60 years ago 
for detection of aircraft by long-wavelength radio 
waves. However, we are presently considering the 
use of very much shorter wavelengths—that is, 
light waves in the visible and near-visible region. 
In consequence, the terms “lidar” for “light detec- 
tion and ranging” and “ladar” for “laser detection 
and ranging” have also been introduced. While 
some attempts have been made to differentiate 
the usage of the three expressions “laser radar”, 
“lidar” and “ladar”, they are in fact generally used 
freely and interchangeably. However, one can doa 
great deal more than just “detect and range” with 
lasers. In consequence, two other terms are also 
used for more specific applications: laser Doppler 
velocimetry (LDV)—use of the Doppler prin- 
ciple for remote velocity measurements, and dif- 
ferential absorption by laser (DIAL)—chemical 
detection with lasers tuned on and off resonance 
absorption. 

The principle of laser radar is of course very sim- 
ple and straightforward. The laser beam is sent out 
from an optical transmitter towards the object of 
interest, often referred to as the “target”. The target 


Table 12.1 Laser properties 


Primary characteristics 


will scatter light from the beam and some of this 
will be reflected back towards a receiver—usually 
placed adjacent to the transmitter and often indeed 
using the same optical arrangement set up as a 
transmitter/receiver. This scattered and reflected 
light will contain information about the target so 
that, after the light has been detected within the 
receiver and converted into an electrical signal, the 
information may be extracted. Lasers have many 
very desirable properties for use in remote sensing 
and these are summarized in Table 12.1. Many of 
these laser properties and the unique characteris- 
tics of laser radiation are discussed at considerable 
length in other chapters of this volume, but it is 
worth noting a few points of particular relevance 
to remote sensing. 

Many of the secondary characteristics derive 
from the primary characteristics; thus, precise 
focusing and pointing accuracy are due to good 
coherence properties and, combined with high 
brightness, can contribute enormous power den- 
sity (it has been noted that, in certain high energy, 
exceptionally short-pulse lasers, the actual instan- 
taneous power level may be greater than all the 
power stations of the world combined). Of particu- 
lar value for lidar operation is the wide range of 
available lasers and wavelengths, extending from 
the near ultra-violet at wavelength X~ 0.3 1m up to 
the mid infra-red at }X~12m. For outdoors, any 
laser system must of course be operated in a way 
that presents no hazard whatsoever to people or 
equipment in the locality. At wavelengths between 
~0.38 and ~1.5 1m, this necessitates careful control 
of the transmitted beam to ensure laser intensity 
levels remain below established thresholds (tak- 
ing account of scintillation in the atmosphere; see, 
e.g., Reference [1]). In the so-called eye-safe regime 
(A < 0.38 and > 1.5m), where radiation is at least 


High brightness 


Good spatial coherence 


Good temporal coherence 


Secondary characteristics 


Wide range of available wavelength 


Continuously tuneable in certain regions 


Continuous wave (cw) and pulsed output 


Great pointing accuracy 


Precise focusing 
Enormous power density 
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not transmitted through to the retina, conditions 
are not quite so stringent. Thus, for many applica- 
tions, operation with such eye-safe lasers has been 
preferred. This in fact has rather profound implica- 
tions for the development of lidar technology and 
its broad division between the two classes of laser 
radar discussed in the following sections. 


12.1.2 Remote sensing applications 
of laser radar 


A list of actual and potential remote sensing appli- 
cations for lasers is given in Table 12.2. For such a 
great range of tasks, a wide variety of principles, 
techniques and systems have been applied and 
developed; some of the more important aspects of 
these are outlined in the following two sections. 
Following such basic considerations, there are a 
number of possible ways of discussing their appli- 
cation to remote-sensing problems; one could, for 
example, provide a categorization based on the 
lidar base from fixed or mobile ground platforms, 
from the air (by balloon or aircraft) or from a space 
craft. However, a categorization based on the field 
of application seems likely to be more informative. 
Accordingly, in the present report, lidar remote 


sensing is discussed under six separate head- 
ings: atmospheric sensing; problems in aviation; 
chemical and pollution studies; military applica- 
tions; geoscience; and measurements from space. 
Obviously, most of these individual topics overlap 
across several of the items listed in Table 12.2. 

The present chapter deals with active remote 
sensing by laser. For an account of passive sens- 
ing with thermal emission in the infrared, see, for 
example, Chapter 19 of Volume I. For a short list of 
further reading including books, reviews and com- 
pendia of papers, see the Reference section at the 
end of this chapter. 


12.2 SOME FUNDAMENTAL 
CONSIDERATIONS IN LASER 
RADAR 


12.2.1 Basic principles and 
phenomena for remote 
sensing with lasers 


Some of the basic principles and phenomena that 
may be utilized in laser radar are indicated in 
Table 12.3. The scattering of light from the target 
is of course fundamental to the technique and has 


Table 12.2 Some remote-sensing applications of laser radar 


Wind measurement—meteorology, airfield and aircraft shear and turbulence warning, aircraft wake 


vortices 


Atmospheric measurement—clouds, precipitation, aerosols (dust), temperature, pressure, dispersion 


Chemical species, pollutant and gas detection 


Airborne operation—avionics data, true airspeed, obstacle and terrain avoidance 


Scene and object imaging with various discriminants—intensity, glint, polarization, frequency 


shift, etc. 


Range and three-dimensional sizing, depth sounding 


Vibrational analysis of surfaces and structures 


Spaceborne measurements—cloud, atmospheric scattering, global wind field 


Table 12.3 Some basic principles and phenomena for laser sensing 


Light scattering: elastic, specular, diffuse, Mie, Rayleigh, Brillouin, Raman 


Differential absorption 
Fluorescence 
Surface heating and vaporization 


Light propagation (diffraction and speckle) 


Doppler effect 


Interference (interferometry; light mixing/beating/heterodyning) 


Timing principle 
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many possible forms. Of those noted, elastic (with- 
out gross change of frequency), specular (mirror- 
like) and diffuse (as from a rough surface) are 
terms relating to surface scattering. Mie refers to 
scattering from particles of size comparable with 
the wavelength 4. Rayleigh and Brillouin scatter- 
ing arise from density fluctuations due to thermal 
effects at the surface and (more strongly) within 
the bulk of a material, whereas Raman scattering is 
re-radiation following changes of internal energy 
levels within individual molecules. Analysis of 
light scattering thus has the capability to provide 
physical information at many different levels. 

Differential absorption, fluorescence and sur- 
face heating/vaporization are generally specific 
to the target and may be used with an appropriate 
choice of laser wavelengths to provide chemical- 
type information. 

The available laser beam sizes and speckle 
effects of laser radiation are of course determined 
by well-established principles of optical propaga- 
tion. In lidar work, the outgoing laser beam will 
usually be setup in the lowest order optical mode 
(TEMoo) with Gaussian (bell-shape) intensity dis- 
tribution. At a distance z from the minimum beam 
waist diameter d,, the beam size is given by 


D*(z)=di| 1+(422/nd3) | (12.1) 


This minimum beam size, dj, occurs where the 
beam is collimated (i.e., where the wave front is 
plane). As shown in Figure 12.1, in lidar operation, 
the laser may be transmitted in either one of two 
forms: as a focused beam on a nearby target (usu- 
ally up to ~1km) or as a collimated beam onto a 
distant target. In either case, from Equation 12.1, 
if0<d(z) then 


d(z)=A,/No (12.2) 


and the angular divergence of the beam is given by 
d(z)/z = 4M = 1.27(X/0) (12.3) 


From these expressions, it is easy to show that 
a 10cm diameter beam (D,; in Figure 12.la) at 
A= 10.6 pm may be brought to a focus of O=1.36cm 


at a range R of z=100m. Conversely, for a colli- 
mated beam at the transmitter with D;=0=10cm 
(in Figure 12.1b) the beam size D, at 10km range 
and A=10.6um would be 1.36m. These values of 
beam diameter reduce very sharply with shorter 
wavelength A (note the A term in the numerator 
of Equation 12.2). However, the refractive effects 
of atmospheric turbulence are much greater at 
the shorter wavelength. These introduce optical 
aberration in the light path so that, in the lower 
atmosphere at least, it is difficult to focus the 
shorter-wavelength beams any more sharply than 
longer wavelengths (see Section 12.3.4 following). 
This, of course, no longer holds true high in the 
atmosphere or in space. 

Figure 12.1c illustrates a laser beam of radius 
D, illuminating a rough surface that is shown as 
an assembly of small scatterers, which reflect the 
beam in all directions. Typically, at a distance R, 
the interference of the light from all these many 
scatterers will produce a random speckle pattern. 
The characteristic size D, of these speckle blobs 
is given by a similar expression to Equation 12.2 
with 


D, ~-R/Dp (12.4) 


Thus, as the beam illuminating the target gets big- 
ger, the speckle size decreases. The fluctuations of 
speckles, and their temporal and statistical char- 
acteristics for moving targets, are a complex and 
fascinating topic and will be touched on in various 
sections of this report. 

The next item in Table 12.3, the Doppler effect, 
is the well-known change in frequency of a wave 
from a moving object (as typified, for example, 
from a passing train whistle or a police siren). 
Exactly the same thing happens in the light radia- 
tion scattered from a moving target. In back scat- 
tering (i.e., that light scattered back towards the 
transmitter) the Doppler shift fp is given by 


fp =2Vg/r (12.5) 


This is illustrated in Figure 12.2a where V,, equal 
to Vcos 9, is the radial line-of-sight component of 
motion of the scattering target. 

Laser radiation in the visible and near-visible 
region is, of course, of a very high frequency that 
cannot be followed by available detectors. Green 
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Figure 12.1 Illustration of the focusing of laser beams and laser speckle. (a) Converging beam of diam- 
eter D, transmitted from a lidar telescope and focusing to a diameter 0 at range R. (b) Collimated 
beam (with plane wavefront) D; propagating to a diameter D, at a distant range. (c) Laser speckles (of 
typical size ~D,) due to a scattering from a rough surface illuminated with a laser beam of size Dp. 


light of A=0.5 um, for example, is of frequency 
f=6 x 104" or 600 THz and from Equation 12.5 
the Doppler shift at A=0.5j1m due to a mov- 
ing target with V, = 1 ms" is only 4 x 10°Hz or 
4 MHz. Such a small shift may be measured with 
advanced classical spectroscopic techniques of 
high-resolution interferometry, employing, for 
example, Fabry-Perot etalon filters. However, the 
interference principle may also be employed to 
measure this Doppler shift as added to the much 
higher “carrier” frequency of the laser f,. Thus, if 
the shifted and unshifted beams in Figure 12.2b 
are superimposed at a detector, the electromag- 
netic fields of the local oscillator beam at frequency 
f,and the scattered beam (from a moving target) of 


frequency (f,+fp) will beat or heterodyne together. 
In consequence, the detector will provide an oscil- 
lating electrical signal i, at the difference frequency 
(fitfo)-G=fp as described in the following 
section. 

Finally, we consider the timing principle as 
indicated in Figure 12.2c. If a laser beam is trans- 
mitted as a very short, sharp pulse of length At, 
the measured time-of-flight ¢; to and from a target 
gives a measure of the range R and range resolu- 
tion AR as 


R=ct;/2 (12.6a) 


AR=cAt,/2 (12.6b) 
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Figure 12.2 (a) Schematic of the Doppler frequency shift in light backscattered from a moving object. 
As shown, the line-of-sight velocity component is V cos and thus the Doppler shift is f;=2V cosO/A. 
(b) Illustration of light beating or heterodyning by mixing the two beams at the surface of a detec- 
tor which provides an oscillating electrical signal i, at the difference frequency fp. (c) Schematic of 

a simple time-of-flight laser rangefinder. If the measured time delay to the target and back is t; the 


range Ris ct,/2. 


where c is the velocity of light (~3 x 108 ms“). Thus, 
a 10 ns pulse system will readily provide a range res- 
olution of 1.5m, which may be adequate for many 
purposes; shorter pulses may be readily employed 
to provide centimetric precision or better. 


12.2.2 Basic techniques: Incoherent 
direct-detection and coherent 
light-beating 


For many applications, the scattered light from the 
target may be directed to a detector with no more 
than a simple optical filter to cut down most of the 


background light. However, for more advanced 
applications, as outlined briefly above and shown 
schematically in Figure 12.3, there are two basic 
lidar techniques for detection and spectral analysis 
of the scattered light field: 


e Direct detection or optical/frequency domain 
spectroscopy in which the light field is oper- 
ated on by optical elements prior to detection. 

e Post-detection or time domain spectroscopy in 
which analysis is conducted after the light field 
is detected, often in conjunction with a coher- 
ent heterodyne local oscillator beam. 
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Figure 12.3 Schematic of the two basic forms of laser radar showing the flow diagram of optical to 
electrical processing. Direct detection lidar in which the scattered light is collected by the receiver 
and passed directly via an optical filter to the detector. At high resolution, interferometric stability 
will be required in the optical filter. Coherent heterodyne lidar in which the scattered light is mixed 
with an optical local oscillator beam derived from the original laser as in Figure 12.2b. In this case, 
very precise, interferometric control and matching optic axes and wavefronts of the two beams are 
required as they beat together at the surface of the detector. 


The two techniques are thus different at a very fun- 
damental level and have been extensively discussed 
(see, e.g., References [2,3]). In the older direct- 
detection techniques, the beam may be operated 
on by various classical interferometric devices 
(two-beam: Michelson, Mach-Zehnder, etc.; mul- 
tiple-beam: Fizeau, Fabry-Perot, Lummer-Gehrke 
plate, echelle, grating, etc.) to form a visibility 
curve or spectrum from which spectral informa- 
tion may be derived. In post-detection, coherent 
heterodyne methods, the spectrum is formed (or 
equivalent information derived) by manipulation 
of the electrical signal as it emerges serially in time 
from the detector element. 

For coherent operation with full mixing effi- 
ciency, very good control of the local oscillator 
beam and scattered signal beam is required, with 
very precise matching of the wavefronts and optic 
axes. A fundamental characteristic of such an 
arrangement is that, in effect, the local oscillator 
beam selects just a single mode of the scattered 
radiation field. This ensures that the full speckle 
characteristics of the scattered radiation (ampli- 
tude, frequency, phase), with all the dynamic and 
fluctuation parameters, are fully preserved in the 
measured electrical signal (in contrast with direct 
detection where these properties are necessarily 
smeared out by integration over a finite optical 
aperture). In essence, the coherent technique pro- 
vides a measure of the vector sum of the amplitude 


of electromagnetic field, E,, over the detector, 
whereas, in direct detection of the scalar sum of 
intensities, |E,| is formed. The coherent beating 
technique thus provides an important and perhaps 
little appreciated spectroscopic tool; probability 
distributions, higher moments, autocorrelation 
functions, structure functions and fractal char- 
acter may be investigated for different fields (see, 
e.g., Reference [4]) and there is wide application to 
many aspects and phenomena of laser physics. 

Not surprisingly, the information transfer by 
these two different forms of operation on an elec- 
tromagnetic field is very different. The most signif- 
icant factor is the rate of photons detected within 
a single coherence area (an étendue of )2) per unit 
band pass and, most particularly, the number of 
detections from one cell of phase space, equivalent 
to the number of counts per coherence time and 
usually called the photon degeneracy parameter 6. 
In coherent time-domain techniques operating on 
a single optical mode, the available etendue or light 
grasp is limited to 4”. On the other hand, in direct- 
detection, interferometric methods, the optical fil- 
ter acts directly on the light field and one measures 
the direct current component of the signal col- 
lected in a beam of etendue U, which can be many 
times larger than )7. 

A summary comparison of the two techniques 
is given in Table 12.4. As will be seen in the follow- 
ing sections, the basic choice of technique will be 
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Table 12.4 Comparison of direct-detection and coherent techniques 


Direct detection 


Scattered radiation optically manipulated before 
being passed to detector 

Measures scattered intensity in selected frequency 
intervals 

Light may be collected in many optical modes 

Some relaxation of laser characteristics may be 
available (depending on frequency resolution 
required) 


Relatively simple optical arrangement of transmitter 


and receiver but interferometric stability required 
in analyzer 

Quasi-noise free detection at shorter wavelengths; 
additional noise source at longer wavelengths 


Coherent heterodyne 


Scattered radiation mixed (heterodyned) with 
coherent local oscillator beam 

Full phase, frequency and amplitude 
information available 

Light collected in asingle optical mode 

Laser requires to be of high spatial and 
spectral purity 


Very precise, stable, optical arrangement 
required to maintain wavefront matching of 
scattered and local oscillator beams 

Detector noise at longer wavelengths may be 
advantageously dominated by the local 
oscillator shot noise 


dictated by the problem in hand and the most appro- 
priate laser source for that problem. In broad terms, 
the large photon energy at shorter wavelengths 
means that detectors should be comparatively free 
of noise, so direct detection techniques may be pre- 
ferred for ) = 1.5m. At longer wavelengths with 
heterodyne techniques, shot noise in the local oscil- 
lator beam may be used to dominate thermal noise 
in the detector to provide quantum-limited detec- 
tion of the signal beam. A more detailed evaluation 
of signal-to-noise ratio and measurement accuracy 
is provided in Section 12.3.3 following. 


12.3 LIDAR TECHNOLOGY AND 


SYSTEMS 
12.3.1 The building blocks of a lidar 
system 


The basic elements of a laser radar are indicated in 
Figure 12.4. In the design of a system for any given 


Ophea Detector 
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ne oe ee 
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Signal 
processor 


task, there are many aspects to be considered and 
choices to be made. Immediately obvious factors 
include the nature of the target and the information 
required from it, the environment and type of plat- 
form (ground, air or space) from which the lidar 
must operate and not least the available financial 
budget. Some of the vast range of technical con- 
siderations and options are shown in Table 12.5. 
This list serves to establish both the intellectual 
challenge of laser radar and the diversity it offers 
of basic science, advanced technology and field 
measurement. In the present work, it is obviously 
not appropriate to consider all the various building 
blocks shown in Figure 12.4 and Table 12.5 in any 
detail. It is, however, worth outlining a number of 
issues. These include (1) propagation in the atmo- 
sphere, the choice of lasers and the relative merits 
of working at various wavelengths; (2) basic ques- 
tions of signal to noise and calibration of lidars; 
and (3) some consideration of range resolution 
and optical and system design issues for operation 
from different platforms. 


Information 


Data 
analyser 


Figure 12.4 The basic components of a laser radar. See text and Table 12.5 for an outline of the many 
technical considerations and options for the various components. 
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Table 12.5 Some technical considerations and options in the design of a coherent laser radar (see also 


Figure 12.4) 


Laser 


Power level, pulsed/cw, wavelength, line stability, coherence, lifetime, mass, 


volume, power supply, cooling, etc. 


Optics 


Size, uniaxal/biaxal, optical/mechanical precision, field of view, temperature 


effects, longterm stability, etc. 


Optical control 


Scan patterns, speed, range discrimination, aiming, switchability, etc. 


Scattering, signal strength, range, Doppler shift, signal fluctuations and 


Propagation Attenuation, aberrations—turbulence, absorption, etc. 
Target 

statistics, speckle, coherence time, etc. 
Detector 


Signal processor 
alarms, etc. 
Data analyzer 
alarms, etc. 
Information 
rates, etc. 


Bandwidth, sensitivity, noise characteristics, cooling, etc. 
Frequency, integration times, repetition rate, thresholds, missed signals, false 


Parameters, integration times, repetition rate, thresholds, missed signals, false 


Range, velocity, bearing, elevation, character, signal strength, fluctuations, error 


12.3.2 Atmospheric transmission and 
choice of laser 


For field operation at useful ranges, one generally 
requires a laser with good transmission through 
the atmosphere (although it may be noted that, for 
some operations, a lidar with limited sensing out to 
a few tens of metres, and which thereafter is rapidly 
absorbed, may be valuable). Figure 19.2 in Volume I 
shows a low-resolution spectrum of transmission in 
the atmosphere near ground level in clear weather. 
Only certain regions in this spectrum (known as 
“atmospheric windows”) can be used for remote 
sensing over extended ranges. As is obvious in 
Figure 19.2, Volume I, the absorption due to molecu- 
lar components, primarily water vapour and carbon 
dioxide, in the atmosphere establishes three primary 
bands of good transmission: from the near-ultra vio- 
let at ~0.3,1m through the visible region and up to 
~2.5 um; 3-5 tm; and the longer-wavelength 8-13 pm 
region. Table 12.6 shows a brief list of lasers that may 
be used. At the shorter wavelengths, a vast range of 
lasers is available, including gas lasers (e.g., argon ion 
Ar*, helium-neon He-Ne, etc.), solid state (e.g., ruby, 
neodymium-yttrium aluminium garnet, Nd-YAG, 
etc.) and semi-conductor and fiber amplifier lasers. 
The 3-5um range is less well served with CO 
and various He-Ne-Xe lasers and OPOs currently 
offering rather low efficiency in conversion of elec- 
trical to optical laser power (so called “wall-plug” 
efficiency). In the 8-12 1m band, carbon dioxide 


CO, lasers are widely used and offer both high effi- 
ciency (often >10%) and a very wide tuning range 
over many possible molecular lasing transitions. 
Selection of the laser type and most effective 
lidar for a given remote-sensing problem are obvi- 
ously determined by a vast range of factors, some 
of which are indicated in Table 12.5 and further 
discussed in the following section. Any decision is 
often a matter of best engineering judgement and 
not susceptible of scientific “proof”. Over the years, 
considerable controversy has sparked between 
advocates of different lasers and systems; such 
arguments have usually generated rather more heat 
than light and on occasions have delayed progress. 


12.3.3 Signal strength, signal-to- 
noise and lidar calibration 


The signal return power, collected by a laser radar 
from a large, solid, hard target (bigger than the 
laser beam), can be written as 

Pa = Py-€| AIR? |T-€22 (12.7) 
where P, is the transmitted power, e is the target 
reflectivity, A is the effective collection area of the 
lidar receiver and R is the range to the target from 
the lidar. Note that, if the target is small compared 
with the beam, the range dependence becomes 
1/R*. The atmospheric absorption coefficient for the 
operating wavelength is « and T is the transmission 
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Table 12.6 Some of the lasers that may be employed for lidar work in the visible and near infrared 


Gas lasers 
Solid state 


He-Ne 0.63 umm; Ar+0.45-0.55 pmm; CO, He-Ne-Xe 3-5 umm; CO, 9-12 umm 
Nd-YAG—1.06 mm (and frequency doubled 0.53mm and frequency tripled 


0.35mm); ruby 0.65,1mm; Tm, Ho: YAG: YLF 2-2.2mm 


Diode lasers 
(DFB) lasers 1.55 umm 
OPOs 


0.8-1.611m with power augmented in optical fiber amplifiers; distributed feedback 


Optical parametric oscillators, tunable 3-5 pmm 


of the lidar optics. For a direct detection lidar 
(Figure 12.3a), the effective collection aperture Aj;, 
is given by the area of the lidar-receiving telescope 
(and as dictated by the available etendue U of the 
analysing interferometer). However, for coherent 
lidar (Figure 12.3b, the situation is somewhat more 
complex, as discussed in Section 12.2.2. To a good 
approximation, with a well-adjusted system, the 
effective area of the collection aperture for a het- 
erodyne system can be written as 
Anet =[1/ Asp +1/ Ato +1/Aar]" (12.8) 
where Agp is the average area of a speckle element 
(Figure 12.1c) in the return field. The influence of the 
local oscillator and its spatial profile is incorporated 
by the Ajo term. Atmospheric turbulence is incor- 
porated by the A, coherence area term. Turbulence 
effects will dominate, and the return signal will be 
greatly reduced, if A,; is smaller than the combina- 
tion of the unperturbed speckle and local oscillator 
areas Ap and A, . Physically, this means that, with 
reference to Figure 12.1c, the speckles have been 
broken up by the turbulence-induced refractive 
index fluctuations and are much reduced in size. 
In a lidar experiment, the accuracy with which 
a parameter may be measured will be governed 
by the signal-to-noise ratio (SNR). This in turn is 
fundamentally constrained by the quantum nature 
of light and the number of photons in the signal. 
Thus, for an experiment lasting a time ft, with a 
signal power P, and effective quantum efficiency 
of the detector of n, the mean number N, of sig- 
nal photons detected (each of energy hv where h is 
Planck’s constant and the angular frequency of the 
light v is equal to 2mc/) will be given by 
N, =1(P,/ hv)t. (12.9) 
The SNR relations for direct and coherent detec- 
tion are, in fact, rather different. With reduced 


spectral discrimination in direct detection, there 
is the possibility of collecting background light N, 
(e.g., daylight) not related to the signal, and also a 
noise contribution N, from the detector itself so 
that the average measured signal (N,,) is given by 

(Nin) =(Ns+ No t+Na) (12.10) 
From this, it might erroneously be supposed that 
the SNR would be given by N,/(N,+N,). This, 
however, provides no indication of the accuracy 
with which the signal can be measured, for which 
we need the fluctuations in these quantities. The 
magnitude of the root variance N, is, in fact, for 
commonly met statistics, given by N§”, with 
similar expression for background and detec- 
tor noise. Thus, measurements of detector out- 
put, with and without signal present, provide an 
SNRg;, given by 


SNR gir = (measured signal) / {sum of variances}? 


=(N,)/[(N,+Npt+Na)+(NatNp)]? 


=(n'Pst2”) / (hv)'?[(B, + By + Pa) + (Bi + Bh)? 
(12.11) 


In the case where P,>>P,, P, the SNR reduces to 
N&. In this case, the “noise” is truly “noise-in-sig- 
nal”, as governed by the quantum nature of light 
detection. 

In coherent heterodyne detection, due to the 
mixing of the local oscillator light field E,, and the 
signal light field E,, the detector provides an elec- 
trical signal i, proportional to their product so that 

i, < EyoE, (12.12) 
Analysis shows that, in a well-adjusted system, the 
coherent SNR,,,, in the power spectrum of the sig- 
nal is given by 
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SNR. =nP,/hv B=N,/B (12.13) 


where 1 is the effective quantum efficiency of the 
detector and B is the operational frequency band- 
width for the lidar system. It is supposed that the 
local oscillator has been arranged so that its own 
photon shot-noise is the dominant term above 
any contribution from even an inherently noisy 
detector. The bandwidth term B requires careful 
consideration. In the lidar signal processing, the 
instrumental bandwidth B, requires to be well 
matched to the frequency band B, over which the 
signal is spread. If B, is set much greater than Bg, 
the SNR. is unnecessarily diminished. If it is set 
much smaller, not all the signal photons are effec- 
tively utilized. 

The precision with which frequency measure- 
ments (e.g., Doppler shifts) may be made can be 
analysed in terms of the Cramer-Rao relationship. 
For direct detection, interferometric spectroscopy, 
the Cramer-Rao relationship gives the limiting 
(standard deviation) accuracy 6W, with which the 
centre of a Gaussian spectral line profile can be 
found, as 


Relative standard deviation accuracy 


SW 2=0.425WoNs” (12.14) 


where N, is the total number of photocounts dis- 
tributed across such a line profile of width W, 
(half width at height e'’). This is illustrated in 
Figure 12.5 by the lower line of slope —1/2, show- 
ing attainable accuracy versus total photocount. 
These total photocounts may be accumulated by 
summation over many individual laser pulses and, 
as noted, across a collection aperture containing 
many optical modes (provided they lie within the 
allowable etendue of the interferometer). 

For coherent heterodyne spectroscopy, the situ- 
ation is very different (see, e.g., Reference [2]). The 
simple Cramer-Rao relationship of Equation 12.14 
needs to be multiplied by a complex term contain- 
ing the photocount degeneracy 56. The effect of this 
is manifest in the upper curves of Figure 12.5 for 
accumulation of n=1 and n=100 laser pulses. As 
outlined in the previous section, the light signal 
must be collected within a single optical mode and 
for a low photocount the slope (for degeneracy 5 < 
3.3) is close to —1 and the accuracy is best improved 
by increasing the laser energy per pulse. Only for a 


Total effective photocounts 


Figure 12.5 Schematic of the standard deviation of Doppler frequency estimates as a function of the 
total effective photocount on log-log scales. Line (a) shows the direct detection Cramer—Rao limit 
with slope -1/2. Curve (b) is the heterodyne Cramer-Rao for n=1 shot (i.e., pulses); note the slope of 
—1 at low photocount and tending to zero and saturation at high photocount. The arrow indicates the 
region of closest approach with photocount degeneracy 6~3.3. Curve (c) is the heterodyne Cramer-— 
Rao for n=100 shot (i.e., pulses): this is a translation of curve (b) by a factor of 100 in effective photo- 
count and a factor on (n)”?=10 in standard deviation accuracy. 
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higher photocount (6 > 3.3, deriving from stronger 
scattering) is the accuracy best improved by accu- 
mulating over a number of pulses. There is thus a 
very clear difference in the design considerations 
for the two classes of lidars. For direct detection, 
the available laser energy may be spread without 
undue penalty over many pulses—the total effec- 
tive photocount and accuracy depend only on the 
total energy transmitted (and proportionately scat- 
tered). For coherent heterodyne spectroscopy, it is 
essential (at lower scattering levels) to make the 
energy per pulse as large as possible to ensure that 
6 is as large as possible. 

Further examination of signal statistics, process- 
ing, etc., is beyond the scope of the present chap- 
ter. The subject rapidly becomes extremely complex 
with many highly specialized techniques. It is, 
however, worth considering the topic of calibra- 
tion and performance (see, e.g., References [5-8]). 
Equations 12.7 and following relate the lidar signal 
to the lidar parameters. If any of these latter are in 
any way defective, e.g., poor optical transmission T, 
inefficient detector n, reduced laser power P,,, etc., 
the system performance will be impaired. It is, in 
fact, often remarkably difficult to prove by abso- 
lute calibration test of signal strength that a lidar 
system (particularly a coherent system) is working 
to its full potential. The principle of calibration is 
fairly straightforward; given a test target of well- 
known scattering characteristics (with reliable € in 
Equation 12.7) a SNR.,), is calculated for the lidar 
from all its individual, precisely-measured, parame- 
ters (e.g., optical transmission of lenses, beam split- 
ters detector quantum efficiency, etc.) as inserted 
into the lidar equations. This value is then com- 
pared with that SNR,,, which is actually observed at 
the output of the lidar signal processing. Any major 
discrepancy requires explanation and a search for 
defective elements, alignment and signal processing 
components. It is of course extremely important to 
conduct such an exercise; a 3 dB (factor of 2) loss in 
SNR,,, for example would either require in compen- 
sation a doubling of laser power or provide a range 
performance reduced by at least 30%. 


12.3.4 Range resolution, optical and 
system design issues 


For pulsed lidars, the range and range resolution 
are given by Equation 12.6a and b and the selec- 
tion of time delay t, length of laser pulse At, and 


the length of the timing interval At, (often called 
the “range gate”) in the signal-processing system. 
Pulse length and range gate are usually set to be 
rather similar for most effective operation. As 
noted in Section 12.2.1, a 10 ns pulse gives a range 
resolution of 1.5m, which may provide adequate 
accuracy for range measurement on a solid distant 
object. For an extended target, such as the atmo- 
sphere, measurements over successive volumes 
along the laser beam may be required, in which 
case, for example, a 1 ps pulse (and matched signal- 
processing range gate) would give ~150m resolu- 
tion. In this case, measurements up to 15km would 
require 100 equal range gates, each of equivalent 
length 150 m in the processing. 

For direct detection, the signal strength has 
a range dependence given by Equation 12.7 and 
reduces as R~* (supposing that all the transmitted 
energy is incident on the “target”). The correspond- 
ing range dependence for the coherent heterodyne 
case was first evaluated in 1971 by Sonnenschein 
and Horrigan [9] (see also Reference [10]); it is 
implicit in Equation 12.8 but can be expressed 
more directly as 


S(R, F)<[R*{1+(A/A) (1/R-1/F) }]" (12.15) 


where F is the range at which the lidar beam is set 
to focus and A is the effective area of the telescope 
radius (~A,, in Equation 12.8). This expression has 
some interesting implications; for a target at a given 
range R, the maximum signal is attained by focusing 
at the target and setting F equal to R. Thus, for solid 
targets (so-called “hard” targets with scattering from 
a solid surface) expected at various long ranges, the 
lidar would be set with the outgoing beam almost 
collimated; the R? term dominates in the bracket on 
the left-hand side of Equation 12.14 and the range 
dependence (R~) is the same as for direct detection. 
Similarly, for a short range, the signal strength will 
peak up at ranges R close to F. This is illustrated in 
Figure 12.6, where a representative value of (A/A)? 
equal to 3x10°m? has been used. This establishes 
the possibility of range selectivity with a continu- 
ous wave laser beam and is particularly useful for 
measurements on an extended or diffuse source of 
scatterers, such as small particles in the atmosphere. 
In this case, the measured coherent signal derives 
most strongly from the focal region of the beam. 
This comes about because the scattered radiation 


12.3 Lidar technology and systems 443 


from this region is best matched in wave front to the 
local oscillator beam in the lidar. In consequence, it 
beats most efficiently with the local oscillator and 
provides the strongest signal. The curves shown 
in Figure 12.6 are representative of what may be 
achieved with a 10.6um coherent lidar with a 30cm 
diameter transmitting/receiving telescope (with 
the outgoing laser beam and equivalent A,, of best 
diameter ~16cm). The central probe volume is com- 
monly defined as the region within which the signal 
is within 3dB of its peak position: its half-length is 
written as AR,,,. Some manipulation of Equation 
12.15 with the approximations F => AR gives: 


ARsap =+F’(A/A) (12.16) 
Thus, at a focal setting of F=100m, a spatial reso- 
lution of about +6 m is produced. For such diffuse, 
extended targets, the total observed signal S will 
be determined by the integral over an appropriate 
range interval R, to R, given by 


Ry 
S cc | Pasir, AER 


Ri 


(12.17) 


where P(R) is the strength of scattering at range 
R. In a generally clear uniform atmosphere, P(R) 


0.00020 


0.00010 


0.00005 


over short ranges may be considered as constant, 
but obviously this is a very crude approximation in 
conditions of low cloud, layered fog or smoke [11]. 
Brief consideration of these equations indicates 
some of the complexity of comparing coherent 
lidar performance at different wavelengths. Thus, 
at shorter wavelength, the focal depth will be 
smaller and the spatial resolution will be greater; 
the total signal S is correspondingly reduced for the 
smaller AR. However, at shorter wavelength, the 
maximum useful size for the effective telescope 
diameter, D, is limited by refractive turbulence in 
the atmosphere (which destroys lateral coherence 
across the beam, thus reducing A,p; see Section 
12.2) and has an impact approximately propor- 
tional to A~*. Thus, if atmospheric conditions per- 
mit, an effective telescope diameter of ~50cm at 
10.6 4m wavelength would be reduced by a factor 
(10.6/2.06)*’> [12] (for an extensive discussion with 
many references, see also Reference [13]) at 2.06 1m 
to only ~7.0cm. Such a reduced collection aperture 
would provide both a weaker signal and poorer 
spatial resolution than the admittedly larger and 
more expensive telescope that could be used in 
these circumstances at the longer wavelength. Such 
wavelength comparisons may be extended to many 
other aspects of signal collection, processing and 
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Figure 12.6 The range sensitivity for a coherent lidar as given by Equation 12.14 with the (A/A)? terms 
equal to 3x 10m2, and for four values of the focus F=80, 100, 120 and 140m. Note how the sensitivity 
peaks up very strongly around the focal range F. This permits the possibility of useful range resolution 
with a continuous wave laser for measurements on an extended diffuse target such as aerosols in the 


atmosphere. 
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measurement precision. Thus, in Equation 12.13, 
the instrumental bandwidth required to cover a 
given velocity interval is proportional to d~! (see 
Equation 12.5 for the Doppler shift) and altogether 
with the larger photon energy hv gives 


SNR, NP, -A? (12.18) 


For a given laser power, the scattering P, from most 
surfaces and aerosol particles increases at shorter 
wavelength. To give a signal-to-noise advantage at 
shorter wavelength, this rate of increase obviously 
needs to be greater than A to overcome the longer 
wavelength advantage of Equation 12.18. 

Another point of comparison is the choice of 
pulse length At¢,. Fourier transform analysis of a 
short wave train gives a limiting frequency reso- 
lution Af,~(At,)1. Thus, a 1ppulse gives a line 
width of ~1 MHz. Comparing this with Equation 
12.5, such a line width would provide a velocity 
resolution AV given by 


AV =Af(A/2)=A(2At,)* (12.19) 


Thus, at 21m, a pulse length of 1 ps would give a 
Fourier transform velocity resolution of ~1 ms", 
which increases to ~5ms™ at 10,1m. This poten- 
tially provides a significant advantage to shorter 
wavelengths and also offers the possibility of 
achieving greater spatial resolution (with shorter 
pulses). 

Further discussion is beyond the scope of the 
present article; wavelength and system compari- 
sons are generally very difficult and rarely permit a 
clear cut “winner”. Selection of a lidar system for a 
given task usually requires consideration of many 
issues, often practical and logistical, as well as the 
technical options outlined in Table 12.5. As an 
example, the overriding constraints for an aircraft 


Table 12.7 Primary atmospheric parameters 


Structure 
Dynamics 


or military installation is likely to be the maximum 
optical aperture that can be accommodated in the 
structure, together with strict limits on mass, vol- 
ume and available power budget. 


12.4 LIDAR AND ATMOSPHERIC 
SENSING 


12.4.1 Introduction: Atmospheric 
parameters and measurement 


Some of the first demonstrations of lidar in the 
1970s were measurements of wind and cloud. The 
light scattered from a laser beam in a clear atmo- 
sphere is primarily from air molecules (Rayleigh 
scattering) and from small airborne particles 
(dust, etc.), usually referred to as aerosol scattering. 
The strength of molecular scattering is approxi- 
mately proportional to \~4, whereas the wave- 
length dependence of aerosol scattering typically 
lies between A! and ~~. Thus, molecular scattering 
is much stronger at shorter wavelengths (hence the 
blueness of the sky) and also increases relative to 
aerosol scattering. Perhaps the most obvious com- 
mon example of the latter is the appearance of a 
shaft of sunlight between clouds. 

Wind and atmospheric measurements are 
interesting and important for many reasons: to 
meteorology and climatology, for example, for 
the information they provide on the behavior of 
the atmosphere, to local studies of shear and tur- 
bulence, including the planetary boundary layer 
and practical measurements for shipping, aviation 
and commerce. All these impact the four primary 
parameters of interest to the atmospheric physicist 
shown in Table 12.7. 

Lidar studies make a contribution to all these 
areas and this is likely to increase in the future 
with advanced instruments operating remotely 
and from space. 


Temperature, density and pressure as a function of altitude 
Circulation and motion at all scale sizes from global through synoptic 


to small local scale 


Radiative properties 


Distribution, characteristics and structure of clouds and aerosols and 


their impact on solar, ground and atmospheric radiation 


Chemical concentration 


Vertical structure and horizontal distribution of molecular species 


including minor constituents such as ozone, radicals and active 
molecules (Section 12.6) 
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12.4.2 Scattering in the atmosphere 


Over the past 30 years, considerable effort has been 
put into the measurements of scattering in the 
atmosphere in many different regions, seasons and 
wavelengths. A brief account of molecular, aerosol 
and cloud scattering data is presented as an essen- 
tial preliminary, and a schematic of aerosol and 
molecular scattering is shown in Figure 12.7. First, 
it is worth noting the nomenclature B(0, A, z) m™ 
sr! that is commonly employed for the strength 
of atmospheric scattering. This expresses the frac- 
tional amount of light that would be scattered from 
an incident light beam along a metre length of path 
(m~) into the equivalent unit radian solid angle 
(sr) centred at scattering angle 0, at wavelength 
d and at a height z in the atmosphere. Typically, 
this will often be abbreviated to B(x) as a general 
term for backscattering. It is also worth noting 
various lengthy publications devoted in whole or 
part to scattering in the atmosphere, including 
those by Scorer [14], Hinkley [15] (in particular the 
chapters by Zuev [16], Collis and Russell [17] and 
Inaba [18]), and also the Conference Publications 
of the biennial Coherent Laser Radar Meetings and 
International Laser Radar Conferences. 

(a) Molecular scattering. Molecular scattering, 
in both theory and practice, has been extensively 
discussed over the years; see, for example, the early 
review of Fabelinski [19]. In the atmosphere, the 
strength of molecular scattering varies slowly, uni- 
formly and predictably with altitude through the 
atmosphere. The exact exponent for wavelength 
scattering should be 4.09 instead of 4 to account for 
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dispersion of the index of refraction of air. Taking 
the reference wavelength of A,=1.06um and a 
reference height of z,,,,=8000m, the total back- 
scattering contribution from the air molecules in 
the atmosphere at height z may be modeled by an 
exponentially decreasing function: 


Brcctl Ts Zn) = 10” (Ao 1)? eo 2m sr 
(12.20) 


The molecular spectrum is quite different in the IR 
from what it is in the visible due to the contribution 
of collisional effects on the molecular scattering 
and hence the atmospheric pressure. At low densi- 
ties (or pressure), molecules scatter independently, 
producing a single, near-Gaussian line shape, 
whereas at higher densities and longer wavelengths 
the central component, the so-called Rayleigh line, 
bisects the Mandlestam-Brillouin doublet due 
to scattering from moving density fluctuations 
(sound waves). Thus, a well-defined triplet struc- 
ture prevails at near infrared (21m) and thermal 
infrared (10j1m) wavelengths, and appears to offer 
some opportunity for Doppler wind lidar (DWL) 
applications with coherent heterodyne techniques, 
as discussed by Rye [20]. 

Other molecular scattering phenomena include 
resonant scattering and Raman scattering. Both 
may be used for atomic and molecular species iden- 
tification and concentration measurements of, for 
example, water vapour and ozone (and aerosol con- 
centration) and study of atmospheric chemistry and 
dynamics in the atmosphere (see, e.g., References 
[18,21,22] and Section 12.6) in the atmosphere. 
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Figure 12.7 Schematic of the light collected by a lidar operating in the visible region with compo- 


nents due to aerosol scattering, molecular scattering and background daylight. The relative propor- 
tion of these varies quite strongly with wavelength and height in the atmosphere. In the troposphere 
(up to ~10km) and in the infrared at 101m aerosol scattering is generally dominant. 
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(b) Aerosol scattering. The characteristics of 
aerosol scattering are very complex, depend- 
ing on the chemical constitution of the particles, 
their size distribution, typical shape, etc. These 
are largely dictated by the previous history of the 
air mass. Contributory sources include mate- 
rial blown up by winds and convected from the 
Earth’s surface (most dramatically, for example, 
the fine Saharan sand occasionally spread over 
Northern Europe and the loess soil from Central 
China which extends every spring as a plume into 
the Western Pacific). Other sources include volca- 
nic eruption (e.g., El] Chichon in Mexico in 1982 
and Mt Pinatubo in the Philippines in 1991), for- 
est fires, salt spray from the sea surface, man-made 
pollutants and the constant rain of interplanetary 
material (manifest, for example, in shooting stars). 
It has been calculated that the eruption of Mt 
Pinatubo alone injected about 10 km? of material 
in the form of fine dust into the stratosphere, with 
clear impact on the global climate for a few years 
thereafter. Typically, the particles most important 
for lidar scattering lie in the size range 0.1-10 pm 
diameter; larger particles settle out fairly quickly, 
while smaller ones scatter much less strongly. 

The investigation of the nature, sources, transport 
and chemistry of atmospheric aerosol is assuming 
steadily greater importance in atmospheric science. 
Transport and dispersion models are an important 
component of the numerical modelling machinery 
applied to investigation in these areas, and are used 
to simulate the concentration and dilution (typically 
due to turbulence) of airborne material of all kinds. 
These activities include the modelling of air pollu- 
tion and quality involving, for example, sulphur, 
nitrogen and photochemistry (see, e.g., References 
[23,24]). Increasingly sophisticated and accurate 
transport modelling is being used in studies of the 
source strength and background concentrations of 
radiatively active trace gases [25,26]. Operationally, 


the modelling of aerosol transports features in the 
simulation of the spread of volcanic ash, of critical 
importance to aviation [27], and pollution resulting 
from nuclear or major chemical accidents and con- 
flagrations. International protocols are in place for 
the management of such emergencies and extensive 
collaborative work had been carried out on the test- 
ing and intercomparison of transport and dispersion 
models (e.g., Reference [28]). 

Transport of airborne pathogens, and biota in 
general, also require suitable transport and disper- 
sion models, which have been applied to pollution 
problems, and the physics of atmospheric disper- 
sion on all motion scales from street canyons to 
global diffusion [29]. A second area of aerosol and 
gaseous transport studies of pressing importance 
is concerned with the evolution of climate forcing 
and change, in which sulphate aerosol and volca- 
nic effluent are just two of a range of airborne sub- 
stances subject to intensive investigation, typically 
involving general circulation models. 

There have now been an enormous number 
of lidar measurements of aerosol scattering with 
ground, airborne and space platforms. The wave- 
length range covers the UV at ~0.351m to IR at 
10.6m with direct detection and heterodyne 
techniques (see, for example, Tables 12.8 and 12.9). 

However, there are great problems in synthe- 
sizing these into a database of global perspective, 
taking account of different wavelengths, regions, 
seasons, histories, sampling, etc., of the mea- 
surements. Two extensive airborne programmes 
conducted in the late 1980s and early 1990s pro- 
vide a broad base of knowledge at some of these 
wavelengths. These were the SABLE and GABLE 
programmes (South Atlantic/Global Atmospheric 
Backscatter Lidar Experiments) of the USAF 
Geophysics Laboratory and the then UK Royal 
Signals and Radar Establishment (subsequently 
the Defence Evaluation and Research Agency, 


Table 12.8 Recent papers on aerosol backscatter from lidar measurements 


over the range 0.35-1 um 


Ansmann et al. [30] 
Barnes and Hofmann [31] 
Browell et al. [32,33] 
Cutten et al. [34] 
Donovan et al. [35] 

Hoff et al. [36] 

Li et al. [37] 


Marenco et al. [38] 
McCormick et al. [39] 
Osborn et al. [40] 
Parameswaren et al. [41] 
Post et al. [42] 

Shibata et al. [43] 
Spinhirne et al. [44] 
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Table 12.9 Principal centers for coherent lidar measurements of atmospheric backscatter B(x, 10pm) 


and a selection of more recent references 


NOAA Wave Propagation Laboratory (WPL), 
Boulder, CO 

NASA Marshall Space Flight Center (MSFC), 
Huntsville, AL 

USAF Geophysics Laboratory (GL), Hanscom 
AFB, MA 

Jet Propulsion Laboratory (JPL), Pasadena, CA 


DERA (Malvern), UK—formerly RSRE 


Several other smaller data bases, also SAGE 
extinction data 


Groundbased, pulsed [45-48] 

Groundbased, pulsed [34,47-54] 
Airborne, CW 

Groundbased, pulsed [55-58] 

Groundbased, pulsed (54,59-63] 


Airborne, pulsed 
Airborne CW to 16km 
altitude 
[39,68,69] 


[47,48,55-57,64-67] 


Malvern) conducted across the North and South 
Atlantic, and the NASA-supported GLOBE 
(Global Backscatter Experiment) programme, 
which extended across the Pacific. These studies 
were conducted well after the El Chichon volcanic 
eruption of 1982 and prior to the Pinatubo erup- 
tion of 1991. The data were thus accumulated in an 
historically “clean” atmospheric period, and may 
thus represent the lowest levels of backscattering. 
This concept of a “background mode” of atmo- 
spheric backscattering had in fact been suggested 
by Rothermel et al. [47] from data drawn from two 
widely separated areas—over the UK and over the 
continental USA. 

The measurements taken in the SABLE and 
GABLE trials with an airborne 10.6 1m cw lidar 


have been extensively analysed and discussed 
[56,57] and the median and upper/lower quartiles 
and deciles are shown in Figure 12.8. These appear 
to comprise the most comprehensive database cur- 
rently available at 10.6m for the Atlantic region. 
The measurements amounted to nearly 200,000 
individual records of backscatter made during 80 
flights over the Atlantic in six different regions 
and/or seasons. They are put forward as a reason- 
able measure of global backscatter under generally 
clean atmospheric conditions at 10.6 pm. 

At 1.54 and 2m, validated data sets of B(m) 
were collected over the Pacific Ocean during the 
GLOBE II mission in spring of 1990 [44,70] and at 
2pm over the continental US in 1995-1996 [71], 
respectively. Measurements at 0.35-1 um have been 
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Figure 12.8 Median, quartiles and deciles of backscatter coefficient versus altitude at 10.6,1m as 
measured across the tropical mid-latitude and northern Atlantic and presented as a reasonable global 


measure under atmospherically clean conditions. 


448 Remote optical sensing by laser 


mainly directed to the estimation of the typical 
particle size in specific areas. However, measure- 
ments made with the NASA/LITE (Lidar In-space 
Technology Experiment; see also Section 12.9.2) in 
September 1994 [72] have been able to document 
large areas of aerosol in the upper troposphere and 
stratosphere [73], and the transport of anthropo- 
genically produced aerosols [36], Saharan dust [74] 
and biomass burning aerosols [32]. The measure- 
ments were made at three different wavelengths: 
0.355, 0.532 and 1.064ym. These three wave- 
lengths were used to derive the aerosol and cloud 
characteristics. 

Much effort has been put into the different com- 
parison of backscattering at different wavelengths 
(see, e.g., References [34,51,66,67]). On the one 
hand, there are very few colocated and contem- 
poraneous measurements of backscatter at differ- 
ent wavelengths. On the other hand, calculations 
based on Mie scattering are critically dependent 
on the assumed aerosol size distributions, chemi- 
cal constitutions and refractive indices. Generally, 
the atmospheric aerosol backscatter coefficient 
increases with decreasing wavelength. For a wave- 
length range excluding anomalous refractive index 
changes, this dependence may be expressed by a 
power law: 


B(A,Z)=B(AosZ)(Ao/A)® (12.21) 


where B(Aj, Z) is the backscatter coefficient at a cho- 
sen wavelength A, and at a height z and where ¢ 
is the wavelength scaling exponent to derive back- 
scatter at other wavelengths i. 

From the SABLE/GABLE data shown in Figure 
12.4, and comparison with other data including mod- 
eled Mie scattering [75] and other measurements at 
shorter wavelengths (e.g., References [44,54,71]), the 
following expression for ¢ was derived: 


6(0.35<A<2.1pm,z)= 


0.24|log;o(B (10.6 4m, z))|—0.62 (12.22) 


This expression is considered valid for shorter 
wavelengths, in particular the range 0.35-2.1 um 
and may be used to translate the SABLE/GABLE 
data at A, = 10.6pm. Just such a translation from 
10.6,m is shown in Figure 12.9a and compares 
well with the measurements and other data at 


1.06 ym shown in Figure 12.9b. Another extensive 
and valuable investigation has been reported by 
Srivestava et al. [76]. 

Equation 12.22 must be used with caution. At 
9.1m, for example, the situation is rendered par- 
ticularly complex by the well-known “resonance” 
in scattering due to the sharp changes in refrac- 
tive index at this wavelength, particularly for 
ammonium sulphate; see, in particular, Figure 3 of 
Srivestava et al. [51]. In consequence, the backscat- 
ter at 9.1lis generally greater than at 10.6 1m and 
the ¢ component in the scaling law increases more 
sharply than at shorter wavelengths. The following 
equation encapsulates such an increase: 


¢(9.11 um, z)=1.25|logio(Bo(10.6 um, z))|— 8.25 
(12.23) 


A final example of aerosol backscatter measurement 
and comparable atmospheric trajectory analyses is 
shown in Figure 12.10. Air masses with notably dif- 
ferent aerosol backscatter, measured at three levels, 
6.5, 8.25 and 10km over the Arctic, originated over 
Europe, the United States and Canadian Arctic, 
respectively (from Reference [77]). 

(c) Scattering from the Clouds. Clouds display 
large backscatter coefficient (with B,(m)~10~° 
to 10°m= sr‘) and large extinction coefficient 
(a= 10-> to 107! m™! sr). The strength of scatter- 
ing may vary by several orders of magnitude at a 
given wavelength and may vary by one to several 
orders of magnitude for different cloud size distri- 
butions and particle number density as a function 
of wavelength. Dense clouds strongly attenuate at 
laser wavelengths so lidar measurements cannot 
penetrate more than tens or hundreds of metres. 
However, holes are present in dense clouds at the 
small scale (on the order of 1-100 m) and results in 
so-called optical porosity. Such an optical porosity 
allows profiling of the atmosphere beyond dense 
clouds depending on the relative size of the lidar 
footprint and optical porosity (specifically on the 
probability density function of optical porosity). 
Lidar measurements at an angle close to nadir are 
favorable to reach the surface, although the prob- 
ability of a clear line of sight through dense clouds 
does not decrease too much for angles not larger 
than 30-40° with respect to nadir. The only cloud 
type with an expectation of continuous lidar pen- 
etration is cirrus. The typical thickness of cirrus 
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Figure 12.9 (a) Backscatter coefficient versus altitude at 1.06j1m as derived from Figure 12.8 (for 
10.6um) with the wavelength scaling of Equation 12.21. The background and enhanced NASA New 
Millennium Programme (NMP) profiles are also included for comparison (see text). (b) Backscatter 
coefficient versus altitude at 1.0641m for the GLOBE II measurements over the Pacific. The back- 
ground and enhanced NMP profiles are also included for comparison. 


clouds is <1 km but on occasion thicknesses of up 
to 4km have been observed. An analysis of prob- 
ability of occurrence of layered cirrus has been 
discussed by Vaughan et al. [66] together with cal- 
culations of the impact on spaceborne lidar opera- 
tion. Backscatter and extinction coefficients have 
been calculated for water clouds using Mie theory 
or relevant approximations for spherical particles 
with a good accuracy (e.g., Reference [78]). This 
work has been conducted to derive the relevant 
relationships between microphysical parameters 
like liquid water content and optical parameters, 
ie, B(a), a, k (e.g., References [79,80]). As shown 
in Table 12.10, backscatter coefficients for a given 


cloud type do not change sharply in the visible and 
near IR up to 1-2 um, but are typically smaller by 
1-2 orders of magnitude and more variable in the 
10 pm region. 


12.4.3 Wind and related 
measurement 


1. Short range up to ~1 km 
Wind measurement may be conducted with 
both pulsed and continuous wave (cw) lasers. 
The former are typically used for ranges 
>~1km and, with standard techniques of range 
gating, the range resolution is determined by 
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Figure 12.10 Flight 63 from Keflavik: altitude record of backscatter for the descent flying to the E 
along 71°N (in the region of 13°W) from 09.50 to 10.30 UTC on 17 May 1990. The strong scattering in 
the lower levels of the troposphere (< 9km) shows a very strong narrow band centred at 8.25km which 
was absent on other legs of the flight. There was a sharp temperature inversion at 11.5km; in the 
lower strasphere the scattering peaked at ~14km but was very weak in the band 12-13km. Air mass 
back trajectories for 9 days computed over the release point 71°N, 13°W at 10.15 UTC (May 1990) for 
6.5, 8.25 and 10km. Note the anticylonic loops and the remarkably different origins of the air masses 
for these trajectories from, respectively, Europe, the United States and the Arctic. 


Table 12.10 Approximate values of extinction and backscatter coefficient of different cloud types in 
the ranges ~0.4—2 and ~10 1m (PSC—polar stratospheric cloud) 


Backscatter coefficient B, 
(m-1 sr-) Extinction coefficient «, (m-') 


Altitude 

Cloud type <2um Low 10m High <2um Low 10m High (km) 
Cumulus 6.0x 10-4 1x10-5 to 1x 10-4 1.2x 10-4 5x10 -3-30 x10-3 2-10 
Stratus 5.0x 10-3 3x 10-5 to 5x 10-4 9x 10-2 1x10-4-7 x 10-2 0.2-0.7 
Alto-stratus 1.0x 10-3 1x10-> to 1x10-4 1.8x10°3 3x 103-2 10-2 2-4.5 
Cumulo- 1.0x 10-4 4x10-> to 1x 10-3 1.8x10 7! 1.510 -2-6x 10-2 2-4 

nimbus 
Cirrus 1.4x 10-5 1x10-> to 1x 10-5 2x 10-4 5x 1074-5 x 10-3 8-16 
PSC 3.0x 10-7 6x 10-6 

the laser pulse length and signal processing. attained with a cw lidar by focusing the beam 

For a cw beam, the atmosphere of course pro- to give a peak sensitivity around the focal 

vides an extended target with scattering at all range F (Figure 12.6). 

distances. Nevertheless, as discussed in Section An early measurement with a cw CO, LDV 


12.3.4, quite a sharp range resolution can be lidar is shown in Figure 12.11. This record was 
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Figure 12.11 Records of strong gusts showing the line-of-sight wind component (derived from the 
dominant frequency in the Doppler spectrum) versus time. The CW CO, lidar beam was pointed at an 
elevation angle of 30° and the probe volume was set at a range of (a) 400m and (b) 100m.t 


made under rather gusty conditions on an 
airfield with the LDV pointed at an eleva- 
tion angle of 30° into the prevailing wind; 
the Doppler spectrum was measured every 
12.8 ms. The Doppler shifted frequency of the 
strongest signal in the spectrum was con- 
verted to an analogue voltage and displayed 
as a function of time. In Figure 12.11a, the 
wind component changed from 7 to 17 ms! 
(from about 15 to 35 kt) over a period of about 
5s. Greater fluctuations are apparent at the 
shorter range in Figure 12.11b due to greater 
wind turbulence at the lower height and the 
reduced spatial averaging and integration of 
the signal over the considerably shorter probe 
volume. 

Such a system provides a single line-of-sight 
wind-velocity component. However, simple 
extension of such equipment enables one to 
determine the wind field anywhere around 
the measuring station. The laser beam may be 
scanned in order to resolve various compo- 
nents of the wind and several different scan 
patterns have been developed to suit particular 
measurement tasks. One of the simplest is to 
use a conical scan about a vertical axis. In this 
case, if the half-angle of the cone is @ (i-e., the 
angle of the lidar beam from vertical) and 9,, is 
the beam direction in the horizontal plane, the 
measured line-of-sight wind component V,, is 
given by 


Vu = Vucos(Oy — 8...) sin®+Vycos @ 
(12.24) 


where V,, and 0,, are the horizontal wind speed 
and bearing and V,, is the vertical wind speed. 
Examination of this expression shows that, as 
the beam rotates around the cone, any vertical 
up/down draughts (usually small) contribute a 
constant term V,cos® and the horizontal com- 
ponent varies with the cosine of (0,,—9,.). This 
is readily demonstrated on a polar plot of V,,; (or 
equivalent Doppler shift), as shown by the mea- 
surement example of Figure 12.12. As the beam 
rotates, the magnitude of Vj, traces out a figure- 
of-eight plot, for which the main axis gives the 
horizontal wind bearing (at 0,, equal to 0,,) and 
the amplitude gives the wind speed. For the 
example shown in Figure 12.9, the cw CO, lidar 
beam was focused at five successive heights from 
25 to 250m with three revolutions of the beam 
at each height. Each revolution was completed 
in 1s with recording of 64 individual Doppler 
spectra. With less than 2s to alter focus, the 
complete cycle of wind measurement from 25 to 
250m may be completed in less than 25s. Rapid 
analysis of the Doppler spectra, and fitting to 
Equation 12.24 to extract the wind parameters 
Vip 8, and Vy, may be carried out in real time. 
Figure 12.12 provides a clear example of changes 
of wind speed and direction with height and 
the data were acquired with a compact and 
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Figure 12.12 Polar plots of wind component at heights recorded at night in relatively calm conditions 
with a conically scanned CW CO, LDV. The wind strength (shown by the size of the figures-of-eight, 
with one scale division equal to 2 ms-') increased steadily with height from ~3.5 ms~' (25m) to ~8 ms=' 
(250m). A strong directional wind shear (shown by the axes of the wind plots) is also very obvious and 
amounts to ~36°. Data recorded with the compace, mobile DERA (Malvern LDV). 


highly mobile equipment mounted in a Land 
Rover vehicle, which could make measure- 
ments within 5 min of arrival at a site. Another 
example of measurement with a cw LDV is 
illustrated in Figure 12.13. The conical scan was 
set to examine the changes of wind speed typi- 
cally experienced by the revolving blades of a 
wind power generator. Comparison with simple 
models of turbulence showed reasonable agree- 
ment. Important questions of representativity 
and wind-velocity measurement errors for such 
systems have been considered experimentally 
and theoretically by Banakh et al. [81]. 

Several ground-based cw systems have now 
been built worldwide, mostly employing CO, 
lasers with power output in the range 4-20 W. 
LDV systems based on shorter wavelengths, 
notably with cw diodes and fiber amplifiers 
in the 1.5 pm region, also offer much promise 


and are being developed for both monostatic 
and bistatic operation [82]. Such lidars should 
be compact, do not need detector cooling and 
incorporate fiber optic techniques for ease of 
assembly and precise wave-front matching. A 
recent example has been described by Karlsson 
et al. [83] and used to investigate signal statistics 
of particulate scattering in the atmosphere [82]. 
The non-Gaussian character with high SNR 
from single particles is dramatically different 
from the complex Gaussian scattering with mul- 
tiparticles in the probe volume (see also work 
by Jarzembski et al. [84], who employed known 
particle sizes for lidar calibration). 

The many applications of cw LDV include 
wind-flow measurement around buildings, 
smoke plumes from power stations and wind 
measurement from an oil rig to compare with 
satellite observations of the sea surface. Another 
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Figure 12.13 Schematic diagram for LDV measurements of rotationally sampled winds from a scan- 
ning mirror. The cone angle was fixed at 10° and the radius of the vertical measurement circle was 
varied by altering the beam focus. (From Hardesty, R. M. and Weber, B. F. J. Atmos. Oceanic Technol., 


4, 191-203, 1987.) 


potentially important application is towards 
wind-turbine power generation; Vaughan and 
Forrester [85] identified three specific local tasks 
in addition to the strategic benefits of a global 
wind measuring system (see Section 12.9.3). 


e Site studies of local wind. Wind generators 
require to be optimally sited. Conventional 
techniques with mechanical anemometers on 
tall towers can be very expensive. LDV, with its 
speed and mobility, could have a role. 

© Complex flow around wind generators. LDV 
offers a unique capability for rapid, non-intru- 
sive measurement in quite small probe volumes 
within the complex flows generated by the 
turbine blades. 

e Wind monitoring and gust warning. 
Measurements made by scanning up to 
hundreds of meters ahead (e.g., with LDVs 
mounted on the nacelle of the generator) would 
permit advance control of blade pitch for 
most efficient power extraction. In addition to 
routine gains in efficiency, such LDVs would 
also provide warning of occasional large gusts 
or turbulence; with appropriate blade control, 
the risk of consequent catastrophic damage to 
blades or gear-train would be reduced. 


Other applications of cw LDV lidars for aviation 
(wake vortices, true airspeed, etc.) are outlined in 
Section 12.5 and military tasks in Section 12.7.5. 


2. Longer range measurements 
Laser Doppler wind measurements at ranges 
typically greater than ~1 km have been under- 
taken with several types of pulse lidar system: 


@ Coherent heterodyne at ~10 1m with CO, lasers, 
~2.1 um with Tm, Ho: YAG-YLEF lasers, 1.55 1m 
with doped fiber amplifier lasers and 1.06 .m 
with Nd-YAG lasers. These all utilize aerosol 
scattering. 

e Interferometric direct detection at 1.06, 0.53 and 
0.35 um with Nd-YAG lasers, utilizing both 
aerosol and molecular scattering. 


For completeness, one should also note non-Dop- 
pler techniques, whereby time series of the strength 
of aerosol scattering from two beams of known 
separation may be compared (e.g., with correla- 
tion techniques). The time of passage of a strong 
fluctuation from one to the other (e.g., with peak 
in the correlation function) provides a measure of 
the cross wind but has limited temporal and spatial 
resolution [86-88]. 
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Technology and application of coherent LDV have 
been well reviewed. For good frequency measure- 
ment, the transmitted laser beam requires to have 
minimal change of frequency through the duration 
of the pulse. For CO, lasers, the gas discharge con- 
ditions that ensure such good chirp characteristics, 
including gas catalysis, plasma effects and laser- 
induced media perturbation (LIMP), have been 
analysed by Willetts and Harris [89,90]. Practical 
use of heterodyne detectors, and attainment of good 
performance, has been considered by many authors; 
in one study, near ideal behavior with up to 13dB 
local oscillators shot noise was demonstrated [91] 
(see also Reference [92]). Signal processing with 
different algorithms and procedures has also been 
extensively reviewed (e.g., see References [93-98]). 

At the longer CO, wavelengths, TEA lasers have 
typically given larger pulses (0.2-3J) with p.rf. 
0.1-10 Hz; for lower pulse energy (typically 1-10 mJ) 
and higher p.r.f, q switched and mini-MOPA lasers 
are employed. As an example at longer ranges and 
higher altitudes up to 10-20km, the NOAA CO, 
lidar system has shown great reliability over many 
years. It has been used over widespread field cam- 
paigns with measurements of convective out flows, 
thunderstorm microbursts [99], sea breeze, flow in 
complex terrain [100] and downslope windstorms 
[101], etc. A record of measurement within the 


Grand Canyon taken at two levels below the rim is 
shown in Figure 12.14. Differential flow along the 
bottom of the canyon established that, on occasion, 
pollutants entered from the southwest USA and con- 
tributed to wintertime haze [102]. At shorter ranges, 
and particularly for weak winds in the boundary 
layer, smaller pulse systems have been widely devel- 
oped and used (see, e.g., References [103,104]). 

A particularly long-ranging, coherent system at 
1.06 pm was reported in 1993 by Hawley et al. [105], 
employing a 1J energy Nd-YAG coherent lidar. 
With aerosol scattering augmented by Pinatubo 
volcanic material, lidar measurements in a conical 
scan to 26km altitude were documented, as shown 
in Figure 12.15, in good agreement with rawin- 
sonde wind speed. Coherent lidar at the eye-safe 
2um wavelengths has employed lasers with pulse 
energies, typically in the range 2-50 mJ. A plan 
view of a 2m coherent system is illustrated in 
Figure 12.16. At the shorter wavelengths, range 
resolution may typically be 30-50m (see Section 
12.3); measurements on a stationary hard target 
showed bias errors of —3.3cm s! with standard 
deviation error 11cm s" [103]. Performance of a 
2m coherent system was reviewed by Frehlich 
[106]. The compactness and potential for “turn- 
key” operation are very favorable considerations 
and such systems are now commercially available. 
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Figure 12.14 Radial flow within the Grand Canyon at two levels —0.2 and -1.2km height relative to the 
canyon rim measured by CO, lidar. The shaded areas are night. Under the low wind (stagnant condi- 
tions) of the first few days, the flow at the top of the canyon is opposite in direction to the flow at the 
bottom of the canyon. In stronger winds at the end of the period, the flows were in the same direction. 
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Figure 12.15 Comparison of wind speed with altitude for lidar and rawinsonde to 26km measured 
above the Kennedy Space Centre Shuttle Landing Facility. The 1.06ym lidar scanned at 20° off verti- 
cal at six fixed positions with 100 pulses averaging at each position. A complete record was com- 
pleted in 3min. (From Hawley, J. G. et al., Appl. Opt., 32, 4557-4568, 1993.) 
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Figure 12.16 Plan view of a 2m injection-seeded coherent lidar reviewer. (Hardesty, R. M. and 
Huffaker, R. M., Remote sensing of atmospheric wind velocities using solid state and CO, coherent 


laser systems, Proc. IEEE @ 1996 IEEE.) 
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A pulsed lidar system based on fiber optic technol- 
ogy operating at 1-548, with an erbium doped 
fiber has recently been described by Pearson et al. 
[107]. Operation in both bistatic and monostatic 
antenna configurations was investigated. 

In addition to wind measurement with such 
lidars, it is worth noting that the strength of aero- 
sol scattering may also provide useful information. 
One proposal has, for example, suggested the early 
detection of forest fires from the increased scanning 
due to quite minor smoke plumes. A lidar beam 
scattering from an elevated platform could provide 
coverage and monitoring over quite large areas. 

As discussed in Section 12.3, the design consid- 
erations for direct detection, interferometric lidars 
are very different. In particular, the signal may be 
accumulated from many small laser pulses without 
undue penalty (see Figure 12.3) and across many 
optical modes. In addition, at shorter wavelengths, 
the molecular scattering is stronger and may be 
utilized. A number of different interferometric 
techniques have been developed: 


1. Fringe imaging (also called “multi-channel”) 
with multi-beam interference fringes for both 
aerosol and molecular scattering using either 
Fabry-Perot or Fizeau interferometers. 

2. Double edge (more strictly “dual channel”) 
with two pass-band Fabry-Perot filters set on 
either side of the scattered spectrum. 

3. Visibility curve with two-beam interfer- 
ence fringes formed in a Mach-Zehnder 
configuration. 


Considerable controversy has been generated 
in the literature by the proponents of each tech- 
nique. Theoretical calculations typically show cal- 
culated performance within a factor of 2-3 of the 
Cramer-Rao limit (discussed in Section 12.3.2; see, 
e.g., Reference [108]). Several fringe imaging and 
double-edge systems have been built and measure- 
ments made high into the atmosphere. The visibil- 
ity curve (two-beam) technique is more recent but 
shows promising performance (see Reference [109]). 

In the fringe imaging technique, the spectrum 
of the scattered light is imaged across a number of 
detector elements (typically 10-20). A high-res- 
olution system is obviously required for analysis 
of the narrow aerosol return (see Figure 12.7) and 
the molecular spectrum is spread as a background 
noise across all the detector elements. The actual 


width of the aerosol signal is likely to be dominated 
by the laser line width itself. If this is of the order 
of 100 MHz (equivalent to ~20 ms“! Doppler width 
at 355 nm wavelength) then the Cramer-Rao limit 
discussed in Section 12.3 shows that, for a mea- 
surement accuracy of 1 ms“, the minimum num- 
ber of photons required is given by 

Ny, 2=(nT,) '(0.425)’(20)? (12.25) 
where nT, is an overall instrumental efficiency 
made up of n-detector efficiency and T, instrumen- 
tal transmission. Supposing nTt~0.05 at best gives 
a minimum input requirement at the receiving 
telescope of 


N, 21.5 10° photons (12.26) 


neglecting the impact of molecular background, 
daylight scattering, detector noise, etc. A lower 
resolution system for analysis of the much wider 
molecular Rayleigh line (of equivalent width ~600 
ms at 355 nm) with similar assumptions would 
require a minimum of ~1.3 x 10° photons. 

Several fringe imaging lidars have been built 
and deployed in field trials (see, e.g., References 
[110,111]). Operation has usually been at 532 nm 
with frequency doubled Nd-YAG lasers. A particu- 
larly powerful system with 1.8 m diameter telescopes 
has been deployed at the ALOMAR Observatory 
(Andoya Rocket Range, Andenes, Northern 
Norway 69°N, 16°E). It has been used regularly for 
stratospheric wind measurements by day and night, 
through summer and winter (see Reference [112]). 

In the double-edge technique, Doppler shifts 
are derived by calibration of the response function 
R given by 


where I, and I, are the measured signals pass- 
ing through the two fixed filters A and B placed 
on either side of the scattered light spectrum, as 
shown in Figure 12.17. As the spectral line shifts 
in frequency, the relative magnitudes of I, and I, 
change. The fixed filters are formed by two Fabry- 
Perot cavities of slightly different spacing. With 
suitably chosen frequency separation of the filters, 
operation on the molecular spectrum can be made 
relatively insensitive to the narrow band, often 
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Figure 12.17 Schematic of the double edge technique with two filters A and B set on either side of 
the broad molecular spectrum as utilised for scattering at 0.35 and 0.53pm. The technique can also 
be applied to the relatively stronger aerosol channel at 1.061m with appropriately spaced filters at 


higher resolution. 


highly variable, aerosol scattering at the centre. 
Precise calibration of R also needs to take account 
of the exact form of the broad molecular spectrum, 
which varies with temperature through the atmo- 
sphere. Several double-edge systems have been 
built [113-115]; one of the first, that at Observatoire 
Haute Provence (OHP) in southern France, operat- 
ing at 0.53 um, has made extensive measurements 
into the lower stratosphere over several years [116]. 

A number of field trials comparing different 
LDV lidars have now been conducted. The VALID 
trial at OHP in July 1999 [120] brought together 
double-edge (0.53 1m), fringe imaging (0.53 pm) 
and heterodyne (2.1 and 10.6j1m) lidars for com- 
parison with a microwave wind-profiler radar and 
balloon radiosondes. Forty-six data sets were accu- 
mulated and generally showed good agreement 
with cross-correlation coefficients above 60%. 
For some specific meteorological cases (e.g., a jet 
stream), measurement discrepancies were readily 
explained. At a trial in September 2000 at Bartlett, 


New Hampshire, USA, a fringe imaging lidar 
(0.53 um), a double edge lidar (aerosol channel at 
1.06m, molecular at 0.35j1m) and heterodyne 
lidar (1.06m) were deployed with microwave 
wind profiler and regular radiosonde launches. 
Generally reasonable agreement in a variety of 
atmospheric conditions was obtained. 


12.4.4 Cloud measurement 


Detailed knowledge of cloud parameters, as indi- 
cated in Table 12.11, is important for three major 
applications: climatological studies, weather fore- 
casting and local information, particularly in avia- 
tion. In addition, knowledge of polar stratospheric 
clouds (PSCs) and their role in atmospheric 
chemo-dynamics, particularly for ozone destruc- 
tion, is of great significance (see Section 12.6). 
Laser radar can be used to measure cloud height, 
vertical extent, structure, optical thickness, statis- 
tical distribution, classification, presence of thin 


Table 12.11 Application of cloud measurement data (height, structure, etc.) 


Climatological studies 


Radiation balance 


Global energy balance 


Aqueous phase chemistry 


Monitoring of polar stratospheric clouds and ozone studies 


Weather forecasting 


Delineation of air mass discontinuities 


Assessment of frontal or convective activity 


Assimilation into numerical weather forecasting models, 
mesoscale, regional and global 


Improved quality of humidity and temperature retrievals 


Improvements in cloud vector winds and sea surface 
temperature 


Local information 


Objective assessment of cloud base and cloud cover, 


particularly for aviation and airport operation 


Precipitation and vertical visibility 


458 Remote optical sensing by laser 


and subvisual cirrus, precipitation, etc., and many 
studies have been conducted (see, e.g., References 
[59,118-121]). Spaceborne lidar systems obviously 
have the potential to provide such information ona 
global scale, as illustrated, for example, in the dis- 
cussion following the first such demonstration of 
the NASA LITE equipment [72]. 

For local measurements, ground-based systems 
have been extensively developed and are commer- 
cially available. Relatively simple techniques with 
pulsed lasers are employed with direct detection of 
the scattered light along the beam in ranging inter- 
vals of typically 100 ns (15m equivalent). In fair 
weather and well-marked clouds, the basic cloud 
height measurement is a straightforward ranging 
task. Consideration does need to be given to the 
precise definition of “cloud height”—should it be 
the initial return from the lowest fluffs of cloud, 
the height of the steepest increase in backscatter 
or the height of maximum backscattered signal? 
In a typical commercial lidar ceilometer (e.g., the 
Vailsala CT25K), the processing algorithm selects 
a height in the region of peak backscatter—which 
corresponds to the height from which pilots can 
usually see the ground well. 

In severe weather, the problem becomes much 
more difficult; stray signals will be evident from 
rain, snow, virga (precipitation out of cloud evapo- 
rating before it reaches the ground), haze and fog. 
Quite complex algorithms have been developed to 
invert the measured backscatter profile to provide 
a credible vertical extinction coefficient profile. 
Thresholding criteria based on experience and 
extensive observations can then be applied to the 
latter to determine and classify a cloud presence. 
As mentioned, the Vaisala CT25K lidar ceilom- 
eter is a robust, fully automated system based on 
a single lens, monostatic transmitter/receiver and 
an InGaAs pulsed diode laser operating at 905 nm. 
Measurements can be made from 0 to 25,000 ft 
(7.5km) with 500 range gates and range resolution 
of 50ft (15m) in a programmable measurement 
cycle of 15-120s. 

This equipment has been further developed 
with partial funding by the European Space 
Agency and uses an array of four standard equip- 
ments operated in synchrony. Above 1000m, all 
four beams overlap so that every receiver collects 
scattered light from every transmitted beam. The 
increased SNR ensures that this more powerful 
equipment provides a measurement capability on 


cloud up to the 75,000ft (21km) altitude refer- 
enced by the World Meteorological Organisation 
(WMO) in 1983 as the upper limit for cloud height 
observations. 


12.5 LIDAR AND AVIATION 


Following the demonstration of lidar measure- 
ments in the atmosphere, the application of such 
techniques to problems in aviation was rapidly 
developed. Immediately obvious topics were mea- 
surement of true airspeed, warning of wind shear 
and turbulence, calibration of pitot-static pressure- 
differential probes and the potential hazards of 
aircraft wake vortices. For operation in aircraft, 
equipments must obviously be compact, reliable 
and robust to the environment of vibration and 
reduced pressure. 


12.5.1 True airspeed, wind shear, 
turbulence and pressure error 
calibration 


In the late 1970s, a number of airborne Doppler 
lidars were built. One of the earliest was quite a 
large-pulsed CO, system designed to look for- 
ward of the aircraft at ranges >5km. One of the 
main aims, to establish whether the equipment 
could detect clear air turbulence at high levels in 
the atmosphere, was successfully demonstrated 
(see, e.g., References [49,103]). For measurements 
at shorter ranges, the Laser True Airspeed System 
(LATAS) was built at the Royal Signals and Radar 
Establishment (RSRE) in 1980 and flown in air- 
craft of the Royal Aircraft Establishment (RAE). 
Figure 12.18 shows the installation of LATAS in 
the unpressurized nose of an HS125 executive jet- 
type aircraft. The continuous wave CO, laser gave a 
power output of ~3 W and the optics head was con- 
tained in a well-insulated temperature-controlled 
enclosure. All controls were operated remotely 
from a console in the aircraft cabin. Figure 12.19 
shows a pair of typical spectra recorded at differ- 
ent levels in the atmosphere; one shows increased 
levels of turbulence in the probe volume. 

From the peak of such spectra, the true air- 
speed of the aircraft (that is, the speed of the 
aircraft relative to the air it is moving in) can be 
determined with an absolute accuracy of typically 
better than 0.2ms". Figure 12.20 shows a record 
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Figure 12.18 The Laser True Airspeed System (LATAS) installed in the unpressurized nose of an HS125 
aircraft. For flight, this was covered with a nose cone holding a germanium window through which the 
lidar beam emerged. 


(a) (b) 
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Figure 12.19 Typical Doppler spectra recorded in flight with the LATAS equipment at focal range of 
100m with 25 ms experiment time: (a) height 5000ft, the peak frequency corresponds to 225.2kt; (b) 
height 960ft, the mean frequency corresponds to 215.0kt; note the broadened spectral width due to 
the turbulence of ~5kt. 
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Figure 12.20 True airspeed recorded against time. The top trace is the lidar data with sampling at 
25 s-'. The lower trace is the speed calculated from a pressure-differential gust probe mounted on 
a boom. Note the close correspondence of velocity structure. (From Keeler, R. J.et al., J. Atmos. 
Oceanic Technol., 4(1), 113-137, 1987.) 
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of such measurements made at 25 s"! in the atmo- 
spheric boundary layer [122]. For these studies, 
the lidar was focused at a reduced range of 17m 
(but sufficiently far enough to be away from any 
disturbance of the advancing aircraft) to give good 
comparison with a small, conventional, pressure- 
differential gust probe mounted in a thin boom 
projecting ~1 m ahead of the aircraft. The air speed 
derived from the pressure-differential system is 
also shown in Figure 12.20. The small difference 
between the two airspeeds (about 3 ms~!) almost 
certainly arises from inaccurate correction of the 
pressure data, and provides the basis for calibrat- 
ing such devices as discussed later. Closer exam- 
ination of the data in Figure 12.20 shows a time 
offset of ~0.11s (clearly corresponding to the 16m 
forward look of the lidar), and excellent agreement 
of the detailed structure (and computed variances) 
for the two sets of data. 

Although clear air turbulence at high level in 
the atmosphere is troubling to aircraft, with risk 
of injury to unbelted passengers and crew, few if 
any aircraft have been caused to crash. Low-level 
wind shear, on the other hand, often but not always 
associated with thunderstorm activity, has caused 
many serious accidents, notably with aircraft in 
landing or take-off phase. At this stage, aircraft 
have a low airspeed, typically ~120 kt (~60 ms-’). 
In the presence of wind shear, as the aircraft passes 
into a region of notably different wind speed, its 
lift may be greatly reduced. The pilot may have 
insufficient time to accelerate the aircraft up to 
sufficient airspeed to keep it aloft. Figure 12.21 
shows an early LATAS record of passage through a 
thunderstorm microburst during the Joint Airport 
Weather Studies (JAWS) trial in Colorado in 1982. 
In this case, the focal range was set 250-300m 
ahead of the aircraft. The sensitivity extended out 
to 700-800m and thus strong shear or turbulent 
structures entering the extended probe at longer 
range were evident. In Figure 12.21, the sequence 
of Doppler spectra in the lidar record showed a 
headwind that changed by over 40 kt (~20 ms~) 
in about 5s. There was an additional down draft 
of ~6 ms". Analysis of these measurements con- 
tributed to the development of a descending 
vortex-ring model for thunderstorm microburst 
behavior, in contrast to the more usual vertical jet 
and outflow model. Simulations with the LATAS 
parameters have shown that the ~5-10s warning 
of shear from a probe range of ~300-600m could 


be useful if heeded promptly [123-125]. Wind 
shear and microbursts are dynamic phenomena; 
indeed, the simulations showed that there was 
significant advantage in controlling the aircraft 
(a medium-size passenger jet) using the airspeed 
measured ~300m ahead, but increasing the dis- 
tance to 600m produced little further improve- 
ment. Nevertheless, in terms of general operation, 
it would be useful to look at a greater range with 
pulsed systems and a number of airborne equip- 
ments were built in NASA-supported programmes. 
Both CO, 10 pm and solid state 1 and 2 pm lidars 
were successfully demonstrated at ranges to ~3 km. 
Comparative performance in terms of range and 
velocity resolution, and atmospheric factors in dif- 
ferent conditions of rain, high humidity, etc., were 
assessed [126]. 

While wind shear remains a severe problem, the 
widespread deployment of lidars in civil airliners 
nevertheless appears unlikely in the near future. 
Increased awareness of the meteorological fac- 
tors that precipitate severe wind shear, combined 
with the warning from large powerful Terminal 
Doppler Weather Radars (TDWR) operating at 
major airports, provide improved confidence and 
safety in airport operation. Nevertheless, it may 
be remarked that lidar has a very strong capability 
to detect so-called “dry” microbursts (i.e., without 
associated rain) and is thus complementary to the 
high sensitivity of TDWR for conditions of rain 
and high humidity. 

In an extensive development in France in the 
late 1980s, a CO, 10 pm cw lidar was similarly used 
for true airspeed measurement and flown in a heli- 
copter and various transport and fighter aircraft. 
This was subsequently configured into a three-axis 
equipment called ALEV-3 by Sextant Avionique, 
as shown in Figure 12.22. This lidar has been used 
for air data calibration and aircraft certification in 
extensive flight tests of Airbus aircraft. With the 
beams focused at 70m, the equipment gives a pre- 
cise, real-time measurement of the airspeed vector 
and thus permits the calibration of static pressure 
and the angles of incidence and sideslip [127]. 

An advanced optical air data system (OADS) 
for measurements high in the atmosphere was 
built and flown in the early 1990s [128,129]. Good 
performance at ranges to a few tens of metres 
was obtained with a direct detection technique 
in which the velocity components were derived 
from the measured transit times of aerosols 


12.5 Lidar and aviation 461 


enon Netiiimencemsennns : 
HS125 flight 792 file 4 


channels 270-430 


Jaws trial ‘82 6 
———— cient ingemsip OO Wii 
——————OoOO= 


P= 


~20 ms~! Shear 


i i 4, eS / In~5s 
ge, 


ag. ncaa 35 Microburst Windshear 


woman Oa 36 Doppler Spectra at 
———”" Nes 37 0.5 s intervals. 
caspauty, Smaaen, 38 Ground speed ~100 ms-! 


0 So) 


(~55 kts) 


Figure 12.21 Sequence of lidar spectra recorded at 0.5s interval during aircraft passage through a 
thunderstorm microburst. Note the severe wind shear (40 kt, ~20 ms“ in ~5s) in the record. 
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Figure 12.22 The ALEV-3 airborne lidar for air data calibration and aircraft certification. The opti- 
cal axes chosen for transport aircraft. The lidar (optical unit) and installation in the aircraft. (From 
Morbieu, B. et al. ALEV3, a 3-axis CO, CW anemometer for aircraft certification. Proceedings of 7th 
Conference on Coherent Laser Radar: Applications and Technology, Paris, Paper, 1993.) 


between light sheets. This programme is outlined 
in Chapter 18 of the present volume and used as an 
exemplar of optical engineering practice in a hos- 
tile environment. 

More recently, an airborne wind infrared 
Doppler lidar (WIND) has been developed in 
cooperation between CNRS and CNES in France 
and DLR-Munich in Germany [130]. In this lidar, 
built more for meteorological investigations and as 
a precursor for study of spaceborne systems, the 
TE-CO, pulse laser operates at ~300 mJ pulse and 
4 or 10 Hz prf. Successful measurements from an 
altitude of 10km, with downward conical scan- 
ning at 30° from nadir, have been undertaken in 
international field campaigns since 1999 [131]. 


12.5.2 Aircraft wake vortices 


In the process of generating lift (deriving from the 
pressure difference between the upper and lower 
surfaces of the wing aerofoil), all aircraft (including 
helicopters) create transverse rotational flow in the 
air that has passed over each wing. This rotational 
flow rapidly evolves into two powerful counter- 
rotating vortices that extend as a pair of trailing 
ribbons behind the aircraft. The initial separation 
of the vortices is about 80% of the wing span; their 
rotational sense produces a strong downdraft in 
the region between them, and in free air they tend 
to sink at a rate of 1-2 ms” Their trajectory is, of 


course, largely determined by meteorological con- 
ditions; on approach to the ground, the sink rate 
is reduced and the vortices tend to separate. An 
extensive assessment and bibliography of the then 
state of knowledge of wake vortices was given by 
Hallock in 1991 [132]. 

The existence of such wake vortices, some- 
times rather dramatically referred to as “hori- 
zontal tornados”, represents a potential hazard 
to following aircraft, particularly for smaller air- 
craft following larger in the vicinity of airports 
on landing and take off. The Air Traffic Control 
System guards against this hazard by the appli- 
cation of separation minima between all pairs 
of aircraft operating under Instrument Flight 
Rules. The larger and heavier the aircraft, the 
more powerful the vortex it creates and hence 
the separation distances depends on the types 
of aircraft (set into various weight categories) 
involved. Such separation minima provide a sig- 
nificant constraint on airport capacity—it has 
been estimated, for example, that if they were 
not necessary the arrival capacity at Heathrow 
could be increased by up to five aircraft per hour. 
Obviously, the separations must be sufficiently 
large to ensure safety, but they should not be so 
excessively conservative as to unduly restrict the 
capacity of airports. 

The characteristics of wake vortices—their for- 
mation, evolution, persistence, trajectory, mode of 
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decay, etc.—have been studied for many years. In 
particular, powerful techniques of computational 
fluid dynamics and large wind tunnel measure- 
ments on reduced scale model aircraft provide 
information on the early stages of vortex forma- 
tion and evolution. Recent water tank studies have 
extended this to the late stages. The positive and 
complementary features of lidar measurements are 
that they can be carried out on full-scale aircraft, 
in the real atmosphere and potentially at long dis- 
tance (many 100s of wing span) downstream of the 
generating aircraft and thus on fully mature and 
decaying vortices. 

Wake vortices were, in fact, amongst the very 
first subjects of study by coherent laser radar. A 
demonstration of coherent 10 pm LDV measure- 
ments with cwlidars was made in 1970 ina NASA- 
sponsored programme. Subsequently, during the 


100 
80 


60 


h(m) 


40 


20 


—300 —200 —100 


1970s and early 1980s, very extensive NASA-and 
Federal Aviation Agency (FAA)-funded inves- 
tigations were made and helped to define vor- 
tex separation standards [132]. In the 1980s, 
the German DLR Institute of Optoelectronic 
deployed a high-performance cw lidar at 
Frankfurt Airport between two parallel runways 
with the prime aim of examining the risk due 
to a cross wind possibly transporting vortices 
from one runway to the other [133]. The runway 
separation of 520m is often too small for oper- 
ating both runways independently with respect 
to wake vortices. Figure 12.23 shows a section of 
the vertical measurement plane with examples of 
vortices moving under the prevailing cross wind 
from one runway to the other. A number of such 
vortices show a steep ascent towards the paral- 
lel runway. This bouncing effect may enhance the 
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Figure 12.23 Position of wake vortices at Frankfurt airport measured by the Laser Doppler 
Anemometer (LDA) situated between the two parallel runways 25L (left) and 25R (right). The upper 
record shows the propagation of Boeing 747 vortices generated on runway 25L moving towards 25R, 
under the prevailing cross wind, and the lower from 25R in the opposite wind flow. 
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hazard since the vortex will tend to pass near the 
altitude of approaching aircraft. This lidar work 
has contributed to the development of a strategy 
of operation for Frankfurt Airport in various 
cross wind conditions. 

With increasing demand in the 1990s for avail- 
able runways, it is clear that wake vortices and the 
required separation minima limit the capacity for 
the world’s busier airports. There has thus been 
increased interest recently in lidar measurements 
with NASA and FAA programmes in the USA, 
Civil Aviation Authority-funded programmes in 
the UK and EC-funded programmes in Europe. 


(a) 


This includes 10 pm cw lidars with controlled 
tracking of vortex position and comparison with 
acoustic sensors and mechanical anemometers. In 
addition, short wavelength 1 and 2 pm pulse lidars 
have been used for longer ranges. 

A schematic illustration of the flow field and 
velocity distribution for vortices is shown in 
Figure 12.24. The Doppler lidar spectra expected 
for a beam intersecting such a flow is somewhat 
complex due to the changing line-of-sight veloc- 
ity component along the beam and the spatial 
weighting function of the lidar. The form gener- 
ally expected is shown Figure 12.5, as calculated 
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Figure 12.24 Schematic of vortex flow and lidar spectra. (a) Cross section of a pair of rotating vorti- 
ces. Two positions A and B of a lidar beam intersecting the left hand vortex at tangent positions t, 
and t, are shown. (b) Approximate peak velocity profile V(r) along the line between the vortex cores. 
(c) Doppler spectra for the lidar beams at A and B. Analysis shows that scattering from the immediate 
tangent regions t, and t, give the peaks in spectra at Doppler frequency f(t,) and f(t,) equal to 2V(t,)/A 


and to 2V(t,)/A, respectively. 
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Figure 12.25 Doppler spectra (recorded at 25 s“') from wake vortices carried by the prevailing cross 
wind through a static, upward looking lidar. Each spectrum is plotted vertically and color coded for 
amplitude as shown by the insert. The time after passage of the generating aircraft is shown on the 
horizontal scale; the Doppler frequency (left hand) and equivalent velocity component (right hand) are 
on the vertical scales. (a) Pair of vortices from a B757 aircraft (CAA 320133, 3 October 1995); lidar ele- 
vation 90° and focal range 110m. The cross wind was 4.9 ms“note the very sharp profiles of the vor- 
tices, rising close to the top of the scale, on either side of the cores. The downflow midway between 
the vortices is 7.4 ms~. (b) Pair of vortices from a B747-200 aircraft (CAA 320068, 30 October 1995); 
lidar elevation 90° and focal range 115m. The cross wind was 6.0 ms“! and downdraft ~0.7 ms~' Note 
the characteristic shape with broad, steadily rising profile up to ~12 ms“! and then rounding over into 
a series of inversions. The corrected downflow V, between the vortices is 7.78 ms“. 


by Constant et al. [134]. Most importantly, scat- 
tering from the immediate tangent region pro- 
vides the highest frequency peak component in 
the spectra. The intensity of these peaks rapidly 
weakens as the beam gets closer to the vortex core. 
Nevertheless, with a lidar of adequate spatial and 
spectral resolution, their measurement provides a 


velocity profile through the vortices, as shown in 
the series of experimental spectra in Figure 12.25 
with a fixed pointing lidar. In these pictures, the 
spectra are plotted vertically and coded for inten- 
sity as shown in the inset. The rapidly rising peak 
Doppler frequency on approach to the vortex core 
is very obvious. 


466 Remote optical sensing by laser 


With the lidar beam scanning to the side, many 
successive intersections on vortices with the char- 
acteristic cusp-like signatures may be obtained. 
Analysis of a record from Heathrow showed a 
very remarkable result in that the near-wing vor- 
tex from a B-747 aircraft pursued the unusual path 
shown in Figure 12.26. After initially sinking to 
about 40 m above ground level, it subsequently rose 
and, at ~70s, was back at the glide slope with essen- 
tially undiminished strength. This is a good exam- 
ple where the ATC separation standards worked 
well—the following aircraft passed through at 
~102s, well after the vortex had moved away. The 
unusual character of this example must be empha- 
sized; in the overwhelming majority of the nearly 
3000 lidar records at Heathrow, the vortices were 
rapidly convected away from the glide slope by the 
prevailing wind within 30-40s. 

Much quantitative data on vortex character, 
strength and longevity may be obtained from such 
lidar records. The strength of vortex circulation or 
vorticity I(r) in m? s~! is given by the expression 


T(r) = 2nrV(r) 


(12.28) 


100 0-2 m/s 


Meters 


Cross wind 


where V(r) is the rotational speed at radius r. From 
Figure 12.25, values of vorticity for the B757 vortex 
of T'(14.3 m)=331 m? s“! (at mean age 16s) and for 
the B747 of T (22.4m)=545 m? s“! (at mean age of 
17s) are obtained, in reasonable agreement with 
expectation for these aircraft in landing configu- 
ration. As a further example, from the scanned 
record for Figure 12.26, much detailed information 
may be extracted, including the values and changes 
of vorticity at different radii over the full observa- 
tion period. The circulation [’ (13.5m) remained 
close to 510 m2 s~! out to 70s [135]. 

In recent years, study of aircraft wake vorti- 
ces has further intensified; EC-supported pro- 
grammes have been very fruitful in combining 
lidar studies with wind tunnel, water tank, 
catapult launching of models and computer 
fluid dynamics (CFD) investigations (see, e.g., 
Reference [136]). Pulse laser systems at 2 1m wave 
length and spatial resolution of ~50m may also 
be used for localization and measurement of vor- 
tices at ranges greater than ~1 km, as first dem- 
onstrated by Hannon and Thomson [137]. Lidar 
studies include measurement of military-type 
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Figure 12.26 Reconstruction of vortex trajectories for record CAA 80039. The lidar was sited at a 
position ~110m north of the centre line with the aircraft (B747-200) ~85m above ground flying E to 
W. The lidar was set to focus at ~128m range and the scan in the N-S plane is shown extending over 
31.5410°. The wind was WNW, about 5-6 ms“! at ~295° at a height of 75m, giving a variable cross 
wind of 0-2 ms“! from the north. The weather was cold (3—5°) with occasional rain and sun. 
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aircraft in Germany [138], report of measure- 
ments at Dallas/Fort Worth airport [139] and 
at Heathrow [140,141]. Very recent collabora- 
tive campaigns with several lidar systems, both 
CW and pulsed, have been held at European air- 
fields. Precise comparisons of wind tunnel data 
with lidar measurements have been made [142], 
and observations reported of an unusual struc- 
ture with a vortex within a vortex [143]. A recent 
paper considered the impact on helicopter han- 
dling of vortex encounters using lidar data [144]. 

In summary, a broad view of the application 
of lidar to problems of aircraft wake vortices sug- 
gest several principal areas. First, at the most basic 
level, there is the important contribution to fun- 
damental knowledge of vortices—their forma- 
tion, interaction, transport, decay and dissipation 
and comparison with data using other techniques. 
This potentially should input into detailed aircraft 
design (e.g., wing configuration, interaction of 
engine, flap and tip vortices, etc.) with develop- 
ment of low vorticity vortices (lvv) or quickly dis- 
sipating vortices (qdv). Such progress is important 
particularly for the very large transport aircraft 
now being developed (e.g., the Airbus A380). For 
airport operation, two types of capacity improve- 
ment may be envisaged: 


© Strategic: scheduled increase in arrival/depar- 
ture slots. Increased knowledge of vortex decay 
and environmental interaction could lead to 
small capacity gains from refined separation 
standards. Even a few slots per day at a busy 
airport would be of great value. 

e Tactical: considerable scope for local hour- 
by-hour ATC decisions to reduce separations 
based on meteorological and lidar (LDV) 
information monitoring. Potentially large fuel 
savings and reduction of delays, particularly 
for aircraft in holding patterns, should be 
attainable. 


This last point raises the question as to whether 
effective forms of routine monitoring by lidar 
could be developed to provide useful vortex advi- 
sory systems for air traffic controllers. Any such 
airport advisory system must of course be cost- 
effective to install and operate and hence the tech- 
nology must above all be robust, reliable and easily 
maintained. 


12.6 CHEMICAL, ENVIRONMENTAL 
AND POLLUTION STUDY 


The three primary phenomena for detection 
and measurement of chemical species and their 
environment have been noted in Section 12.2: 
differential absorption, Raman scattering and 
fluorescence. Lidar techniques using these phe- 
nomena have been widely developed for studies 
ranging from short range monitoring (~10-100 m) 
across, e.g., chemical plants to measurements high 
into the mesosphere even above 100km altitude. 
Differential absorption lidars (DIAL) are available 
commercially and may be employed routinely for 
pollution monitoring across, e.g., industrial sites 
and cities. Many lidars have been incorporated in 
high flying, long-range aircraft and have provided 
vital information on atmospheric chemistry and 
dynamics, including analysis of polar ozone deple- 
tion. The basis of the techniques with a few recent 
examples are outlined. Detailed accounts may be 
found in, e.g., References [18,21,22,33,97]. A con- 
cise review of DIAL and Raman lidar for water 
vapour profile measurements has been given by 
Grant [145] with 133 references. 


12.6.1 Differential absorption lidar 
(DIAL) 


The simple principle of DIAL lidar is the com- 
parison of the backscattered signal (from aerosols 
and molecules) for two laser beams traversing the 
same path; one beam is tuned to an absorption 
feature of the chemical species under study while 
the other beam is well off absorption. For a pulse 
lidar differential analysis of the two signals S, 
and S, with highly developed retrieval algorithms 
[146] will provide a range resolved concentration 
density map of the absorbing chemical species. 
For some specialist applications, cw lasers oper- 
ating on fixed paths to topographic targets or 
retroflectors can provide a path-integrated mea- 
surement. This may be suitable for, e.g., 24-h sur- 
veillance across a chemical processing plant with 
the beams successively traversing various sensi- 
tive areas of processing reactors, storage tanks, 
etc. Such lidars can guard against any sudden cat- 
astrophic escape of materials and contribute also 
to the detection of low level leaks with improved 
operating efficiency. 
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A vast range of lasers have been used for DIAL 
equipments ranging in wavelength from near UV 
to mid-IR. Typically in the range 250-435 nm, spe- 
cies such as toluene, benzene, SO,, NO, and ozone 
can be detected with dye lasers or frequency dou- 
bled or tripled light from, e.g., a Ti sapphire laser. 
Water vapour and molecular oxygen have absorp- 
tion features with strong, well resolved lines in the 
near-IR in the region 720, 820 and 930 nm (water 
vapour) and 760 nm (oxygen). Dye lasers, tunable 
vibronic and solid-state lasers have been employed 
amongst others. The longer wavelengths 1-4.5 um 
are well suited to detection of, e.g., methane, eth- 
ane, hydrogen chloride, hydrogen sulphide, nitrous 
oxide and many hydrocarbons. Varying laser dyes 
with sum and frequency difference mixing in 
non-linear crystals provides tunable coverage of 
much of the range, together with tunable OPO 
lasers. In the range 9-12 um, covered by CO, laser 
wavelengths, there are many absorption features 
for hydrocarbons and various pollutant and toxic 
materials which can also be investigated by DIAL. 

A schematic of a DIAL lidar is shown in Figure 
12.27; clearly, a basic technical requirement is 
the accurate positioning of the laser emissions 
with respect to the selected molecular absorption 
line (typically within 0.001cm"), with the laser 
line width narrower than the absorption feature. 
Separation of the scattered radiation at the two 
wavelengths (if transmitted simultaneously from 


two laser sources) may be achieved with narrow- 
band filters. Otherwise, switching between the two 
“on” and “off” lines must be accomplished swiftly 
and precisely with typical pairs of pulses at > 10 Hz 
rate. Attainable accuracy for concentration mea- 
surement depends on the parameters of the specific 
application, e.g., spatial and temporal resolution 
and the material under study. Typically detection 
limits below 10 ppb can be achieved with an accu- 
racy of better than a few ppb. Measurements in 
the atmosphere for meteorological investigations 
have been widely developed. With molecular oxy- 
gen as the absorber and operating in the wings of 
the absorption lines, surface pressure and pressure 
profile can be measured with an accuracy of order 
0.2%. Similarly, the temperature profile may be 
measured with an accuracy of typically better than 
1 K through the tropopause using highly excited 
vibration lines of molecular oxygen with appropri- 
ate choice of emitted laser frequencies. Many inter- 
comparisons and sensitivity analyses of different 
equipments have been made (see, e.g., References 
(147-149]). 

For the shorter wavelengths, techniques of direct 
detection are employed with narrow-band opti- 
cal filters for isolation of the required wavelengths 
and reduction of background. At longer wave- 
lengths, heterodyne techniques have been inves- 
tigated in order to improve SNR with inherently 
noisy detectors. The speckle-induced fluctuations 
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Figure 12.27 Schematic of a DIAL lidar showing the laser's transmitter and receiver. 
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Figure 12.28 Comparison of the results for two ozone DIAL lidars spaced close together. The vertical 
arrows show the measurement day during the trial’s period [148]. 


in such coherent systems require averaging, but use- 
ful improvements in range performance with CO, 
DIAL systems have been demonstrated (see, e.g., 
References [150-152]). Speckle correlation and aver- 
aging with two or more wavelengths for improved 
accuracy in DIAL have also been examined [153]. 

DIAL systems have now been used for many 
years in both fixed and mobile ground stations. 
An interesting comparison of results for two 
high-performance ozone DIAL lidars is shown in 
Figure 12.28 (from Reference [148]). A number of 
extensive campaigns have been conducted across 
European cities (e.g., Athens in 1994, Grenoble 
in 1999) bringing together a range of lidars and 
monitoring equipment for study of ozone smog 
and pollution episodes. Several large permanent 
ground stations (e.g., at Eureka in the Canadian 
High Arctic [35], at Alomar in northern Norway 
[112] and at OHP in southern France [154]) have 
been established for longer term monitoring of the 
atmosphere, often combining different lidar facili- 
ties with DIAL, Rayleigh, Mie and Raman lidar 
detection channels. 

Measurements from aircraft have been par- 
ticularly notable, particularly in the combination 
of aerosol and DIAL lidars. In the winter of 1999- 
2000, for example, the NASA Langley airborne 
equipment measured ozone at DIAL wavelengths 
301.6 and 310.9nm and aerosols at 1064, 622 and 
3llnm, giving vertical profiles from 12 to 28km 


across the wintertime Arctic vortex. Further valu- 
able information on the interaction of PSCs and 
their implications for chemical depletion of O; was 
obtained [22,33,155]. 


12.6.2 Raman lidar 


Raman scattering provides an effective means for 
chemical species and aerosol measurement with 
lidar systems operating at shorter wavelengths to 
give the strongest scattering. Frequency doubled 
or tripled Nd-YAG lasers have been used; with an 
XeF examiner laser operating at 315nm, Raman 
returns from O, (371.5nm), N, (382.5nm) and 
water vapour (402.8nm) are created. Systems typi- 
cally operate at several hundred Hz with 50-100 mJ 
per pulse, with collecting telescopes up to 1m 
diameter and range resolution 20-100 m. The role 
of water vapour in atmospheric models for numer- 
ical weather prediction (NWP), and as the lead- 
ing greenhouse gas, is particularly important and 
Raman lidar measurements offer valuable infor- 
mation (see, e.g., Reference [156]); a measurement 
example is shown in Figure 12.29 [157]. Calibration 
techniques for Raman lidar water vapour mea- 
surements have been widely investigated (e.g., 
Reference [158]) using, for example, comparison 
with radiosondes providing local humidity data 
(see, e.g., Reference [159]) and pressure and tem- 
perature data at cloud base with expected 100% 
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Figure 12.29 Example of Raman lidar measurement of water vapour in the atmosphere [157]. 


humidity (see, e.g., Reference [160]). Cloud liquid 
water content has also been an important study 
[161] for Raman lidar; liquid water has a broadened 
spectrum ~7 nm wide shifted about 5 nm from the 
narrow vapour return. Interference filters ~4nm 
wide readily separate the two signals. 

Another important application of Raman lidar 
has been in measuring aerosol extinction [162,163]. 
By measuring the Raman N, signal at the shifted 
wavelength and the backscatter signal (molecular 
and aerosol) at the laser wavelength, the height- 
resolved aerosol extinction o,,, and the extinction 
to backscatter ratio o,,,/B(a),.. can be determined 
(see, e.g., Reference [164)]). 


12.6.3 Fluorescence lidar 


Fluorescence lidar can be applied in several areas: 
for detection and identification of organic and bio- 
logical materials on surfaces (by land or water), 
for detection of fluorescent dye particles (FDP) 
released as tracers in the atmosphere and tempera- 
ture and concentration measurement of atoms at 
very high altitudes. 

For organic and biological materials, two diag- 
nostics may be employed. First, the fluorescent 
wavelength edge above which (as the laser source is 


tuned to longer wavelengths) the material no longer 
exhibits fluorescence. Second, there is the charac- 
teristic (usually quite broad band) spectrum of the 
fluorescence itself. The technique has been applied 
to pollution studies of, for example, oil spillages 
(accidental or deliberate) at sea with the possibility 
of large area, rapid surveillance by aircraft. Similarly 
the technique may be applied to biological studies 
of water—certain algae and simple life forms have 
characteristic fluorescence spectra. Early detection 
and identification can be achieved and monitoring 
of development as, for example, in the phenom- 
enon of algae blooms. The use of FDP tracers in the 
atmosphere has been described by Uthe [165] (see 
also Reference [166]). Organic resin particles of ~2 
pm diameter containing a fluorescent orange dye 
have been used. In one example, a parcel released at 
2.3km altitude was tracked (by airborne lidar) for 
over 8h and 200km passage [165]. 

The first lidar measurement of sodium high in 
the mesosphere was made as long ago as 1969. This 
region is in fact difficult to study by other tech- 
niques, being inaccessible to balloons, aircraft and 
most satellites. It has many interesting features 
containing the coldest part of the atmosphere—the 
mesopause, metal layers formed by meteor ablation 
and polar mesospheric clouds (PMCs). Knowledge 
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Figure 12.30 Example of fluorescence lidar measurements of temperature and Na density high in the 


atmosphere [170]. 


of the temperature, density and their variability (as 
a function of season, latitude and solar cycle, etc.) 
is important for many geophysical phenomena. 
Several fluorescence lidar have been built variously 
operating at resonance wavelengths for Na, Ca, K 
and Fe. Typically, a range of dye lasers pumped by 
Nd-YAG have been used with pulse energies in the 
range 5-100 at typically ~20 Hz. All solid-state 
generation of the Na D, resonance line at 589 nm 
has also been achieved by sum-frequency mixing 
Nd-YAG 1064 and 1319 nm radiation with lithium 
niobate or triborate non-linear crystals. 

For relative concentration measurements, direct 
detection may be employed with narrow band fil- 
ters (typically ~1 nm) to reduce background light. 
It is also possible to simultaneously measure den- 
sity and temperature and a number of techniques 
have been developed [167-169]. For sodium, two or 
more laser frequencies are transmitted in turn, one 
exactly at the peak of the Na D, line and the other 
shifted 800 MHz by an acousto-optic modulator. 
The relative scattering at the two wavelengths gives 
a measurement of the temperature. An example 
of temperature and Na density profiles in the alti- 
tude range 80-105km is shown in Figure 12.30 
[170]. For iron, the technique relies on the tem- 
perature dependence of the relative scattering for 
two closely spaced Fe atomic transitions at 372 and 
374 nm. These two lines originate on the lowest 


atomic ground state and an upper sub-ground 
state about 416 cm" higher; their population and 
hence the ratio of scattered light is determined by 
Maxwell-Boltzmann statistics in thermal equi- 
librium. Generation of the required radiation 
has been achieved with independent, frequency- 
doubled and injection-seeded pulsed alexandrite 
lasers. Several equipments, ground and airborne, 
have now been built for mesospheric studies with 
routine measurements over extended periods, 
and more concentrated trials for expected meteor 
showers. Figure 12.31 shows a record of such an 
event [171]. 


12.7 LIDAR AND MILITARY 
APPLICATIONS 


Many think that in some way they 
(the Martians) are able to gener- 
ate an intense heat in a chamber 
of practically absolute non-con- 
ductivity. This intense heat they 
project in a parallel beam against 
any object they choose by means 
of a polished parabolic mirror of 
unknown composition... 


H G Wells: The War of 
the Worlds (1898) 
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Figure 12.31 Record of a meteor ablation trail made with a Fe resonance lidar [171]. 


12.7.1 Introduction 


In the popular mind, lasers probably represent the 
ultimate science fiction offensive weapon. After 
much media hype through the late 1980s, fueled by 
the United States’ Strategic Defence Initiative (pop- 
ularly known as “Star Wars”), the injection of very 
large funding and remarkable computer graphics, 
it is true, for example, that a leading US defence 
company claimed in 1996 to be able to build an 
airborne laser cannon that would destroy enemy 
missiles at ranges of several 100km. This perhaps 
neatly concludes the requirement, allegedly laid 
down in the 1930s by the British War Office to early 
proponents of radio waves, that their “death ray” 
should be capable of killing sheep at a few miles. 

Just as such early studies led to radar and its 
enormously important impact on the Second 
World War, the remote sensing capabilities of lasers 
are now becoming increasingly important. In fact, 
it may be said that later sensing is one of the few 
truly new (non-nuclear) military techniques intro- 
duced since the Second World War. Even modern 
thermal imaging and its immense capabilities had 
its genesis in the basic work and crude systems 
introduced in the late 1930s (see, e.g., R-V. Jones, 
Most Secret War, 1978). 

In reviewing laser radar applied to military 
tasks, one can distinguish three successive phases. 
The first phase consists of relatively simple, direct- 
detection methods for very successful range find- 
ing, target marking and laser beacons employing 
near-IR ruby and Nd-YAG lasers. The non-eye-safe 
character of these laser systems gives problems of 


troop training in realistic conditions, and this has 
led in a second phase to the introduction of lon- 
ger wavelength lasers such as CO, and Ho-YLF for 
basically similar tasks. In these phases, the laser 
is not much more than a bright light source. The 
third phase, which employs both advanced direct- 
detection and coherent techniques and exploits the 
full laser capabilities for wind measurement, target 
identification, chemical sensing, imaging, etc., is 
potentially of considerable military significance in 
the 21st century. Before outlining this work, how- 
ever, it is worth reviewing the strengths and weak- 
nesses of lidar in comparison with other military 
sensing technologies. 

In many ways, lidar is very complementary to 
thermal imaging. In particular, CO, lasers at 10 pm 
wavelengths lie in the middle of the 8-13 ym band of 
longer wavelength thermal imagers, and thus their 
atmospheric penetration and range performance is 
likely to be very similar. On the other hand, passive 
thermal imagers, with rapidly scanned multi-ele- 
ment detectors, provide excellent surveillance over 
quite large fields of view. In contrast, lidar, with 
few or single-element detectors and the very pre- 
cise pointing capabilities of a diffraction limited 
laser beam, seems better suited to detailed inter- 
rogation and investigation of small regions iden- 
tified as “interesting” by other means. Compared 
with longer wavelength centimetric radar, lidar 
has considerably poorer transmission, at least in 
the lower atmosphere, and thus most battlefield- 
type tasks have been confined to tactical ranges of 
a few tens of kilometres at most. This, of course, 
no longer holds true high in the atmosphere, and it 
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is also worth appreciating the limited penetration 
of millimetric radar through, for example, heavy 
rain (due to large scattering from millimetric- 
size rain drops). Covertness is another important 
aspect; it is quite difficult for a potential target to 
detect that it is being illuminated with a compara- 
tively low power, often short-pulse, laser beam, and 
much attention has been devoted to this. However, 
radar, unless used with great discretion, has been 
described as a beacon signalling to its adversary 
“here I am—come and hit me”. Finally, it is worth 
pointing to a few unique military capabilities of 
lidar; these include, for example, wind sensing, 
detection of very small “targets” such as wires and 
cables and remote chemical detection. 


12.7.2 Range finding, target marking, 
velocimetry, etc. 


Simple rangefinders based on short-pulse, flash- 
lamp pumped, solid-state lasers (ruby, Nd-YAG) 
were amongst the first developments of military 


(a) 


hardware. Using the elementary timing principle 
(Section 12.2.1, Figure 12.2c), very compact, robust 
and efficient (often battery-operated) systems were 
rapidly instituted at the infantry rifleman level, 
for tank rangefinders and also for aircraft opera- 
tion. In a clear atmosphere, reliable operation over 
several kilometres range is easily obtained with a 
few metres accuracy corresponding to ~10ns pulse 
length. In fog and smoke, longer-wavelength lasers 
generally have superior performance with the 
added advantage of eye safety. Over the years, very 
extensive studies have been carried out on relative 
effectiveness under varied conditions of humidity, 
fog, smoke, camouflage, target characteristics, etc., 
and laser ranging systems of many different levels 
of sophistication have been developed. For exam- 
ple, scattering from a smoke-screen laid down in 
front of a target is likely to give a strong signal sim- 
ilar to that from the obscured target itself (suppos- 
ing that the laser beam can adequately penetrate 
the smoke). Logic that selects this later pulse sig- 
nal from the target rather than the smoke signal is 
required (Figure 12.32a). 
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Figure 12.32 (a) Schematic of a laser rangefinder with signals from different ranges. Last-pulse logic 
is required to select the correct return from the target. (b) Schematic of guided missile homing on 
light scattered from the target marked by an incident laser beam. 
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In the related technique of target marking, the 
selected target is illuminated with a laser. Scattered 
radiation may then be picked up by a lidar receiver 
housed in a guided bomb or missile and provides 
information that directs in onto the target (Figure 
12.32b). These methods showed their effectiveness 
in the 1970s in the Vietnam War; many unsuc- 
cessful attempts had been made to destroy an 
important bridge by conventional bombing which 
was eventually knocked out first time by a laser- 
guided device. More recently, the sophisticated 
development of such laser-guided missiles (LGMs) 
first impacted on the popular consciousness with 
extensive newsreel coverage of missiles apparently 
penetrating ventilation shafts, bunkers, etc., dur- 
ing the 1991 Gulf War. 

For rangefinding, the use of a short pulse is but 
one extreme of coding the transmitted radiation. 
Many other forms of code may be used to acquire 
range or equivalent time-delay information, just 
as in longer-wavelength radar. These include 
amplitude modulation (AM) with, for example, 
a pseudorandom on-off code; auto-correlation 


of the strength of the return signal with the AM 
code itself gives it a peak response at a time inter- 
val t; corresponding to the range to the target and 
back. Alternatively, frequency modulation (FM) 
of the transmitted beam may be employed and 
has the potential advantage of providing Doppler 
velocimetry information about the target, as illus- 
trated in Figure 12.33a with a linear chirp pulse 
of duration At, and frequency modulation f.. The 
return signals of a stationary target and moving 
target (with negative Doppler shift f,) are shown. 
Signal-processing techniques of pulse compres- 
sion ensure that the range resolution in this case 
is not determined by pulse length At, but rather 
by the rate of change of frequency (the chirp rate). 
Some compromise of range and Doppler fre- 
quency resolution is required but, typically, 10m 
range resolution and 1m s" velocity resolution 
has been demonstrated in reasonably compact 
coherent systems at ranges up to 10km. The sign 
of the Doppler shift may be established by use of 
the successive up and down chirps, as illustrated 
in Figure 12.33a. 
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Figure 12.33 Coding of laser beams for simultaneous range/Doppler measurement. (a) Up/down lin- 
ear chirp pulses of duration At,, separation t, and frequency modulation f.. (b) Frequency modulation 
of a continuous wave beam (FMCW). For a distant stationary target, the range is proportional to the 
frequency difference f, of the outgoing beam and return signal as shown. 
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Frequency modulation of a continuous wave 
beam (FMCW) may also be used, as illustrated 
in Figure 12.33b. With the delayed return signal 
beating against the outgoing waveform as shown, 
simple spectral analysis gives a frequency shift 
proportional to the range to the distant (station- 
ary) target. For a moving target, this splits into two 
frequencies separated by +f,. In practice, careful 
consideration of the various parameters indicated 
in Figure 12.33b of FM, code length and the tar- 
get properties of range, velocity, etc., are required, 
including the problem of range-Doppler ambiguity. 
Nevertheless, several systems have been developed 
demonstrating high performance for simultaneous 
measurement of range and target velocity. 


12.7.3 Micro-doppler and vibrometry 


The bulk motion of a distant object may provide 
useful military information, as is demonstrated 
in many combined rangefinders and Doppler 
velocimeters. In addition, the information due to 
very small movements and vibrations within an 
object may potentially be of value in both civil and 
military fields. For a simple surface vibrating sinu- 
soidally with amplitude a and frequency f, the dis- 
placement a (sin 2zf,t) gives a peak speed of +2nf, a. 
From Equation 12.5, the peak micro-Doppler shift 
rises to +4nf,a/A; with typical values of a=20pm, 
f,=200Hz, the peak speed is +5.03 x 10?m s7? 
and for A=10.6 pm the micro-Doppler shift is 
only +9.51kHz. Such a shift is readily detected 
(to provide the amplitude) but, more importantly, 
techniques of frequency demodulation may be 
employed to give the underlying vibrational fre- 
quency f, The thrust of much recent work has been, 
on the one hand, to develop the lidar systems for 
such measurement (coping with objects which may 
also be in bulk motion) but, on the other hand, to 
establish how characteristic and reproducible the 
vibrational frequencies are from particular targets 
and whether they can provide a key to successful 
identification. It is well known, for example, that 
their precise blade rotational frequencies are spe- 
cific to particular types of helicopter and provide 
a means of identification. In practice, the problem 
is extremely complex; within a vehicle, for exam- 
ple, the driving frequencies originating at the pri- 
mary power sources of engines, electrical motors, 
etc., are coupled through all manner of nonlinear 
mechanisms to the outside panels and frames, etc., 


from which the laser radiation is scattered. These 
outside surfaces are likely to have their own reso- 
nant frequencies and are further influenced for a 
ground vehicle by the terrain—gravel, mud, tar- 
mac, etc., over which it is proceeding. Nevertheless, 
some success is being achieved and has been dis- 
cussed in open literature. At the very least, the state 
of readiness of distant vehicles and, for example, 
the difference between live, active tanks and dead, 
knocked-out tanks or decoys is detectable. As a 
further example, it should be possible to match the 
radar techniques of aircraft identification (from 
signals reflected from turbine blades within the 
jet engine itself) with lidar methods taking signals 
reflected from the outer airframe skin. 


12.7.4 Target imaging 


The military lidar capabilities discussed to date of 
range finding, velocimetry and vibrometry may be 
considered as useful sensing discriminants in the 
broader task of providing a target image. These and 
various other discriminants available by electro- 
optic laser techniques are indicated in Table 12.12. 

The point is that, while one is accustomed to 
interpreting visual grey-scale or color images, any 
discriminant that varies across a scene may be 
used to build an “image” of that scene. This may 
itself provide valuable information about an object 
(the “target”) and a means of picking it out against 
the background. 

Of the nine discriminants listed, the first five 
all relate to various aspects of the target region 
reflectivity and the signal intensity as potentially 
measurable in either a direct detection or coher- 
ent lidar. Under laser illumination, the scattering 
characteristics of different surfaces—vegetation, 
metal, paint, concrete, soil, grass, camouflage, 
etc.—vary quite widely under changing conditions 
of illumination (angle of incidence, polarisation, 
wavelength, etc.), as indicated in Figure 12.34. 
The term “glint” refers to the very strong, almost 
retro-reflection, signal that may occur on succes- 
sive reflections from smooth surfaces (e.g., around 
a window frame or wheel arch) and is very char- 
acteristic of the presence of a man-made object as 
opposed to natural vegetation and soil. This is an 
extreme example of the difference of signal fluctu- 
ations as a laser beam is moved over different sur- 
faces, e.g., concrete or brick compared with grass. 
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Table 12.12 Remote sensing discriminants for creating an image of the target region 


Scattering reflectivity/intensity from strength of signal 


Change of intensity at different wavelengths 


Change of intensity at different polarizations 


Occurrence of “glint” signals—high intensity retro-reflections 


Statistical character of signal fluctuations 
Range as set in different range intervals 
Doppler frequency shift—bulk velocity 


Micro-Doppler—frequency modulation (FM) vibrometry 


Signal amplitude modulation (AM) 


These and the other four discriminants listed 
in the table—range, bulk velocity, vibration and 
any amplitude modulation (indicative of moving 
machinery for example)—may be used to create 
and active “image” of the target region with suitable 
color coding, highlighting of significant regions, 
etc. The pixel size or spatial resolution in the image 
will be determined by the usual optical param- 
eters—the diffraction limits and atmospheric 
refractive turbulence—and at 10 pm should pro- 
vide a good match to co-located thermal imagers 
surveilling the passive scene. However, it is worth 
noting the prime military imperative of speed of 
measurement and the impact that has on optical 
scanning arrangements and the lidar dwell time on 
each pixel element. For passive imagers, whether 
thermal or visual, all pixel elements in the scene 
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are radiating/reflecting energy simultaneously and 
the angle of view and overall speed of measure- 
ment is ultimately determined by the number of 
individual elements (each viewing one pixel) that 
can be packed into the detector. With an active 
lidar system, maximum sensitivity is achieved by 
illuminating each pixel sequentially with the laser 
beam, making the measurement and moving on 
to the next pixel. Necessarily, this is likely to be a 
slower process and reinforces the utility of lidar 
for provision of unique information from detailed 
interrogation of small regions as opposed to large 
area surveillance. The alternative of floodlighting 
a region, across many pixels, with a high-power 
expanded laser may be feasible in some circum- 
stances but obviously mitigates against low-cost, 
low-power requirement and covertness. 
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Figure 12.34 Example of the active laser radar (10 um) and passive thermal imager (8-12 jm) con- 
trasts to be expected between various targets and standard background (supposed at 300 K) tem- 
perature emissivity 0.999 and reflectivity €=10-3). The large variation can be attributed to different 
angles of incidence, degree of moisture, etc. (From Steinvall, O. et al., SPIE 300, 104, 1981.) 
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Figure 12.35 Range and reflectance imagery with a 1.047 ym direct detection lidar at 2km, collected 
with the Hercules Corporation system. Much detail is lost in transcription from the original false color 
imagery. (Osche, G. R. and Young, D. S., Imaging laser-radar in the near and far infrared, Proc. IEEE @ 


1996 IEEE.) 


A number of lidar images are shown in Figure 
12.35 and illustrate the potential. As always, exten- 
sive deployment into the armed forces requires 
that such systems are made cheap, compact, reli- 
able and easy to operate and maintain. 


12.7.5 Wind measurements: 
Rocketry, landing aids, ballistic 
air drops 


As described in Section 12.4, rapid measurement 
of wind field with good spatial resolution at ranges 
between a few tens of metres to many kilometres is 
unique to laser radar. In consequence, it is not sur- 
prising that such capabilities have been extensively 
explored for military tasks. An immediately obvi- 
ous area is the impact of wind on accuracy of muni- 
tions. Rockets, in particular during the early stages 
of flight, are travelling comparatively slowly and 
may be quite strongly influenced by the local wind 
conditions. For example, measurement of flow and 
downwash under helicopters has been measured by 
lidar in order to improve delivery point accuracy 
of rockets fired from such platforms. Very large 
ground-based rocket systems are also influenced 


by the wind; some attempt to allow for this may be 
made by applying a correction in the fire control 
system. Conventionally, this may be a forecast wind 
or from a distant radiosonde measurement, possi- 
bly made several hours and tens of kilometres away. 
With the vagaries introduced by local terrain and 
natural wind fluctuations, the remaining uncertain- 
ties may contribute appreciably to the budget error 
for the weapon. For example, a compact coherent 
CO, lidar was developed in the late 1980s for the US 
Army Multiple Launch Rocket System (MLRS). The 
cw lidar would map the wind field at a distance of 
100m and it was said it could be produced, in vol- 
ume production, at a competitive cost. 

The potential of wind measurement as a landing 
and take-off aid has also been described. This could 
assist the landing of helicopters and fixed-wing 
aircraft on ships with monitoring of the complex 
wind flows around the landing deck. Such sys- 
tems could also provide real, full-scale measure- 
ments around such structures and, together with 
wind tunnel testing and CFD modelling, assist in 
improved ship and platform design. 

An airborne lidar system that profiles the wind 
field below the aircraft has recently been described. 
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Parachute delivery of cargo and conventional bomb 
drops without guidance control are obviously much 
affected by the prevailing winds. If these are known, 
the release point can be adjusted to compensate. 
However, such information may not be available or 
easily attainable in remote or hostile locations. A 2 
pum lidar system based on a flashlamp-pumped Cr, 
Tm:YAG laser was built by Coherent Technologies 
and flown from the USAF Wright Laboratory. The 
Q-switched laser gave 50 mJ pulses at 7 Hz repetition 
rate and the system was easily installed in a C1411 
aircraft. The scanner mirror mounted on a wedge 
produced a 20° x 28° (half angle) elliptically coni- 
cal scan pattern with minor axis along the aircraft’s 
longitudinal axis. Signal processing limited the first 
test in June 1995 to a range of 4.3km, equivalent 
to a vertical distance of 3.8km (15,000ft) with the 
scan angles employed. Excellent agreement in the 
comparison of wind profiles measured with the air- 
borne system, two ground-based lidars (one 21m 
and one CO, 10pm) and a radiosonde was found. 
Further studies assisted in a high-altitude bombing 
test above 30,000 ft (8 km) by B-52 aircraft. Using a 
composite wind profile, the bombers could correct 
for the wind and substantially improve their drop 
accuracy. 


12.7.6 Chemical detection 


The DIAL technique described in Section 12.6 sug- 
gests the possibility of detecting the poison gas 
and toxic nerve agent materials that might be used 
in chemical warfare (CW). The problem is that 
such materials may be chemically quite complex 
with rather broadband regions of absorption and 
transmission that makes positive discrimination 
and identification difficult. This contrasts with the 
detection of simple atmospheric constituents and 
pollutants with generally narrow absorption lines, 
as discussed in Section 12.6. In the latter case also, 
the lidar would usually be built with the aim of 
detecting certain well-defined chemical species; in 
practical operation, the equipment would be set to 
concentrate on very specific materials. In the mili- 
tary case, however, a range of toxic materials might 
be present and the lidar must be capable of seek- 
ing and detecting all of them without any prior 
indication of which is actually there. Nevertheless, 
the DIAL technique has been seriously examined 
and the US army at least has developed a CO,- 
based chemical detection system. With isotopic 


lasers, the overall tuning range may extend from 
9.1 to ~11.8p1m, which offers a reasonable band of 
frequencies for discrimination in the longer-wave- 
length region. Solid-state lasers based on non-lin- 
ear optical materials may also offer broad tuning 
ranges in wavelengths up to ~5 ym. 

Finally, it is worth noting that some attention 
has been given to the possible remote detection by 
lidar of materials used for biological warfare (BW). 
In this case, discrimination against similar materi- 
als (pollens, spores, etc.) occurring naturally in the 
atmosphere would seem very difficult. However, it 
has been suggested that an airborne lidar search- 
ing for anomalies and discontinuities in the atmo- 
spheric aerosol scattering (Section 12.4) might 
provide an indication of the possible release of such 
BW material at several tens of kilometres range. 


12.7.7 Terrain following and obstacle 
avoidance 


Aircraft flying at moderately low level are extremely 
vulnerable to fast, agile missiles fired from the 
ground. One solution is to fly at even lower level 
to provide an adversary with less warning of 
approach. Such “nap-of-the-Earth” operation 
increases the obvious dangers of failing to clear the 
ground or flying into natural or man-made obsta- 
cles. Laser radar with its capability for very rapid 
measurement, fine pointing and range resolution 
has the potential to provide a valuable navigation 
and obstacle warning aid, as illustrated in Figure 
12.36. Several lidar systems have been developed 
worldwide to explore this potential. 

Figure 12.37 shows a terrain map generated 
by a CO, lidar developed in France and flown on 
a Puma helicopter. In this case, the lidar was an 
FM/CW coherent system with the beam directed 
into a pair of rotating prisms that generated a scan- 
ning rosette pattern. This pattern was composed of 
4000 dots or pixels, each one corresponding to a 
pulse from the signal processing. The maximum 
scanning rate was ~2.5 Hz. An Anglo-French tech- 
nical demonstrator programme was developed 
with acronym CLARA (Compact Laser Airborne 
Radar). This advanced 10 pm pulse heterodyne sys- 
tem was trialed on both a fixed-wing Tornado air- 
craft anda Puma helicopter and followed the earlier 
LOCUS (Laser Obstacle and Cable Unmasking 
System) developed by two groups within the GEC 
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Figure 12.36 Potential obstacles in the path of a very low-flying aircraft that might be detected by a 


rapidly scanning laser radar. 
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Figure 12.37 A terrain map, 10 x 7km generated by a scanning laser radar (coherent FMCW CO.) 
flown on a helicopter. (From Stephan, B. and Metivier, P., Active Infrared Syst. Technol., SPIE-806, 


110-118, 1987.) 


Marconi Company and flown on A6-E, HS748 and 
Tornado aircraft. As indicated in Figure 12.38, 
the technology to detect power cables and display 
them to the aircrew was demonstrated. In addition 
to the two primary roles of obstacle warning and 
terrain following, CLARA was also designed for 
targeting and true air speed measurement. A pod- 
mounted configuration was adopted with complex 
optical scanner assembly to address several quite 
different modes of operation. Detection, classifica- 
tion and real-time display of obstacles, including 
various cables, masts, buildings and trees, must 
be achieved in daylight, at night and in adverse 
weather. In order to ensure that suitable warnings 


continue to be provided when the aircraft is turn- 
ing, a large field of regard, the size of which is gov- 
erned by the flight envelope of the host aircraft, 
must be adopted for the sensor. 


12.7.8 Star wars 


In the original Strategic Defence Initiative (SDI), 
it was envisaged that lasers would have a central 
role, both as weapons for destroying enemy mis- 
siles and as sensors of such missiles. An important 
aspect of such sensing is to distinguish between 
decoy missiles and those actually containing war- 
heads. A number of investigations in the United 
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Figure 12.38 Taken from a video record made with the LOCUS CO,,,,. heterodyne lidar, showing the 
lidar returns from cables. Much of the detail is lost in reproduction. (From Hogg, G. M. et al., Conf. 
Proc., 563, 20, 1995.) 
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Figure 12.39 Artist's conception of the Firepond wideband imaging laser radar at the Lincoln 
Laboratory. Laser beams are transmitted from a 1.23m diameter mirror Coude telescope with central 
abscuration of 0.21m. (Melngailis, |. et al., Laser component technology, Proc. IEEE @ 1996 IEEE.) 
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Figure 12.40 Schematic of the laser radar imaging during the SDI Firefly tests on missile detection in 
1990. (Melngailis, |. et al., Laser component technology, Proc. IEEE @ 1996 IEEE.) 


States have been reported in the open literature of 
the technical problems of laser pointing, tracking 
and examining high-flying rockets and space vehi- 
cles (see, e.g., Reference [172]). Figure 12.39 gives 
an artist’s conception of the large installation of 
the Firepond Laser Radar Research Facility of the 
Lincoln Laboratory where much of this work has 
been done. In this equipment, several laser radars, 
but principally a large CO,-pulsed coherent system, 
were used in a succession of long-range measure- 
ments. In the early 1990s, the first range-Doppler 
images of orbiting satellites were collected at ranges 
of 800-1000km and were later extended to 1500km. 
The argon laser indicated in Figure 12.39 provided 
the source for a visible light tracker of reflections 
from the satellites. Two initial SDI experiments were 
conducted later in 1990. These FIREFLY experi- 
ments were designed to investigate whether a laser 
radar could discriminate between a ballistic missile 
warhead and an inflatable decoy. In the tests shown 
schematically in Figure 12.40, a rocket was fired 
from the NASA Wallops Island Flight Facility sev- 
eral hundred kilometres to the south, rising to an 
altitude of 560km in a 750s flight. Coherent CO, 
laser radar imaging was successfully conducted over 
the central 50s of the flight at a maximum range of 
740km. The prime target was a conical inflatable 


balloon made of carbon cloth. After approximately 
360s from launch, this was ejected in a small canis- 
ter, was subsequently inflated and then set spinning 
in different manoeuvres to present a variety of target 
views to the imaging laser radar. 

A second series of tests with acronym FIREBIRD 
(Firepond Bus Imaging Radar Demonstration) had 
the more ambitious objectives of investigating 
laser radar discrimination and countermeasure 
techniques. A high-performance booster rocket 
was used to deploy a dozen targets for study by sev- 
eral ground and airborne sensors along the east- 
ern seaboard of the United States. These included 
passive IR sensors, UHF radars and optical sen- 
sors as well as the Firepond laser radars which also 
incorporated a photon-counting Nd-YAG laser 
radar. Generally excellent results were reported 
and completed the investigation of laser radar 
discrimination techniques. The advanced electro- 
optic techniques developed for this programme, 
including laser stabilization to the sub-Hertz level 
at high-power, precision coding of laser pulses, and 
sophisticated signal processing and analysis rep- 
resent a remarkable tour de force. Demonstration 
of such precision tracking and imaging at over 
1000km ranges constituted at least a significant 
milestone in the history of laser radar. 
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12.8 LIDAR AND GEOSCIENCE 


For the present purposes, lidar as applied to geosci- 
ence can be discussed in several broad areas: rela- 
tively short-range measurements to the sea surface 
and sea bed (few tens to hundreds of metres); sea, 
earth and ice surface from aircraft at up to ~15km 
and finally ranging at hundreds to thousands of 
kilometres to satellite (and lunar) retroreflectors 
for studies of crustal dynamics, etc. These areas are 
outlined in turn. 


12.8.1 Airborne mapping of terrain 
and ocean shelf 


Ranging to the Earth’s surface has been touched on 
in Section 12.7.7. Extensive studies with airborne 
systems have been conducted by several groups. 
Typical pulse energies of a few mJ with Nd-YAG 
lasers at 1.06 jm operating at up to a few hundred 
Hz have been employed. Figure 12.41 shows a block 
diagram of the lidar instrument developed at the 
Goddard Space Flight Centre [173] and mounted in 
a high-altitude aircraft; the position of the aircraft 
was measured to sub-metre accuracy by use of dif- 
ferential GPS receivers. Typical footprint size, from 
the beam divergence of 2.5 mrad, amounted to 
~25m at 10km altitude. Problems of pulse waveform 
spreading and target signatures for such altimeters 
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have been considered by several authors (see, e.g., 
References [174,175]). Reported investigations 
include study of crater form structures and observa- 
tions of the Mount St. Helena Volcano [176,177]. 

Laser depth sounding and the performance 
of a Swedish system has been well reviewed by 
Steinvall et al. [178]. Several such lidars have been 
built worldwide, including, for example, in the USA 
[179], Australia [180] and also Canada, Russia and 
China. In typical operation from a helicopter, two 
pulsed laser beams are emitted simultaneously and 
scanned over the surface. The longer wavelength 
(usually 1.06 um, Nd-YAG) is mostly reflected from 
the water surface whereas the shorter wavelength 
(green 0.53 um, doubled Nd-YAG) better penetrates 
the water and may provide a return from the bottom 
layer. Such lidar bathymetry provides a most prom- 
ising technique for high-density, rapid sounding of 
relatively shallow waters typically 1-50m deep. In 
addition, its use can be established for other sens- 
ing applications, as noted in Section 12.6.3—e.g., 
algae/chlorophyll monitoring and oil slick detection 
and classification. Operating from a helicopter also 
gives access to areas difficult for ships, e.g., around 
small islands, reefs and inlets. In addition, the rapid 
survey rate, typically 10-100 km2 h-! depending on 
shot density, is particularly valuable. 

In evaluation of performance, the prime ques- 
tion is the depth penetration, as largely determined 
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Figure 12.41 Block diagram of the airborne laser radar built at Goddard Space Flight Centre [173]. 
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Figure 12.42 Schematic of water and seabed scattered return signals: (a) haze or “sea smoke”, (b) 
surface return, (c) scattering layer, and (d) seabed return. 


by water turbidity [181]. The maximum depth pen- 
etration Dy,,x is given (see, e.g., Reference [178]) by 


Dara = InP */Prrin V2G (12.29) 


with P*=S. c/H’, where S is a system parameter 
(derived from laser power, receiver aperture and 
loss factors), ¢ is the reflectivity of the seabed and 
H the lidar platform altitude. P.,;,, is the minimum 
power for detection of the bottom of the sea (usu- 
ally limited in daytime by solar background and 
at night by detector/electronic noise). The system 
attenuation G is a complex function of intrinsic 
water parameters—beam attenuation, scattering 
coefficients, etc. With typical lidar parameters, val- 
ues of GDy4x=3-6 can be achieved. It should be 
noted that a large increase in system parameter S 
only gives a modest increase in depth performance 
Dyyax due to the exponential attenuation expressed 
in Equation 12.29. Steinvall et al. [178] gave the 
example of a factor 10 increase in S only adding to 
Dyax by ~4m for G=0.3 m™ [log 10/(2 x 0.3) =4]. 

Optical attenuation in sea water may be related 
to the Secchi disk depth D,, which is the maximum 
depth at which a submerged white disk can be 
observed. Steinvall et al. [178] showed that values 
of Ds~10m gave Dyax~20-25m, and D,~30m 
gave Dy,x~25-35m for a state-of-the-art lidar 
bathymeter. 

Other important factors limiting system perfor- 
mance include strong winds (affecting helicopter 
flight path) and associated sea state (white and bro- 
ken water reduces accuracy and beam penetration). 


Low-lying cloud and fog obviously inhibit perfor- 
mance and the thin haze (often quite dense and 
sometimes called “sea smoke”) above the water may 
provide a false return for the surface itself. Within 
the water, narrow scattering layers and other inho- 
mogeneities may also provide spurious returns. 

Finally, the nature of the sea bed, i-e., dense veg- 
etation, rocks and slopes, needs evaluation. Figure 
12.42 showsa schematic ofa lidar return, as affected 
by these phenomena and requiring postprocessing 
and evaluation for extraction of water depth to the 
sea bed. An important consideration is the nadir 
angle of the scan, which would ideally be fixed to 
give constant slant range and angle. However, a full 
conical scan at fixed nadir provides inefficient cov- 
erage with more and less dense regions. Various 
scan patterns have been developed; a modified 
semi-arc scan, part conical, ahead of the helicopter 
gives more uniform coverage. Typically 100-200m 
swatch width is used with angle from nadir ~20°. 
For complete evaluation of the sea environment, 
lidar bathymetry provides a complementary tech- 
nique to be combined with acoustic and direct 
mechanical methods of sampling. 


12.8.2 Satellite and lunar ranging 


The evolution, technology and utility of long- 
distance, high-precision (near millimetre) rang- 
ing to retro-reflectors mounted on satellites (and 
the moon) has recently been well reviewed by 
Wood and Appleby [182]. The present brief out- 
line draws heavily on their report. The concept of 
satellite laser ranging (SLR) to improve geodetic 
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information was first suggested by Plotkin [183]. It 
was considered that accuracies at the level of a few 
centimetres would be required to observe interest- 
ing geophysical processes of the Earth’s crustal 
dynamics, etc. The first laser radar observations in 
the mid-1980s demonstrated metre-level precision 
and established the potential. Over the succeeding 
years, rapid developments of short-pulse lasers, 
sensitive detectors, dedicated satellites and accu- 
rate computer-guided tracking now give precision 
approaching a few millimetres, with more than 30 
active SLR systems in a worldwide network. 

For sub-centimetre precision, very short pulses 
are required. These are now provided by mode 
locking systems applied to Nd-YAG lasers giving 
a train of very narrow (typically ~30 ps) pulses, 
all equally spaced in an envelope several hun- 
dred nanoseconds long. Frequency doubling is 
employed to take advantage of high quantum 
efficiency detectors at 0.53 ym in the green. The 
envelope is approximately Gaussian shaped; in 
practice, either the most energetic pulse alone may 
be transmitted or this together with all the follow- 
ing ones. In the latter case, range ambiguity (attri- 
bution of which signal return to which pulse) is not 
a problem since the pulses are separated spatially 
by 2-3 m, whereas the range accuracy is better than 
lcm. Highly developed techniques are required for 
the elements of an SLR system and include 


e Transmitter and receiver telescopes: they have 
been coaxial, separate co-mounted and, most 
recently, separate, individually mounted and 
controlled [184]. Very high precision, com- 
puter-directed acquisition and scanning to 
follow the satellite path are required. 

e Detectors: very precise timing of the incoming 
signal is required. Solid-state devices offer signif- 
icant improvements over the photo-multipliers 
used in earlier systems. Single photon detection 
with low-energy systems is increasingly favored. 

e Filters: very narrow, oven-controlled, spectral 
filters, typically 0.1-0.2 nm wide are employed 
to cut down sunlight for daytime operation. 

e Retroreflectors: due to satellite motion, the 
onboard corner cube retroreflectors require to 
be slightly “defocused”. In order to ensure that 
the footprint of the reflected beam will cover the 
emitter/receiver station, one of the three mutu- 
ally perpendicular surfaces is given a slight cur- 
vature. Ideally, only one retroreflector would be 


mounted on the satellite but this gives problems 
of visibility from different directions. The ulti- 
mate solution noted by Wood and Appleby [182] 
would be eight corner cubes joined at their apices 
to give reflections equivalent to a reflection from 
the centre of mass of the satellites. 

e Satellites: the six dedicated geodetic satellites are 
typically small (~1 m diameter) and spherical, 
and placed in near-circular orbits at altitudes 
between 800 and 19,000km. They are variously 
suited to different scientific tasks—low orbit: 
high-frequency gravity terms, ocean tides, etc.; 
higher orbit: crustal motion, Earth rotation, etc. 


Several studies of accuracy and range corrections 
have been given (see, e.g., References [175,185]). 
The handbook article of Wood and Appleby [182] 
provides over 120 references to research articles in 
satellite laser ranging. One remarkable example 
was for the NASA LRS system based at Arequipa 
in Peru. This station was subject to a steady move- 
ment of the underlying Nazca plate causing an 
ENE motion measured at about 10mm per year. 
After the earthquake of June 2001, analysis of the 
dramatic shift in the data confirmed that the sta- 
tion had moved ~0.5 m to the south west. 

As to lunar ranging, a number of small arrays 
of retroreflectors were placed on the moon in the 
1960s and early 1970s. Four are in regular use with 
ranging from two trekking stations—at Mcdonald, 
Texas and Grasse in France. With the R-* range 
dependence of signal, ranging to the Moon is vastly 
more challenging and large telescopes (0.75 and 
1.5m) and high-power pulses (~150 mJ) are required. 

Several scientific tasks of astrometry can be 
undertaken, including accurate determination 
of the Earth and Moon via the tides. The Earth- 
Moon distance is increasing by 3.8 cm per year and 
the Earth is slowing on its axis by an increase in 
the length of day of about 2 ms per century. 


12.9 LIDAR IN SPACE 


12.9.1 Introduction—Why lidar in 
space? 


During the past 30years, many passive imag- 
ers and sounders have been placed in orbit for 
study of the Earth’s surface and atmosphere with 
capabilities extending across the visible, IR and 
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Table 12.13 The four candidate lidar systems for space considered by ESA in 1989 


A simple backscatter lidar—for measurements of cloud-top height, cloud extent and optical 
properties, planetary boundary layer and tropopause height, aerosol distribution—with wide 


applications to meteorology and climatology 


A differential absorption lidar (DIAL), providing high-vertical-resolution measurements of humidity, 


temperature and pressure 


A wind-profiling lidar with the unique capability of improved weather forecasting and global dynamics 


A ranging and altimeter lidar for very accurate measurements of surface features, including ground, 


sea and ice cap height for solid-earth studies 


microwave spectral regions. Active sensing with 
radar systems has also been undertaken, with 
very high-performance scatterometer and imag- 
ing radars carried on operational satellites such as 
ERS-2. Compared with ground-based or even air- 
borne sensors of all types, the prime advantage of 
space lidar is the prospect of rapid global coverage, 
including data from otherwise totally inaccessible 
regions. Compared with other spaceborne sensors 
(both passive and active), lidar techniques provide 
certain unique measurement capabilities, such as 
cloud-top and boundary layer height and global 
wind fields. Other lidar capabilities (e.g., pressure, 
temperature and humidity and ranging/altimetry) 
have the potential to complement and extend exist- 
ing methods and often provide improved height 
and spatial resolution, particularly compared with 
passive techniques. 

Against this background, several national and 
international space agencies have initiated feasibil- 
ity studies for lidars in space, culminating in sev- 
eral reports, including LASA (Lidar Atmospheric 
Sounder and Altimeter), Earth Observing System 
Vol Ud, NASA 1987; LAWS (Laser Atmospheric 
Wind Sounder), Earth Observing System Vol Ilg, 
NASA 1987; BEST (Bilan Energetique du Systeme 
Tropical), CNES, France 1988; Laser Sounding 
from Space—Report of the ESA Technology 
Working Group on Space Laser Sounding and 
Ranging, ESA 1989. This latter report focused on 
four lidar systems considered good candidates for 
space deployment; an extract from its executive 
summary is shown in Table 12.13. During the past 
15years, several evaluations of the detailed tech- 
nology and logistics of space lidars for specific 
tasks have been funded by the space agencies. In 
addition, various of the critical subcomponents, 
such as lasers, optical arrangements, lag-angle 
correction, etc., have been studied and space- 
qualifiable prototypes constructed. Progress to 


actual space deployment, however, remains slow, 
with the exception of the NASA Lidar In-space 
Technology Experiment (LITE), which flew on 
the Space Shuttle Discovery in September 1994, 
and very recently the Geoscience Laser Altimetry 
System (GLAS) launched in 2003. 

This and other steps towards spaceborne lidars 
are discussed in the following sections. Critical 
design factors for a space lidar obviously include 
the demands on spacecraft accommodation for 
mass, size, power, heat dissipation, cooling, point- 
ing and vibrational stability, etc., all for operation 
in a realistic orbit. Performance factors include 
ultimate sensitivity limit to weak levels of scatter- 
ing (largely determined by laser power and tele- 
scope aperture) and the geometric terms of range 
resolution (influenced by laser-pulse length), beam 
footprint size (beam divergence and the need to 
keep light levels within eye safe limits—including 
observation by ground telescopes) and the separa- 
tion of successive footprints (laser-pulse repetition 
frequency, beam scanning and spacecraft velocity). 


12.9.2 Backscatter lidar and LITE 


The science objectives and technology require- 
ments of a spaceborne, direct-detection, backscat- 
ter lidar were discussed in the LASA report listed 
above and also in a 1990 ESA report of the ATLID 
(Atmospheric Lidar) Consultancy Group. The latter 
in particular reviewed the potential for operational 
meteorology and  environmental/climatological 
research. The atmospheric features that should be 
detectable from such an ATLID backscatter lidar 
are illustrated in Figure 12.43 drawn from the 
report. For weather forecasting (numerical weather 
prediction (NWP)), ATLID data should be of direct 
help with clouds, boundary layer height, air mass 
discontinuities and frontal/convective activity. For 
climate studies, the information on clouds and 
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Figure 12.43 Atmospheric features of clouds and aerosols detectable by a simple backscatter lidar 
at various latitudes. (From ATLID Consultancy Group, Backscatter Lidar, the Potential of a Space- 

Borne Lidar for Operational Meteorology, Climatology and Environmental Research, ESA Specialist 
Publication, SP-1121, Noordwijk, 1990.) 
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Figure 12.44 Functional diagram of the LITE system. Light at the three wavelengths was directed 
down to the Earth from the two-axis gimballed mirror. Scattered light was collected at the lightweight 
receiver telescope of 1m aperture and passed via dichroic beam splitters to the two photomultipliers 
(PMT) and avalanche photodiode (ADP). Some of the 0.532 1m light was passed to the microchannel 
plate (MCP) quadrant detector, which provided an error signal for precise alignment of the transmit- 
ter and receiver system. (Winker, D. M. et al., An overview of LITE: NASA’s lidar in space technology 
experiment, Proc. IEEE @ 1996 IEEE.) 
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Figure 12.45 The LITE return signal at 0.5321m after offset correction showing the multilayered cloud 
structure associated with a tropical storm system over West Africa. Note the aerosol signal (yellow 
and red) on the left-hand side, particularly at the boundary layer. Ground signals are also obtained 
beneath the high-level thin cloud (16km, at centre of picture), but not below the thicker cloud layers 
on the right hand side. (Winker, D. M. et al., An overview of LITE: NASA's lidar in space technology 


experiment, Proc. IEEE @ 1996 IEEE.) 


aerosols and their effect on the Earth’s radiation 
balance should be a valuable contribution. 

As planned (see, e.g., References [186,187]), 
the LITE provided a valuable demonstration of 
the enabling technologies for such operational 
systems. In the 12-day shuttle mission, the LITE 
instrument performance was excellent and a large 
volume of data was generated [72]. Figure 12.44 
shows a block diagram of the equipment, which 
was mounted on a Spacelab pallet. The flash-lamp- 
pumped Nd-YAG laser operated in doubled and 
tripled frequency mode to give simultaneous out- 
put pulses at 1.064, 0.532 and 0.355 1m. The pulse 
repetition rate was 10 s! and pulse width was 27 
ns. The beam footprint at the Earth’s surface varied 
between 290 m (0.355 1m) and 470 m (1.06 pm) and 
successive footprints were 740 m apart. The mass of 
the instrument was 990kg and the average power 
consumption in lasing operation was 3.1 kW. 

Two of the fascinating records from this instru- 
ment are shown in Figures 12.45 and 12.46. These 
pictures provide some indication of the quality, 
depth and volume of information that could be 
available from an operational system, and indeed 
the extent of the signal processing, evaluation 


and transmission systems that would need to be 
set up to use it effectively. For meteorological and 
weather forecasting purposes, such information 
must of course be made available in good time to 
the operational centres. 


12.9.3 Global wind field 
measurement by lidar 


One of the earliest feasibility studies of a satellite- 
borne lidar for global wind measurement was 
sponsored by the USAF Defence Meteorological 
Satellite Program. In the early 1980s, this was 
followed by a hardware definition study, con- 
ducted by Lockheed, of a system given the acro- 
nym WINDSAT (Windmeasuring Satellite); 
another title current at this time was WPLID for 
Wind Profiling Lidar (see, e.g., Reference [103]). 
These have been followed by the reports noted in 
Section 12.9.1, all of which considered large, multi- 
scanning (conical or several axes) lidars. More 
recently, the ESA Atmospheric Dynamics Mission: 
Reports for Mission Selection (1999) was presented 
to the European Earth Observation community at 
a meeting attended by over 300 participants. Of 
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Figure 12.46 The LITE system serendipitously passed directly over the eye of typhoon Melissa. 
Satellite photographs were obscured by thin cloud. Nevertheless, the lidar was able to penetrate this 
and make measurements within the eye-wall down to the surface. (From Barnes, J. E. and Hofmann, 


D. J., Geophys. Res. Lett., 24, 1923-1926, 1997.) 


the four candidate missions, ADM was selected as 
the second Core Mission to be implemented. The 
lidar studies indicate the degree of importance 
attached to global scale wind measurement deriv- 
ing from the unique capabilities ofa Doppler Wind 
Lidar (DWL) compared with other spaceborne 
methods. For example, low-level wind strength 
and direction may be derived from radar scatter- 
ing from the sea surface where the wave height is 
largely determined by the surface wind. However, 
such low-level winds are not of the highest value 


for meteorological purposes. In the mid-levels, 
the winds derived from cloud-top motion are not 
always completely representative of the air motion 
in that region, and may also have some uncer- 
tainty as to relevant altitude. For the ESA ADM, 
the observational requirements have been chosen 
as shown in Table 12.14 and led to selection of a 
relatively simple single-axis system for the first 
demonstrator mission. The most suitable lidar to 
realize these goals has been extensively discussed 
with three possible candidates: 


Table 12.14 Observational requirements of the ESA atmospheric dynamics mission 


Observational requirements 


PBL 
Vertical domain (km) 0-2 
Vertical resolution (km) 0.5 


Horizontal domain 

Number of profiles (h-') 

Profile separation (km) 

Horizontal integration length (km) 
Horizontal sub-sample length (km) 
Accuracy (HLOS component) (ms~") 1 
Zero-wind bias (ms~') 

Windspeed slope error (%) 

Data reliability (%) 

Data availability (h) 

Length of observation data set (yr) 


Troposphere 
2-16 
1.0 
Global 
>100 
>200 
50 
0.7-50 
2 
0.1 
0.5 
95 
3 
3 


Stratosphere 
16-20 
2.0 
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(a) Coherent heterodyne, with CO, gas lasers in 
the 10 um band (aerosol scattering). 

(b)Coherent heterodyne, with solid-state slab 
lasers in the 1-2 um band (aerosol scattering). 

(c) Incoherent direct detection, at ~0.3-0.5 um, 
and Fabry-Perot interferometer/edge technique 
analysis (aerosol and molecular scattering). 


For the coherent systems, the laser beam must be 
transmitted in a collimated, single-mode beam. 
From a 400-500km orbit, the beam size, or foot- 
print, at the Earth’s surface would be of order 10m, 
which rules out a non-eye-safe laser of wavelength 
less than ~1.5 um. For the incoherent systems at 
shorter wavelength, the beam may be expanded to 
produce an enlarged footprint with energy density 
within eye-safe limits. For any system, the crucial 
questions derive from the backscattering character- 
istics of the atmosphere that determine the primary 
Doppler signal, whether from aerosols or molecules. 

As discussed in Section 12.4, molecular scat- 
tering varies uniformly and predictably whereas 
aerosol scattering is highly variable with occa- 
sional large increases due to volcanic activity 
(a suggestion for augmenting the scattering lev- 
els with a strategically placed device is shown in 
Figure 12.47). For both domains, a large, powerful 
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lidar is required and may be quantified in terms of 
the necessary ED’, where E is the total laser pulse 
energy (Joules per individual measurement) and D 
is the collecting telescope diameter (metres). For 
successful measurement at lowest levels of atmo- 
spheric scattering (both molecular and aerosol), 
analysis shows that ED? requires to be greater than 
~100 Jm?. With D?~ 1, for the heterodyne systems 
the available energy is best utilized in large pulses, 
ideally > 10J, whereas for direct detection lidars 
the energy may be distributed over many small 
pulses (see Section 12.3.4). 

Extensive modelling of performance for space- 
borne wind lidars has been conducted. Modeled 
performance for different levels of aerosol 
scattering—median, quartiles, deciles—correspond- 
ing to the values shown in Figure 12.8 and for a 
coherent heterodyne lidar operating with the instru- 
mental parameters listed is shown in Figure 12.48. 

Most recently, for the ADM demonstrator, ESA 
has chosen a direct-detection interferometric lidar 
operating at 0.35 pm; the basic optical layout is 
shown in Figure 12.49. In a novel arrangement, 
the narrow band aerosol (Mie) scattering would be 
separately analysed by a Fizeau or possibly a two- 
beam Michelson interferometer; the broadband 
molecular (Rayleigh) scattering, which is relatively 


Figure 12.47 A suggestion for augmenting aerosol scattering (6 in the atmosphere to improve the 
performance of the spaceborne Doppler wind lidar (ALADIN). (Courtesy of Rodolphe Krawczyk, 1994.) 
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Parameters used for the performance assessment 


OrbAlt = 400° km Orbit altitude 
6 = 30° deg Nadir angle 
A= 9.1° um Laser wavelength 
E = 22.5’ joule Pulse energy 
ne5 Number of pulses accumulated over 25 km horizontally 
D=0.8°m Telescope diameter 
w = 0.35° MHz Return signal bandwidth (HW@ exp(-0.5)) 
nl = 0.12 Overall instrument efficiency (optics transmission, 
overall heterodyne efficiency) 
Bn = 40° MHz Processing bandwidth 
AH = I’ km Vertical resolution 
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Figure 12.48 Calculated performance of a space-borne 9.1 pm coherent detection lidar with param- 
eters shown for median and upper/lower quartiles and deciles of aerosol scattering. Note that the 
accuracy (ms~) curves start at the lower left-hand side and the reliability curves (0-100%) start at the 
top left-hand side for the different levels of scattering. 


stronger at these shorter wavelengths, would be 
analysed by a double-edge Fabry-Perot etalon. 
Suggested performance is shown in Figure 12.50 
with an accuracy better than 2 ms” up to 11km 
altitude. An artist’s impression of this equipment 
ina satellite flying in a sun-synchronous polar orbit 
at an altitude of ~400km is shown in Figure 12.51. 
Processing of the backscatter signals will provide 
about 3000 globally distributed wind profiles per 
day, above thick cloud or to the surface in clear air, 


at typically 200km separation along the satellite 
track. Target date for launch is 2008. 


12.9.4 Chemical, ranging and solid 
earth studies 


As noted in Table 12.13, DIAL lidar is an obvi- 
ous candidate for measurements into the atmo- 
sphere from space (see, e.g., Reference [188]). The 
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Figure 12.49 Optical arrangement for the ALADIN direct detection lidar for space-borne Doppler 
wind lidar. The aerosol (Mie) and molecular (Rayleigh) scattered signal are measured in the two sepa- 


rate interferometer systems as shown. 


problems, however, are formidable: two or more 
laser wavelengths for each chemical species under 
study (ozone, water vapour, greenhouse gases, etc.), 
compensations for the Doppler shifts in the beams 
due to the large spacecraft velocity (typically 15° off 
nadir = 0.3 cm” shift), with the beams transmit- 
ted simultaneously and co-axially to ensure return 
signals from the same elastic scatterers. Much 
study has been put into such active remote sensing 
from space and DIAL lidars will almost certainly 
be flown in the coming years. However, passive 
techniques over a wide range of wavelengths are 
providing much information. For example, in the 
NASA Earth Observing System (EOS), the Aura 
mission to study chemistry and dynamics of the 
troposphere and stratosphere, the instruments 
include sounders for measurement of thermal 
emission from the atmospheric limbs (microwave 


and high resolution) as well as nadir viewing imag- 
ing spectrographs for ozone and trace gases. 

Lidar for geoscience has been outlined in Section 
12.8. Early studies for space-borne systems were 
given by Cohen et al. [189] and Bufton [175]. Very 
recently, in February 2003, the Geoscience Laser 
Altimetry System (GLAS) on board the NASA Ice, 
Cloud and Land Elevation Satellite (ICE Sat) was 
activated. This mission is intended to accurately 
measure ice sheet mass balance, cloud and aerosol 
heights as well as land topography and vegetation 
characteristics. Initial test of the space-bound lidar 
altimeter system was validated with data from 
the global Laser Reference System (LRS) noted in 
Section 12.8. The lidar operates on Nd-YAG at 1.06 
um; it is reported that the signal returns were as 
expected and the transmitted laser beam is very 
close to the bore sight. 
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Figure 12.50 Calculated performance, by Monte Carlo simulation for the ALADIN space-borne 
Doppler wind lidar. For this calculation, the main parameters were: a 1.1m diameter telescope, over 
700 shots a total pulse energy per measurement of 120J at 0.35 j1m wavelength and a detector quan- 
tum efficiency of 80%. Performance may be readily scaled by the Cramer-Rao relationship. The shift 
at 16km is due to a change in the required vertical resolution from 1 to 2km at that altitude in the 


atmosphere [196]. 


Figure 12.51 Impression of the in-orbit configuration for aspace-borne Doppler wind lidar showing 
the telescope pointing down at, ~35° from nadir [196]. 


12.10 CONCLUSIONS 


From the preceding papers cited, it should be 
clear that the past 30 years have seen remarkable 
advances in laser radar technology, applications 
and understanding. Much fine equipment has been 
installed in different centres worldwide and valu- 
able, often unique, measurements not accessible 
by other techniques have been made. However, 
there have also been significant disappointments, 
particularly in the wider deployment of coherent 


lidar equipments and techniques. This final sec- 
tion attempts a brief analysis of the situation and 
to draw some guidelines for the future. 

Table 12.15 offers a crude summary comparison 
of the history and development of four remote-sens- 
ing technologies. It is striking that, for microwave 
radar, thermal imaging and sodar, the first deploy- 
ment (defined as use by non-specialist scientists and 
technicians) took place within ~15 years, followed 
by widespread deployment (e.g., equipments com- 
mercially available off-shelf or bespoke) less than 
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Table 12.15 Comparison of remote sensing technologies 


First 
deployment Widespread 
Technology First research (a) deployment (b) Comment 
Microwave radar ~1935 “killa = ~1940—military >1942—aided by Enormous variety of 
sheep at 2 “needs a Ph.D. compact cavity capabilities, 
miles” in every set” magnetrons application 

wavelengths, 
equipment, etc. 

Thermal imaging 1930-1970 ~1975 ~1985 Widespread 
applications in civil 
and military domains 

Active acoustic sensors 1960-1970 ~1975 ~1985 Slow, limited range, 

(SODAR) low resolution etc. 

BUT relatively cheap 
and simple 

Direct detection lidar: = 1965-1975 ~1975 ~1980 Military and civil 

(a) temporal 1970-1985 ~1985 - ranging, target 

(b) spectral marking, cloud base, 
etc. Chemical 
detection (DIAL); 
limited Doppler 
applications 

Coherent lidar 1970-1990 ~2000 - Enormous range of 


applications 
demonstrated. Why 
not taken up outside 
research? 


10 years later. Examination of a radar supplier’s cat- 
alogue with the dozens of different equipments, all 
available off-the-shelf, for many varied applications 
working over a large range of wavelengths provides 
a salutary example to the lidar scientist. Even closer 
is the astonishing growth in the optical technology 
and applications for the telecommunications indus- 
try. However, for laser radar, apart from relatively 
simple tasks of military and civil ranging, target 
marking and possibly DIAL, widespread deploy- 
ment has scarcely been achieved. The question is 
why should this be so? Are more advanced lidar 
systems perceived as too difficult, too expensive, 
unreliable, dangerous, too uncertain, etc.? 

As scientists, we tend to believe the adage ‘build 
a better mousetrap and the world will beat a path to 
your door’. In the hard, commercial world, this is 
not true—a new technology has to be significantly 
superior to displace an older established one and 
is unlikely to do so if it is even marginally more 


expensive. Other relevant truisms that bear heavily 
on the planning, specification and design of equip- 
ments include 

(1) Defined as first use by non-specialist scien- 
tists/technicians. (2) Defined as widely available 
off-shelf (or bespoke). 


The “best” is the enemy of the “good”. 

The devil is in the detail. 

Small is beautiful—but is it achievable? 
Multifunction usually means, literally, good for 
nothing. 


Ignoring one or more of these is usually a recipe for 
disaster—as is very evident in a review conducted by 
the author (unpublished and probably unpublish- 
able!) of nine “failed” projects. These were all large, 
expensive (equivalent of several million pound 
sterling or more) projects funded during the past 
25 years by civil and military agencies across Europe 
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and the United States; all were intended to lead 
into wider development and even (in some cases) 
medium-scale production. The technical perfor- 
mances of the prototypes in these projects ranged 
from poor to very good indeed. For those in the lat- 
ter category, lack of ongoing success was basically 
due in one case to failure to sufficiently interest the 
end user (partly due to the inadequate data/display 
facilities available at the time) and, for the other, an 
alternative microwave radar technology was pre- 
ferred. In at least two projects, the basic properties 
of the phenomenon under investigation were poorly 
researched and understood (scattering strength, 
spectral form, etc.) so the lidars, when built to the 
set specification, could not provide the required 
information. In two others, the performance speci- 
fication became overly ambitious (the “best”) and 
multifunction leading to excessive size, complex- 
ity and cost, and poor overall performance. Other 
projects were bedeviled by arguments between lidar 
scientists over the appropriate choice of laser, not 
helped by inflated and premature claims from laser 
theorists and manufacturers as to the achievable 
pulse energy, prf, reliability, etc. In one bad exam- 
ple, basic principles of coherent operation were not 
properly appreciated at a high technical level, lead- 
ing to poor decision making. From this brief history, 
afew more truisms may be drawn: 


e A project will fail with too much technology 
“push” and not enough end user “pull”. 

e Bad science and over-promoted technology will 
breed disaster—but good science and technol- 
ogy will not necessarily win. 

e Beware “technology jocks” bearing gifts. 

e The commendation “more reliable” must mean 
“an in-depth engineering study of failure modes 
has been conducted” or it is worse than useless. 


One may thus draw a few guidelines (see Section 
12.5 in the present volume for a much more com- 
prehensive analysis): the good system and project 
development arises from a total holistic approach 
incorporating: 


1. A good understanding of the measurement 
problem with a realistic specification for the 
lidar based on reliable, available component 
technology. 

2. Good science/technology in the lidar design 
including 


Good fundamental science of direct detec- 
tion/coherent lidar. 
Translation into simple, serviceable optical 
design. 
Proper study of engineering failure modes. 
Good signal processing. 
Good data analysis and display which 
provide 
- Swift timely analysis 
- Informative graphics 
- And truly give the end user what he 
needs and can use 
This, of course, raises various broader questions 
including, for example: 


e Where is the basic lidar training? Few uni- 
versities conduct research with coherent 
lidar, which seems a little puzzling, particu- 
larly with the potential for laboratory-based 
physics applications of coherent heterodyne 
techniques. 

e Is there an adequate, compact and referable 
body of knowledge? An enormous body of 
research publications and reviews exists, but a 
more pedagogic literature is perhaps lacking. 

e How to involve and convince end-users? The 
bulk of lidar applications research is still done 
by lidar scientists who initially built demon- 
strators but then developed an interest in, e.g., 
the applications to meteorology, atmospheric 
chemistry or wake vortices, etc., occasionally 
managing to interest colleagues more directly 
involved in these fields. 

e This final point emphasizes that, without an 
enthusiastic body of such end-users (profes- 
sional meteorologists, environmental scien- 
tists, system engineers, etc.), who can envisage 
worthwhile application of a lidar to “their” 
problem, appreciable funding is not going to 
be forthcoming from the research councils and 
agencies or from industry. 


Lest this be considered too downbeat, an assess- 
ment of lidar must emphasize again some of the 
remarkable and unique achievements of laser radar 
over the past 30years: contributions to flow mea- 
surement, wind fields, atmospheric chemistry, 
ozone depletion, pollution dispersal, avionics, geo- 
science, etc., even apart from more military-ori- 
ented applications. Looking to the future, progress 
is likely to be rooted in: 
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e Laser developments: particularly of high power 
and efficient, agile-frequency CO, lasers, 
precisely specified off-the-shelf solid-state 2 
and 1m lasers, and diode and fiber amplifier 
lasers at ~1.5 1m with increased power and 
mode purity. 

e Lidar construction: reduction in cost and 
complexity should flow from monolithic 
waveguide-type construction at 101m and, at 
shorter wavelengths, much use of fiber guiding, 
components and optical techniques borrowed 
from the communications industry. 

e Detectors: very fast, high quantum efficiency 
(>80%), low-noise array detectors are becom- 
ing available at visible and UV wavelengths. 
Extension to the near-IR would have great 
impact, as would uncooled heterodyne detec- 
tors at 10pm. 

e Data analysis and display: the rapid, user- 
friendly access to data coupled with graphic 
display of results now offered by modern com- 
puting facilities provides a great advance for 
lidar studies and should always be budgeted for 
in a system programme. 


Finally, it is worth noting the encouraging pros- 
pects for global wind field measurements in the 
Atmospheric Dynamics Mission—ESA’s technol- 
ogy demonstrator, due to be launched in 2008, and 
at the other extreme the potential development of 
small, low-cost LDVs for flow measurement ahead 
of, for example, wind-turbine generators. 
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13.1 INTRODUCTION 

Optical data storage has a long history dating back 
to the 1960s but, with the compact disk initially, it 
became relevant to the consumer and industry. The 
success of the early laser disk indicated the possi- 
bility of data storage based on optical phenomena 
and materials as an alternative to magnetic stor- 
age. Optical storage offers reliable and removable 
storage media with excellent robustness and archi- 
val lifetime and very low cost. Today, optical disk 
technology covers a wide variety of applications 
ranging from content distribution to professional 
storage applications. One of the major application 
areas for optical storage disks is the secondary 


storage and 


13.4 Novel approaches in optical storage 526 
13.4.1 Beyond the resolution limit 527 
13.4.2 Frequency-/time-domain 

optical storage 528 

13.5 Three-dimensional optical information 
storage 529 
13.5.1 Multilayer optical information 

storage 529 
13.5.2 Holographic data storage 532 
13.5.3 Page-oriented holographic 

memories 532 
13.5.4 Bit-oriented holographic storage 534 


13.6 Potential impact of novel technologies 537 
References 539 


storage of computer data in personal computers 
(PCs) and computer networks. An optical storage 
system is a particularly attractive component of 
the data storage network because it provides fast 
data access times and fair storage capacities while 
serving as a link between different multimedia and 
computerized systems. Perhaps, the most enabling 
feature of optical storage is the removability of 
the storage medium that allows transportation 
and exchange of the stored information between 
desktop and laptop computers, audio, video play- 
ers, and recorders. In contrast to the flying head 
of a hard disk drive, there are separations of a few 
millimeters between the recording surface and 
the optical “head,” although active servo systems 
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enable dynamic recording and readout from a 
rotating disk. Consequently, the medium can be 
removed and replaced with relatively loose toler- 
ances allowing an optical disk to be handled in dif- 
ferent drives. 

The compact disk (CD) digital audio has been 
launched into the era of digital entertainment by 
Sony and Philips in 1982. CD has enjoyed unprec- 
edented success and universal support among 
electronic companies and hardware manufactur- 
ers. From its origin as the music storage medium 
for entertainment, CD has grown to encompass 
computer applications (CD-ROM), imaging appli- 
cations (CD Photo), and video game applications 
(CD Video). The CD-ROM drives became standard 
in PCs and CD-ROMs enhanced the efficiency 
of distribution and use of software, games, and 
video. These read-only disks containing 680 MB 
of information can be mass replicated by injection 
molding in a few seconds, and they are virtually 
indestructible. 

Significant advances in the enabling technolo- 
gies constituting the compact disk make it pos- 
sible to increase capacity in digital versatile disk 
(DVD) format with the ability to store an entire 
movie in high-quality digital video on a single 
disk. With the DVD, optical disk storage opens 
a new chapter for video and multimedia applica- 
tions. Laser optics, thin film, and disk replication 
technologies have made considerable strides in 
the last decade. Digital coding and compression 
algorithms have become more sophisticated; inte- 
grated circuits and drive mechanisms have also 
advanced. Furthermore, the intensive research 
on new optical materials and recording phenom- 
ena has resulted in the development of new tech- 
nologies. The CD-Recordable (CD-R), a recordable 
write-once system, and the CD-Rewritable (CD- 
RW) drives and media have been introduced to the 
optical disk market. With its recordable formats, 
either for single recording or for expanded record- 
ing capabilities as in the case of CD-RW, optical 
storage became an alternative to the established 
hard disk and floppy disk systems, which domi- 
nated the data storage industry. 

Versatility of optical storage is an additional 
feature of foremost importance for its applicabil- 
ity. Optical recording allows read-only, write-once, 
and rewritable (erasable) data storage. Read-only 
optical disks or read-only memory (ROMs) are 
suitable for distribution of digital contents. In 


read-only technology, the information is written 
on a master disk that is then used for printing the 
embossed patterns onto a plastic substrate. The 
printing process allows for rapid, low-cost mass 
reproduction making optical disks the media of 
choice for distribution of digital data. The write- 
once read-many (WORM) technology allows one 
to store permanently a large amount of data on a 
thin disk medium, to remove it, and to have fast 
access to it in any compatible optical drive system. 
Information stored on rewritable optical disks can 
be erased and rewritten many times. Present opti- 
cal drives are designed to handle different media 
formats at the same time—the data stored on 
read-only, WORM, and rewritable media can be 
accessed all in one unit. 

Optical information storage is based on laser- 
material interaction for writing and reading. More 
general, data storage is the conversion of raw infor- 
mation into physical changes in an appropriate 
recording medium. Data recovery is the recogni- 
tion of the stored information from the storage- 
induced changes. The physical processes applied 
to different storage technologies differ widely and 
define their performance limits and applicabil- 
ity. In conventional technologies, optical storage 
relies on data recording on a rotating, disk-shaped 
medium while recovery is done by optical effects. 
Typically, a focused light beam is used to interact 
with the physical structure of the data stored on 
the disk. The interaction between the read beam 
and the medium can be due to a number of differ- 
ent effects but it must modify some detectable light 
property. The light reflected from the medium is 
modulated by the stored data and is then directed 
to a photodetector that converts the optical signal 
into electronic signals. All conventional optical 
disk systems are based on the reflection mode but 
often use different recording and readout mecha- 
nisms. There is a variety of optical effects and mate- 
rials that have been investigated and employed in 
various approaches to realize an optical memory. 
Table 13.1 summarizes different recording mecha- 
nisms and corresponding data structure for read- 
out in well-established optical storage concepts. 

Further advances in optical memories with 
high storage capacity up to terabyte require 
research on new photoactive materials exhibiting 
strong laser-induced changes of their optical prop- 
erties. Such materials constitute a special class of 
nonlinear optical materials with optical properties 
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Table 13.1 Optical information storage: Recording mechanism 
and corresponding data structure in different approaches 


Recording mechanism 
Print 

Phase change 
Magneto-optic 
Photochromic 
Photorefractive 


Data storage/readout method 


Surface grooves 

Reflectivity change 

Polarization rotation 

Complex refractive index change 
Refractive index grating 


depending on the incident light intensity or energy. 
This chapter deals not only with the fundamentals 
of optical storage but also with novel approaches to 
higher performance storage. 


13.2 OPTICAL DISK STORAGE 
TECHNOLOGIES 


Removability of optical storage media is an attrac- 
tive feature but this has made the standardizing 
efforts more complex compared with magnetic 
storage. Standardization in optical information 
storage was first accomplished with the appearance 
of CD creating the standards for CD Digital Audio 
format in 1982. Five years later, Sony and Philips 
announced the standard for CD storage of com- 
puter data—the CD-ROM. The establishment of 
worldwide standards has made it possible to incor- 
porate writing technologies into CD systems. The 
CD-R is functionally compatible with the standard 
CD-ROM, even though the reflectivity changes on 
a CD-R are induced by physical processes different 
from the interference effect at the printed grooves 
on a conventional CD. 

Following the CD-R, CD-RW, the rewritable 
compact disk, has been introduced based on phase- 
change technology. Here, the reflective layers have 
two states, which differ sufficiently in reflectivity 
to be read optically. Table 13.2 gives an overview 
of CD technologies starting with the CD audio 
that was modified to allow new applications and 


later advanced to exploit the new technologies with 
CD-R and CD-RW. 

In general, an optical disk storage system con- 
sists of a drive unit and a rotating disk medium 
containing the information. The optical compo- 
nents are integrated within a small optical head 
allowing both optical recording and readout to be 
performed with the head positioned relatively far 
away from the storage medium, unlike magnetic 
hard drive heads. This allows the medium to be 
removable and effectively eliminates head crashes, 
increasing reliability. On the other hand, an opti- 
cal head is heavier and leads to slower access times 
when compared to hard disk drives. The data 
recorded on an optical disk are organized in tracks 
that might be a single spiral or concentric rings 
spreading out along the radial direction. 

In CD technologies, optical disks are either pre- 
recorded or preformatted with continuous grooves 
or a discontinuous groove structure that is needed 
to position the optical head on the disk. A laser 
beam focused to a small spot is used for record- 
ing and readout. Information is written bit-wise by 
modulating the properties of the recording mate- 
rial under illumination. During retrieval, recorded 
bits are detected as changes in some optical prop- 
erty of the light reflected from the disk. These 
changes might affect the amplitude, phase, or 
polarization of light, and are sensed by a detector 
in the optical head. The rotation of the disk as well 
as positioning of the pickup system is provided by a 


Table 13.2 Evolution of CD-based technologies from ROM to recordable 


and rewritable formats 


Disk format 
CD digital audio 


CD extra, CD interactive CD-ROM, video CD 


CD-R 
CD-RW 


Recording technology 
Read-only 
Read-only 
Write-once 
Rewritable 
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drive motor. In addition, the optical head contains 
a servo system based on some sort of optoelectronic 
feedback that is necessary to control the position of 
the focused laser spot on a rotating disk. 

The optical storage systems introduced first 
worked as write-once, read-many systems. The 
information is recorded on a WORM disk by 
applying the laser beam to change the reflectivity. 
On the first ablation type WORM disks, the write 
laser beam burns a hole or pit into a thin absorb- 
ing layer, that is, deposited aluminum. In the early 
WORM systems, different lasers have been used 
for writing and reading. In a single laser system, 
the laser power is reduced for readout so that the 
recorded pits can be read by detecting the reflected 
energy. Following the ablation media, WORM 
disks with dye polymer media have been devel- 
oped. Here, the laser beam induces a reflectivity 
change of dye polymer coating without material 
ablation. Rewritable storage has been first achieved 
in magneto-optical (MO) systems. The written 
information could be removed during an erase 
pass and thereby prepared for a new write cycle. 
To realize the MO systems, it was necessary to 
introduce new coating media and adapted optical 
systems. Consequently, there was no compatibility 
between WORM and MO storage media. 


13.2.1 Compact disk technologies 


The basic concept of an optical disk involves a 
reflecting layer modulated by the presence of pits 
that then switch the reflected signal by destruc- 
tive interference. In a CD-ROM drive, the data are 
read out by projecting a focused laser spot on the 
rotating disk, then detecting the reflected light. 
The detected signal fluctuates with the presence 
or absence of pits along the track corresponding 
to the bit-wise stored information. Similarly, the 
recorded pits of CD-R and CD-RW are read out 
by detecting laser light reflected back from the 
disk. On the simple CD-ROM, the information is 
printed as quarter-wave surface steps leading to 
zero reflectivity by interference. Tracks consist of 
discontinuous grooves or pits that are separated 
by lands, whereby the pit length is determined 
by the bit content stored in it. The light reflected 
from the disk will, therefore, be modulated by the 
pits. Readout is done by destructive interference 
between the light reflected from the pit and the 
light reflected from the land. In this way, the data 


content represented by a pit sequence is translated 
into the reflectivity changes that occur when this 
sequence is exposed to the laser beam. 

On a CD-ROM, pits are preformatted into 
the optical substrate (typically polycarbon- 
ate) that is then coated with a high-reflective 
alloy (e.g., aluminum or silver). The substrate 
itself entails the format that defines the physi- 
cal structure of the data recorded on the disk. 
Present optical disk technologies use different 
recording mechanisms and are also based on 
different disk formats. In the simplest case such 
as CD or DVD-ROM, the disk comprises a static 
physical structure represented by pits. Other 
disk formats can be categorized in two types, 
one of which relies on continuous grooves, and 
the other on wobble marks. Different disk for- 
mats also require different fabrication processes 
for mass production. The well-known mass pro- 
duction technique originates from the CD-ROM 
technology and is based on disk replication by 
stamping out copies of a disk master. This origi- 
nal disk is written using a short-wavelength 
laser that creates the pattern in a photoreac- 
tive material. The typical mastering process is 
based on similar photolithographic techniques 
as applied in the semiconductor technology. 
Exposing the photoresist yields the pit struc- 
ture, which allows for forming the so-called 
stamper. The final stamper comprises a metallic 
layer (usually nickel) with the inverted data pat- 
tern structure, which is then used for mass disk 
replication. In the production of ROM disks, 
stamping is followed by only two additional 
steps—coating with a reflective layer followed 
by a protective layer. In contrast, recordable and 
rewritable disks are more complex in structure 
as an optically active layer is needed for data 
recording. Depending on the technology, dif- 
ferent materials and layer stack configurations 
have been developed to provide both the func- 
tionality and compatibility. 

On the write-once CD-R, the focused laser 
beam locally and permanently changes the com- 
plex refractive index of an organic dye polymer 
layer (Figure 13.1). Readout is based on signal 
enhancement or decrease by optical interference 
effects in the multilayer structure. The specifica- 
tion of the optical characteristics and thickness of 
the dye polymer layer allows the signal reflected 
from the reflective layer to be significantly 
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Figure 13.1 CD-R: during recording, the pit is created by heating the dye absorptive layer with a 


focused laser beam. 


decreased by the optical change induced in 
the polymer layer. The reflectivity difference 
between recorded 1’s and 0’s of CD-R and simple 
CD-ROM media is similar, thus ensuring media 
interchangeability. 

Rewritable optical storage is presently based 
on MO or phase-change (PC) media. The CD-RW 
drives and media introduced in 1997 by Philips 
represented a major breakthrough in optical disk 
storage. Using phase-change technology, the 
CD-RW systems allow disks to be written and 
rewritten many times over. CD-RW is, therefore, 
a medium of choice for both temporary and long- 
term data storage. 

The disk structure in the case of phase-change 
media is more complex than the simple three- 
layer structure of a CD-ROM. The CD-RW disk 
consists of a grooved polycarbonate substrate 
onto which a stack of thin layers is sputtered, fol- 
lowed by a protective lacquer (Figure 13.2). The 
phase-change or recording layer is sandwiched 
between two dielectric layers. These are typi- 
cally zinc selenide-silicon dioxide (ZnS-SiO,) 
layers that provide thermal tuning of the record- 
ing layer. The material for erasable phase-change 


recording is typically a Ge,Sb,Te,; alloy which is 
sputter deposited on a plastic substrate, with an 
undercoat and an overcoat of dielectric layers. 
Other phase-change materials such as PdTeO,, 
InSnSb, AgInSbTe, etc., are also in use for both 
write-once and rewritable media. In addition, the 
stack comprises an aluminum layer from which 
the laser beam is reflected. As the disk comes out 
of the sputtering machine, the recording layer is 
in an amorphous state. The disk is then put into 
an initializer, which heats up the phase-change 
layer to the point where it crystallizes. Prior to 
recording, the phase-change layer is polycrystal- 
line in its original state. 

The phase-change recording mechanism 
relies on the reflectivity difference between 
dark amorphous zones and bright crystalline 
zones. For writing, a focused laser beam selec- 
tively heats small areas of the phase-change 
material near or above its melting temperature, 
which is between 500°C and 700°C. Heating 
the liquid state is achieved in the area under 
the laser beam spot. If the material is then 
cooled sufficiently quickly, the random liquid 
state is “frozen-in” resulting in the amorphous 


Protective layer 


Figure 13.2 Structure of a CD-RW: The active phase-change layer is surrounded by two dielectric 
barriers and covered by the reflective layer and protective overcoat. 
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Figure 13.3 For writing, the phase-change layer is heated by a high-power laser beam to form an 
amorphous pit. The data are erased with a low-power laser beam by reverting the material to the 


crystalline phase. 


state. To erase the recorded data marks, the 
laser heats the phase-change layer to recrys- 
tallize the material. Hereby, the temperature 
of the phase-change layer is kept below the 
melting temperature but above the crystalliza- 
tion temperature for longer than the crystal- 
lization time so that the atoms revert back to 
an ordered, crystalline state (Figure 13.3). The 
mark will become crystalline once again if the 
laser beam strikes the amorphous mark for suf- 
ficient time. Subsequent rewriting is possible 
when the material reverts back to an ordered 
crystalline state. The amorphous and crystal- 
line states have different values of refractive 
index which tune the reflectivity of a multilayer 
stack. The recorded data can, therefore, be read 
out optically as reflectivity change between 
the low-reflective amorphous pits and high- 
reflective surrounding area. The phase-change 
process is reversible many thousands of times. 
Because the reflectivity difference of a CD-RW 
is much lower than the 70% of a CD-ROM, the 
sensitivity of CD drives has to be increased to 
allow readout of CD-RW media. In recent years, 
significant improvements have been achieved in 
both rewritable media and drive performance to 
increase the recording speed and cyclability. 


13.2.2 Digital versatile disk 


Stimulated by the success of the CD, significant 
developments in many related fields have been 
undertaken resulting in further improvements. The 
large potential of accumulated R&D in that area 
made it possible to introduce the standard for the 
advanced DVD format. The basic structure of DVD- 
ROM is similar to the conventional CD-ROM but 
many parameters have been refined or reinvented 
to increase the surface data density and thereby the 
storage capacity. 

The surface data density on an optical disk is 
inversely related to the spot size of the address- 
ing beam. By reducing the spot size of the focused 
laser beam, smaller pits can be resolved and the 
surface density increases. This can be achieved by 
shortening the wavelength of the laser light and/ 
or by increasing the numerical aperture (NA) of 
the optical system because the spot size is related 
to the wavelength divided by the NA. The specifi- 
cation employed to expand DVD’s storage capac- 
ity includes smaller pit dimensions, more closely 
spaced tracks and shorter wavelength lasers. Both 
the track pitch and the shortest pit length are 
nearly a half less than those of the CD (Figure 
13.4). The optical system has also been refined 
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Figure 13.4 Pit-land structure of the CD compared to the refined data structure of the DVD (Sony). 


with a higher NA lens, resulting in a more tightly 
focused laser beam. The simple single-layer DVD 
stores 4.7 GB per layer, which is seven times the 
storage capacity of the CD. 

Improvements in the overall storage capacity per 
disk have been achieved in different DVD format 
extensions that use two layers or both sides of the 
disk. However, due to the presence of the reflective 
layer, the number of layers per disk side is strongly 
limited to two layers; one of them is semireflective 
and the other is high reflective. The optical signal 
degradation when the reading laser beam passes 
through the first, semireflective layer as well as 
crosstalk effects between the layers require two dis- 
tinct wavelengths, that is, lasers, one for each layer. 
In general, dual-layer or double-sided configura- 
tions can provide moderate increases in storage 
capacity but at the same time they require more 
complex and more expensive systems. 

Dual-layer, single-sided DVD is developed as a 
two-layer structure with the total storage capacity 
of 8.5 GB. The first semireflective layer reflects 18%- 
30% of the laser light, which is enough for the read- 
out of the stored bits. With the transmitted light, the 
information from a highly reflective layer can also be 
read out. Gold was previously used as semireflective 
layer material, but new silicon layers as gold replace- 
ment provide significantly reduced production costs. 
Double-sided dual-layer DVD promises 17 GB but 
the electronics for reading and decoding the multi- 
layer DVDs becomes much more complicated. An 
overview of different multilayer DVD formats is 
given in Figure 13.5. 


A similar two-layer structure is used in Super 
Audio CD, the new generation of digital audio 
media realized as a refined version of the CD 
audio to provide really high-fidelity playback. This 
is done by adding one semireflective layer to the 
conventional CD structure that contains high- 
density information. A silicon-based high density 
layer is semireflective at 650 nm wavelength and 
has almost 100% transmittance at 780 nm, thereby 
allowing the standard CD laser light to be reflected 
from metallic reflective layer. With 650 nm wave- 
length, the pits smaller than those on a standard 
CD can be readout, leading to an enhanced storage 
capacity of up to 4.7 GB. 

In recent years, implementation of record- 
able/rewriting technology in the DVD family has 
been the most serious challenge. Although, first 
recordable (DVD-R) and rewritable (DVD-RAM) 
products are available now, it is still not clear 
which format will become the standard DVD 
recording technology. Second generation record- 
able DVD format boosts capacities from 2.6 GB 
per side to 4.7 GB. However, rewritable DVD for- 
mats remain far away from achieving standards 
of compatibility, which have been the crucial 
issue for a successful introduction of CD-R and 
CD-RW. Three different DVD rewritable formats 
are presently competing for acceptance in the 
market place. The leading candidates to become 
the standard DVD recording technology—Sony’s 
DVD+RW and Pioneer’s DVD-RW—are incom- 
patible so that the DVD family remains far away 
from CD harmony. 
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Figure 13.5 Different DVD configurations and corresponding storage capacities. 


13.2.3 Blu-ray disk 


The implementation of blue-violet diode lasers in 
DVD systems leads to a further increase of storage 
density. In February 2002, nine electronics compa- 
nies (Philips, Sony, Pioneer, and others) have estab- 
lished the basic specification for a forthcoming 
optical disk video recording format called “Blu-ray 
Disk.” Blue generation optical disks store six times 
more data than DVD. Introduced as a new video 
format, Blu-ray Disk enables 27 GB to be stored on 
a single disk. With its third generation, the opti- 
cal disk technology follows the development paths 
established in the evolutionary step from CD to 
DVD. The most important issue in the develop- 
ment of Blu-ray was the availability of short- 
wavelength semiconductor lasers. The significant 
wavelength reduction from 650 nm in DVD to 
405 nm in the new generation was enabled by the 
realization of gallium-nitride laser diodes emit- 
ting blue-violet light at the short-wavelength end 
of the visible spectrum. First introduced by Nichia 
Chemical Corporation, a small chemical company 
in Japan, violet-emitting GaN diodes have found 
wide applications in light-emitting diode (LED)- 
based systems. The invention of efficient blue LEDs 
was crucial in developing white light sources for 
illumination. The related research and develop- 
ment achievements have been recognized by 2014 
Nobel Prize in Physics. 


In addition to the wavelength reduction, the 
Blu-ray system comprises a high NA lens that 
focuses the blue-violet light of a Ga-N diode to an 
extremely small spot size. However, refinements 
of the pit-land structure only would not allow 
such high storage capacity. Sophisticated coding 
and error correction technology implemented in 
Blu-ray disk video recorders makes it possible to 
additionally enhance the storage capacity. With 
the blue generation, conventional optical storage 
is approaching physical limits beyond which the 
data-bearing pit-land structure may become too 
fine to still be detected. 


13.2.4 MO disks 


MO disks were the first rewritable storage media, 
available on the market since the beginning of the 
1990s. MO disks are mainly used in professional 
data processing but with lower costs per megabyte, 
new applications are being opened. For consumer- 
oriented applications, the minidisk, an MO disk in 
the 2.5in. format was introduced as mini portable 
audio medium. In the last years, the MO disk tech- 
nology has been advanced continuously further to 
allow improved storage capacity and read/write 
performance. 

Data storage in MO disks is based on opto- 
thermic magnetic effects. Information is stored 
as a magnetized state of the magneto-optic layer 
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and will be read out as polarization change of the 
laser beam using the Kerr effect. The MO layer 
that stored the information as corresponding 
magnetized state is protected from oxidation by 
two dielectric barrier layers (SiN) (Figure 13.6). 
Together with the reflective layer, the barriers 
ensure an optical signal enhancement. Presently, 
all commercially available MO disks are based 
on an amorphous terbium-iron-cobalt magnetic 
alloy. This material belongs to a class of materials 
known as the rare earth-transition metal alloys. 

For the recording of binary data, the MO layer 
is heated by the laser spot above the Curie tempera- 
ture where its magnetic orientation is dissipated. 
When this spot cools, the new magnetic orienta- 
tion is set by the magnetic head corresponding to 
the “0” and “1” of the digital signal (Figure 13.7). 
During readout, the polarization of the read beam 
is rotated, thereby detecting the recorded bits. 


The MO media rotate the polarization vector of 
the incident beam upon reflection. This is known as 
the polar MO Kerr effect. The sense of polarization 
rotation is dependent on the state of magnetization 
of the medium. Thus, when the magnetization is 
pointing up, for example, the polarization rotation 
is clockwise, whereas downmagnetized domains 
rotate the polarization counterclockwise. The polar 
Kerr effect provides the mechanism for readout 
in MO disk data storage. Typical materials used 
today impart about 0.5° polarization rotation to 
the linearly polarized incident light. But, given the 
extremely low levels of noise in these media, the 
small Kerr signal nonetheless provides a sufficient 
signal-to-noise ratio for reliable readout. The media 
of MO recording are amorphous. Lack of crystallin- 
ity in these media makes their reflectivity extremely 
uniform, thereby reducing the fluctuations of the 
read signal as well as level of noise in readout. 
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Figure 13.7 Left: Recording of an MO disk. The pit mark is formed by heating the MO layer with a 
focused laser light. Right: Readout of an MO disk. The data are read out by the polarization change of 


the read beam. 
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To rewrite an MO disk, all previously recorded 
bits must be erased before new data can be 
recorded. This requires either a recorder with two 
lasers (one to erase and one to record) or longer 
recording times in a single laser system because 
the laser must erase the data in the first rotation 
and then record the new data in the second one. 
Alternatively, one can start with an erased track, 
apply a reverse-magnetizing DC magnetic field to 
the region of the interest, and modulate the laser 
power to record the information along the track. 
This is known as the laser power modulation 
recording (or light intensity modulation—LIM) 
scheme. The laser power modulation does not 
allow direct overwrite of the preexisting data on 
the track unless a more complex media structure is 
employed. Exchange-coupled magnetic multilayer 
structure allows LIM direct overwrite in MO sys- 
tems. In such systems, the top and bottom layers 
are exchange coupled together so that switching 
one layer would make the other layer to switch, 
too. The top and bottom layers have perpendicular 
magnetic anisotropy and are separated by an inter- 
mediate layer, which is in-plane magnetized and 
aimed to ease the transition between the top and 
bottom layers. In an LIM direct overwrite scheme, 
the recorded domains collapse under a laser beam 
spot of moderate power. A high-power beam is used 
to create the domains through the whole thickness 
of the multilayer stack. The recorded domains are 
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read out by a low-power beam, which do not dis- 
turb them so that only a laser beam of moderate 
power can erase prerecorded marks. Erasure is 
very similar to writing, in that it uses the heat gen- 
erated from the laser beam and requires assistance 
from an externally applied magnetic field to decide 
the direction of magnetization after cooling down. 
This method is simplified in a minidisk recorder 
based on a magnetic field modulation overwrite 
system. Here, the new data are written immedi- 
ately over the previous data. Writing is achieved 
by a continuous-wave laser beam and a modulated 
magnetic field. 

MO disks are currently available in 3.52 in 
single-sided and 5.25 in double-sided formats with 
storage capacities up to 640 MB or 2.6 GB, respec- 
tively. Storage densities of MO media are being 
substantially increased using new advanced tech- 
nologies like magnetic super resolution (MSR) and 
near-field recording (NFR). Doubling of storage 
capacity is expected for the first MO disks based on 
MSR. Further advances resulting from MSR allow 
storage capacities of up to 10 GB for 5.25 in. disks. 

On MO disks, information is written by heat- 
ing the recording material with the write laser 
beam. The stored data are retrieved by reflect- 
ing the read laser beam from the structured disk 
whereby the optical resolution is limited by dif- 
fraction. By using the MSR process, it is possible 
to write bits smaller than the laser spot, that is, 
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Figure 13.8 MSR. The temperature profile of a focused laser beam is used to create smaller pits than 


the laser spot. 
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below the optical diffraction limits. This method 
is based on specific magnetic properties of the 
“rare earth/transition metal” materials, which 
evolved as storage material for MO recording. The 
write process is the same as writing on conven- 
tional MO disks but the recorded bits are smaller 
than the laser spot. This becomes possible by using 
the temperature profile induced by the laser spot 
in a recording material (Figure 13.8). The bits are 
written only within the central area of the laser 
spot where the magnetic material is reversed by its 
high temperature. The advantage of MSR is that 
the minimum mark length readable in an MSR 
system is shorter than any that are readable in 
conventional systems. 

Comparing phase-change and MO technolo- 
gies, one can find several advantages and dis- 
advantages for each. For a number of reasons, 
phase-change recording appears to be the ideal 
solution for rewritable optical storage of informa- 
tion. CD-RW drives are simpler than MO drives, 
because they do not need magnets to create exter- 
nal magnetic fields, and because there is no need for 
sensitive polarization-detecting optics in CD-RW 
readout. Phase-change readout makes a profit from 
the reflectivity difference between crystalline and 
amorphous states which is large enough to provide 
much higher signals than the relatively weak MO 
effect. On the other hand, repeated melting, crys- 
tallization, and amorphization of phase-change 
media results in material segregation, formation 
of submicron areas that remain crystalline, stress 
buildup, etc. These factors might reduce data reli- 
ability and cyclability of the phase-change media. 
MO disks are guaranteed to sustain over 10° read/ 
write/erase cycles while the corresponding figure 
for phase-change media is typically one to two 
orders of magnitude lower. The maximum temper- 
ature reached in MO media during recording and 
erasure is typically around 300°C, as compared to 
600°C in PC media. The lower temperatures and 
the fact that magnetization reversal does not pro- 
duce material fatigue provide the longer life and 
better cyclability of the MO media. 


13.3 OPTICAL DISK STORAGE 
SYSTEM 


In conventional, well-established technologies, 
an optical storage system consists of a drive unit 
and a storage medium, which is usually a rotating 


disk. The disk storage medium and the laser pickup 
head are rotated and positioned through a drive 
motor. In general, optical disks are preformatted by 
grooves and lands that define the so-called tracks 
on a disk. A track represents an area along which 
the information is stored; an optical disk consists 
of a number of tracks that may be concentric rings 
of a certain track width separated by a land area. 
The separation between two neighboring tracks is 
the so-called track pitch. Typical track spacing on 
existing optical disks are on the order of micron. In 
the simplest recording scheme, pit marks of equal 
length are created along the track (either recorded 
or stamped) while the presence or absence of these 
marks corresponds to binary digits 0 and 1. Tracks 
might be physically existent prior to the recording 
or created by prerecorded data marks themselves. 

On a read-only medium, for example, CD audio 
disk, quarter-wave deep pits are stamped into 
an optical disk substrate and then coated with a 
reflective layer to provide the readout by interfer- 
ence effects in the reflection configuration. In this 
case, pit marks define their tracks that are discon- 
tinuous grooves consisting of pits separated by 
lands. The length of both pits and lands within a 
track is defined by the encoded bit stream stored 
within that area. Tracks are necessary to enable 
the positioning of the optical head that accesses 
the information on the disk and to guide the laser 
beam during readout. In case of recordable media, 
the disk is typically pregrooved, that is, continu- 
ous grooves are printed, etched, or molded onto 
the substrate to define preexisting tracks. The 
grooves represent tracks and are separated by 
lands. The information is recorded along the tracks 
that can be either concentric rings or a single spi- 
ral. Alternatively, the lands may also be used for 
recording—in this case, adjacent tracks are sepa- 
rated by grooves. Moreover, land-groove record- 
ing, that is, recording in both land and groove 
areas, has been introduced in DVD technology to 
increase storage density by a more efficient usage of 
the available storage area. 

All optical components needed for record- 
ing and retrieval of information are integrated in 
an optical pickup system or simply optical head. 
Today’s optical heads are small, compact, and 
highly optimized systems that fly close to the disk 
surface. The optical head must be able to rapidly 
access any position on the disk for error-free recov- 
ery. Depending on the disk technology, the optical 


516 Optical information storage and recovery 


head might have different architectures but the 
basic configuration is the same for all present sys- 
tems. Usually, the head comprises a laser diode, a 
collimator lens, an objective lens, a polarizing beam 
splitter, a quarter-wave plate, and the detector sys- 
tem. A typical setup for an optical disk system is 
shown in Figure 13.9 [1]. A linearly polarized laser 
beam is emitted by the laser diode. The collimator 
reduces the divergence and collimates the beam, 
which then passes through a polarizing beam 
splitter and a quarter-wave plate. The quarter- 
wave plate circularly polarizes the incident beam, 
which is then focused onto the disk by an objec- 
tive lens that also collects the light reflected back 
from the disk. For recording, a focused laser beam 
generates a small spot within the active material to 
induce some kind of optically detectable changes. 
The recorded bit marks will then change the phase, 
amplitude, or polarization of the readout beam. 
The total reflected light is therefore modulated by 
the presence of pits—the light fractions reflected 
from the pits and the light fractions reflected from 
the land interfere destructively and modulate the 
data signal according to the stored information. 
The beam reflected back from the disk again passes 
through the objective lens and becomes recolli- 
mated afterward. The quarter-wave plate converts 
the circular polarization of the reflected beam to 
a linear one with the direction perpendicular to 
that of the incident beam. In this way, the incident, 
readout beam, and the reflected, signal beam can 
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be separated by the polarizing beam splitter, which 
directs the reflected light to the data detector. The 
detection system produces the data or readout sig- 
nal but also optoelectronic signals needed for auto- 
matic focusing and track following. Specific servo 
systems are required to control the position of the 
optical head with respect to the tracks on the disk. 
Additional peripheral electronic units are used for 
functional drive control, data reconstruction, and 
encoding/decoding. 

An optical disk storage system is characterized 
by several functional quantities that specify its per- 
formance in terms of capacity and speed. Typical 
parameters are storage capacity, access time, data 
transfer rate, and cost. The storage capacity is 
determined by the areal density of the stored infor- 
mation and the geometrical dimensions of the disk 
medium, that is, available storage area. The areal 
density characterizes the efficiency of a system in 
using the storage area and is a direct function of 
the spot size and/or the minimum dimension of a 
stored bit mark. It is typically given in units of 
gigabits per square inch or bits per square micron. 
The areal density is limited by the optical resolu- 
tion of the laser pickup, that is, by the minimum 
dimensions of data marks that still can be detected 
by the optical system. The NA of the objective lens 
and the wavelength of the laser used for record- 
ing and readout determine the diffraction-limited 
spot size and therefore the data density but further 
factors also have to be taken into account such as 
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Figure 13.9 Basic configuration of an optical disk system. 
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track density and linear bit density. The track den- 
sity (tracks/in.) expresses how close to each other 
neighboring tracks on the disk can be arranged. 
The linear bit density (bits/in.) is a metric for the 
spacing between optical transitions along a track. 
Other parameters of interest are access time 
and data transfer rate, which characterize how fast 
the information stored on the disk can be accessed 
and read out. The access time depends primarily 
on the speed with which the optical head can move 
over tracks to access a given storage location on 
the disk. The data rate is a metric for recording and 
readout speed and it depends on the linear bit den- 
sity and the rotational speed of the drive. The data 
rate becomes important when large data files, for 
example, images or video files, have to be stored or 
retrieved. Different paths to higher data rates have 
been proposed in CD/DVD technology including, 
for example, recording/readout by multiple beams, 
that is, usage of several laser pickup systems simul- 
taneously. The laser beam power available for writ- 
ing and reading, and the speed of the servo system 
controlling the optical head position are some of 
the parameters that limit the data transfer rate. 


13.3.1 The objective 


The objective lens in an optical disk system must be 
designed to correct for spherical aberration, which 
is due to the substrate thickness. Current optical 
disk systems use molded glass lenses to focus the 
laser beam to a diffraction-limited spot. There are 
several advantages of the molded lens which is 
typically a single aspheric lens over conventional 
objective lenses made up of multiple elements. 
These advantages result from the molding process 
itself which is much better suited to mass fabrica- 
tion. Furthermore, the working distance, that is, 
air spacing between the objective lens and the disk 
surface, of a molded lens is larger, which simplifies 
the design and function of the head in a removable- 
media optical system. Also, the molded lens mass 
is lower than a conventional lens, which reduces 
actuator forces needed for automatic focusing and 
track following within the servo system. 

In order to achieve ultimately small spots on 
the reflecting data layer, the objective lens must 
have fairly large NA, and it must be free from 
aberrations. Hereby, the NA of a lens is defined by 
NA=nxsin®, where 0 is the half angle closed by 
the cone of the focused laser beam. The diameter of 


the diffraction-limited focused spot is then given 
by the ratio of the wavelength of the laser beam, 
and the NA of the objective lens, that is, d,...~A / 
NA, where A, is the vacuum wavelength of the 
laser beam. It becomes clear from the above rela- 
tion that higher NAs are desirable if smaller spots 
and therefore higher storage densities are to be 
achieved. The smaller the spot of the readout beam, 
the smaller data marks can be resolved. The areal 
data storage density depends directly on the spot 
size and according to the above equation, it can be 
increased by reducing the wavelength and/or by 
increasing the NA. In practice, both approaches 
encounter limitations and also require adaptations 
and advances in a number of constituting tech- 
nologies. The most important aspect in achieving 
greater storage densities by reducing the wave- 
length is the availability of short-wavelength lasers. 

Reducing the spot size by increasing the NA 
will also reduce the focal range of the laser beam, 
the so-called depth of focus. The focal range can be 
estimated by the Rayleigh length of the laser beam 
and is therefore proportional to 4/NA?, which 
means that the higher NA, the smaller will be the 
depth of focus. That clearly limits NA as an opti- 
cal storage system is capable of handling the focus 
error only within this range. Other tolerances such 
as those for the disk tilt and the substrate thick- 
ness are also limited by NA and have to be con- 
sidered in designing the objective lens. Presently, 
the NA of objectives used in CD technology is 0.45, 
which has been increased to 0.6 in DVD systems to 
achieve higher storage capacity. A further increase 
of NA would require far-reaching adaptations in 
both the disk configuration and optical system 
to provide that the diffraction-limited focus will 
be maintained with the desired accuracy on the 
reflective data layer. 


13.3.2 The laser 


Optical storage of information became first prac- 
tically possible with the invention of laser. Rapid 
developments in the field of laser systems have 
supported the technological realization of existing 
optical storage systems. In general, lasers are used 
in all optical technologies for data recording and 
often also for data recovery. The ultimate prem- 
ises for light sources in optical storage systems are 
small size, stability, long lifetime, and inexpensive 
mass production. The optical drive so as we know 
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it became therefore possible first with the estab- 
lishment of laser diodes. Conventional optical 
disk systems, such as CD, DVD, and MO, rely all 
on semiconductor laser diodes as their source of 
light. The diode laser technology has been the key 
enabling technology for optical storage while the 
success of CD consumer products (CD audio and 
CD-ROM in PCs) has pushed the diode laser to the 
best-selling laser products of all time. Compared to 
other laser types, the laser diode has many advan- 
tages with regard to the requirements defined by 
the design and functionality of an optical head. In 
general, the optical head contains one laser diode 
that provides light for recording, reading, and 
erasing (given in rewritable systems only). Each of 
these functions sets specific requirements on the 
light source. 

Optical data storage systems for mainstream 
applications such as computer and entertain- 
ment need compact and cheap lasers that can be 
with ease integrated in low-weight optical heads 
and small, low-cost drives. In addition, specific 
requirements of optical data storage concern the 
wavelength of emitted light, optical power avail- 
able for writing, reading and erasing, modulation, 
and life time of the laser diode which should exceed 
several thousand hours. The wavelength of light 
used to write and read bit marks is crucial for the 
optical resolution of the pickup system and, conse- 
quently, for the areal storage density. The shorter 
the wavelength the smaller bits can be resolved and 
the higher areal storage density can be achieved. 
Stimulated by the success of CD and later DVD, 
extensive research and industrial development 
efforts have been undertaken in past decades to 
satisfy demand for short-wavelength laser diodes 
with sufficient optical power for the operation in 
optical drives. 

The power requirement for diode lasers in opti- 
cal disk systems varies from several milliwatts 
for retrieval of stored information to several tens 
of milliwatts for data recording. The laser output 
radiation is modulated directly by modulating the 
input electrical current. The fast modulation of 
laser radiation is perhaps the most important char- 
acteristic of laser diodes for optical recording. Laser 
diodes can be modulated to GHz frequencies with 
rise and fall times of less than 1 ns. For read-only 
applications, low-power lasers with approximately 
5-10mW optical power can be used. In contrast, 
for recordable and rewritable media, more power 


is needed because the laser beam has to induce 
almost instantaneously detectable changes in the 
recording material. For example, amorphous pit 
marks on a CD-RW are thermally induced by a 
pulsed laser beam with an optical power of either 
50 or 60 mW. 

Another requirement concerns the spatial 
coherence and single transverse mode operation 
as the laser beam has to be focused to the dif- 
fraction limit. The laser cavity must be a single- 
mode waveguide over its operating power range 
to provide wave fronts that are needed to achieve 
ultimately required small spots. The longitudi- 
nal mode stability has not been a requirement up 
to now so that laser diodes incorporated in the 
present optical heads typically operate in several 
longitudinal modes. Otherwise, a single longitu- 
dinal mode operation might become important 
in reducing undesired wavelength fluctuations. 
Fluctuation of both intensity and wavelength is 
one of the characteristics of diode lasers. Although 
intensity fluctuations reduce the signal-to-noise 
ratio of the readout process and generate noise in 
the servo signals, wavelength fluctuations set addi- 
tional requirements on achromatic design of opti- 
cal components. 

The wavelengths of optical data storage have 
been continuously reduced since the introduc- 
tion of the first laser disk systems, which started 
at 830 nm. CD audio and early CD-ROM systems 
rely on infrared diodes at 780 nm while moderate 
power lasers at 680 nm are used in CD-R and MO 
products, and also in computer drives. The DVD- 
ROM standard relies on red-emitting diodes at 635 
and 650 nm. The wavelength reduction has been 
one of the crucial factors for the sevenfold increase 
in storage capacity from CD to DVD (Figure 13.10). 
A shorter wavelength laser would support a higher 
storage density as shorter wavelength light can be 
focused to a smaller spot at the diffraction limit. As 
the diameter of a focused laser spot is proportional 
to its wavelength, the reduction of the wavelength 
will lead to the reduction of the spot size by the 
same factor, and consequently, to an increase of the 
storage density by the square of that factor. 

Recent developments in the field of IH-V 
semiconductor diode lasers allow for an almost 
revolutionary transition from the red to the blue- 
violet spectral range. Enormous advances have 
been achieved by incorporating short-wavelength 
blue-violet diode lasers in optical storage systems. 
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Figure 13.10 The increase in storage density from CD to DVD has been primarily achieved by decreas- 
ing the bit dimension but also by reducing the tracks spacing, by using shorter wavelength lasers and 
higher NA optics. At the same time, the substrate thickness has been reduced from 1.2mm on a CD 
to 0.6mm on a DVD, whereas a Blu-ray disk comprises a protective layer of 0.1mm thickness only. 


Blue-violet and ultraviolet diode lasers operating 
in the range of 370-430 nm have been first devel- 
oped by Nichia Chemicals Corporation in Japan 
and introduced to the market in 1999. Announced 
as high-density DVD, the Blu-ray standard is based 
on a 50mW blue-violet diode laser. By adopting a 
405 nm laser, the “blue” optical disk technology 
minimizes its beam spot size by increasing the 
NA to 0.85. On the other hand, such an extremely 
sharply focused laser beam is characterized by a 
very small focus depth. Therefore, the substrate 
becomes also extremely thin with a thickness of 
0.1mm only so that the disk must be protected 
from outer influences and damages by a cartridge. 
Both issues, short wavelength and high NA, allow 
for reducing the pit size to approximately 0.2 1m, 
and the tracking pitch to 0.32j1m, almost half of 
that of a regular red DVD. All these refinements 
and improvements together push the DVD tech- 
nology up to 27 GB high-density recording. 


13.3.3 The servo system 


Readout in optical storage relies on data recon- 
struction from a disk rotating with a several thou- 
sand rotations per minute. To provide a faithful 
retrieval of the stored data, the laser beam must 
be focused exactly on the disk track and then 
maintained accurately within it during the entire 
readout process. Typical tolerances in optical disk 
systems are 11m for positioning of the focus in 
the reflective data layer, and one-tenth of micron 
for focus positioning on the track center. The axial 
and radial runout of an optical disk are two or 
three order of magnitudes larger than these allow- 
able focus positioning errors. The optical drive, 


therefore, requires a servo system to compensate 
the radial and vertical runout of the disk as it spins 
and to provide submicron focus and track-locking 
schemes. The servo system is a closed-loop opto- 
electro-mechanical system, which couples optical 
position detectors to high bandwidth actuators to 
actively follow the disk rotation. Consequently, the 
servo control in an optical disk system involves 
accurate and continuous focus position error sens- 
ing and sophisticated feedback mechanisms that 
dynamically convert the detected error signals in 
corresponding actuator movements. 

The task of actuators within the servo system is 
to correct and to control the position of the focus- 
ing optics, that is, objective lens. A typical objec- 
tive lens has an NA of 0.45 or higher to create a 
focused beam spot smaller than 1pm. In addi- 
tion, the focused beam has a focus depth, which 
is only a fraction of a micron. On the other hand, 
a rapidly rotating disk has a tendency to wobble in 
and out of its ideal position in the optical drive. 
There is a variety of reasons that may cause such 
wobble effects, some of them are imperfections in 
the disk construction, substrates that are not ide- 
ally flat, other manufacturing errors in both disks 
and drives, disk tilt, and eccentricity, etc. In the 
ideal case, the disk mounted in a drive would be 
perfectly centered, and sufficiently flat to maintain 
an ideal perpendicular position with respect to the 
rotation axis at all times. Deviations up to +100 pm 
in both vertical and radial directions usually occur 
during the disk rotation in an optical drive. 

The task of the servo system is twofold: First of 
all, the laser beam spot focused into the reflective 
layer of the disk must remain within the depth of 
focus, and second, the focused spot must remain 


520 Optical information storage and recovery 


within a submicron-sized track while the disk 
rotates and wobbles in and out of both focus and 
track. The mechanism to maintain the laser spot 
on the disk within the focus depth is called auto- 
matic focusing and it is aimed to compensate ver- 
tical runout of the disk. In addition, automatic 
tracking servo is needed to maintain the position 
of the focused spot within a particular track—the 
mechanism is called automatic track following. 
In the ideal case, tracks are perfectly circular and 
concentric, and the disk is perfectly centered on 
the rotation axis. In practice, the eccentricity of 
both the tracks and disk creates demand for active 
track following mechanisms that compensate wob- 
ble effects and enable the laser beam spot to follow 
the track. 


13.3.3.1 AUTOMATIC FOCUSING 


Current optical drives rely on several different 
methods that provide the feedback mechanisms, 
that is, error signals that drive the focus servo sys- 
tem. The objective lens is mounted in a voice coil 
actuator with a bandwidth of several kilohertz, and 
the feedback mechanism is used to position the lens 
relative to the rotating disk in such a way as to main- 
tain focus at all time. The basic premise is that an 
appropriate error signal is generated which is then 


Focus error 
signal 


Beam splitter 


fed back to the voice coil actuator for maintaining 
focus automatically. Depending on the detection 
scheme, various techniques have been proposed 
to generate the focus error signal (FES). The signal 
needed for the closed servo loop is usually derived 
from the light that is reflected from the disk. Several 
techniques use a field lens that creates a secondary 
focused spot; deviations from optimum focus are 
then analyzed by observing that secondary spot. 
The field lens, which might be a spherical, ring-toric, 
or astigmatic lens is placed after the objective lens in 
order to focus the light reflected back from the disk. 
The shape, size, and position of the focused spot 
depend on the position of the disk relative to the in- 
focus plane. Changes in the secondary spot caused 
by the off-focus status of the disk are detected via 
a photodetector and transformed into an electronic 
signal that contains the feedback information for 
the focus servo system. 

A very popular mechanism for automatic focus- 
ing relies on the astigmatic lens detection method. 
The most of current optical disk systems use an 
astigmatic servo sensor that comprises an astig- 
matic lens having two different focal lengths along 
orthogonal axes, and a quad detector. Figure 13.11 
shows a diagram of the astigmatic focus-error 
detection system used in many practical devices. 


Collimator 


Astigmatic lens 


Figure 13.11 Astigmatic focus-error detection system. 
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Figure 13.12 Four-quadrant-detector combined with an astigmatic lens is used in optical disk systems 
for both automatic focusing and track following. Control electronic signals are generated by an 
appropriate combination of the signals from four detector segments. 


The light beam returned from the disk and colli- 
mated by the objective lens might be convergent or 
divergent, dependent on position of the disk rela- 
tive to the plane of best focus. The reflected beam 
passes an astigmatic lens, which normally focuses 
the incident light beam to a circularly symmetric 
spot halfway between its focal planes. In the best, 
in-focus case, a quad detector placed at this plane 
receives equal amounts of light on its four quad- 
rants. In contrast, when the disk is out of focus, the 
astigmatic lens will create an elongated spot on the 
detector so that individual quadrants will be illu- 
minated differently and, consequently, they will 
create different electronic signals (Figure 13.12). 
Depending on the sign of defocus, this elongated 
spot may preferentially illuminate quadrants A 
and C or quadrants B and D of the detector. A 
bipolar FES can be then derived as the difference 
between diagonal quadrants, that is, 


(A+C)-(B+D) 
A+B+C+D — 


FES x 


13.3.3.2 AUTOMATIC TRACK FOLLOWING 


The information is recorded on an optical disk 
either around a series of concentric circular tracks 
or on a continuous spiral. Manufacturing errors 
and disk eccentricities caused by mounting errors 
or thermal expansion of the substrate, for exam- 
ple, will cause a given track to wobble in and out 
of position as the disk spins. Typically, a given 
track might be as much as +100 1m away from its 
intended position at any given time. The focused 


spot is only about 11m across and cannot be at 
the right place at all times. An automatic tracking 
scheme is, therefore, desired. The feedback signal 
for controlling the position of the objective lens 
within the tracking coil is again provided by the 
return beam itself. The four segments of the detec- 
tor are combined in different ways for FES and 
track error signal (TES). Several mechanisms for 
automatic track following have been proposed and 
applied in commercial devices. 

The push-pull tracking mechanism relies on 
the presence of either grooves or a trackful of data 
on the media. In the case of CD and CD-ROM, 
the data are prestamped along a spiral on the sub- 
strate, and the sequence of data marks along the 
spiral represents a sort of discontinuous groove 
structure. Writable media such as CD-R, MO, and 
PC require a tracking mechanism distinct from 
the data pattern, because prior to data recording, 
the write head must be able to follow the track 
before it can record anything. Once the data are 
recorded, the system will have a choice to follow 
either the original tracking mechanism or the 
recorded data pattern. Continuous grooves are the 
usual form of preexisting tracks on optical media. 
A typical groove is a fraction of a micron wide and 
one-eighth of the wavelength deep. As long as the 
focused beam is centered on a track, diffraction of 
light from the adjacent grooves will be symmetric. 
The symmetry of the reflected beam, as sensed by 
a quad detector in the return path, would produce 
a zero error signal (Figure 13.13). However, when 
the focused spot moves away from the center of 
the track, an asymmetry appears in the intensity 
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Figure 13.13 TES generated by push-pull method. 


pattern at the detector. The difference signal is suf- 
ficient to return the focused spot to the center of 
the track. The TES is thus generated by the quad 
detector according to 


TES oc (ATB)=(C#D) 
A+B+C+D 


In read-only media, where continuous grooves are 
not present, another method of tracking is applied, 
the so-called three-beam method. The laser beam 
is divided into three beams, one of which repre- 
sents the main spot and follows the track under 
consideration, while the other two are focused on 
adjacent tracks, immediately before and after the 
addressed track. Consequently, three detectors are 
necessary for position error sensing. Any move- 
ment of the central beam away from its desired 
position will cause an increase in the signal from 
one of the outriggers and, simultaneously, a 
decrease in the signal from the other outrigger. A 
comparison of the two outrigger signals provides 
the information for the track-following servo. 

In another possible tracking scheme, the so- 
called sampled servo scheme, a set of discrete 
pairs of marks is placed on the media at regular 
intervals. The wobble marks indicate the trans- 
versal boundaries of a track. Such marks might be 
embossed or written by a laser beam within the 
formatting procedure. These marks are slightly 
offset from the track center in opposite directions, 
so that the reflected light first indicates the arrival 
of one and then of the other wobble mark. The TES 


Beam splitter 


Ea) "Quad detector 


is generated as the difference between the signals 
detected from each wobble mark. Depending on 
the spot position on the track, one of these two 
pulses of reflected light may be stronger than the 
other, thus indicating the direction of track error. 
The sample servo technique is often used in record- 
able media systems including both write-once and 
rewritable media. 

In contrast, present DVD systems rely on the 
differential phase detection technique that is based 
on diffraction of the focused spot from the edges of 
data marks. If the spot is focused offtrack, the light 
reflected from the disk will show an asymmetric 
intensity distribution each time the spot strikes a 
mark edge. The intensity pattern rotates when the 
spot travels along the data mark and this rotation is 
then sensed by a quad detector that generates cor- 
responding TESs. 


13.3.4 Data coding and processing 


An important step in storing digital data is encod- 
ing of the bit stream to be stored prior to recording, 
and, consequently, decoding of the readout sig- 
nal after its conversion into digital form. Hereby, 
digital data are extracted from the analog signal 
obtained by the data detector. In a digital storage 
system, the input is typically a stream of binary 
data, that is, binary digits 1 and 0, which has to be 
recorded onto a storage medium in such a way as to 
provide reliable and error-free recovery. The stor- 
age system is requested to record the data, to store 
it, and to reconstruct it faithfully upon request. 
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The digital data are first converted into analog sig- 
nals and then stored as a stream of data marks cor- 
respondingly modulated and embedded into the 
recording format. For retrieval, the original digital 
data are extracted from the analog signal that is 
collected by the optical head via the data detector. 
From the user input to the recovered output, the 
data will undergo several steps of electronic pro- 
cessing including analog/digital conversion, equal- 
ization, and filtering of the playback signal, error 
correction and modular coding, data synchroniza- 
tion, and organization within the recording chan- 
nel, etc. [2]. 

Within the storage system, not only a recording 
unit is incorporated but also additional electron- 
ics units or subsystems that perform diverse steps 
in data processing from input to output after the 
stored data have been reconstructed by the opti- 
cal read head. There are a variety of error sources 
that can cause errors in retrieval of information 
from a rapidly rotating optical disk. Some of them 
are related to the dynamic operation regime with 
the data structure reduced to the limit where the 
readout signal can only just be separated from the 
system noise. Also, media imperfections, defects, 
damages, etc., lead to errors in reconstruction 
of information sequences stored at the affected 
locations. To retrieve the data faithfully, all sorts 
of errors must be eliminated or compensated by 
appropriate error correction techniques. Therefore, 
the binary input data (user data) undergo several 
steps of encoding and modulation prior to being 
recorded on the storage medium. 

The encoding process involves several measures 
against diverse error sources but it also entails 
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other features that simplify the data processing 
and recovery. The flow of the data stream in an 
optical storage system is depicted in Figure 13.14. 
Binary user data are subjected to two encod- 
ing processes prior to recording on the storage 
medium: error-correction coding (ECC) and mod- 
ulation or recording coding. The encoding process 
includes typically one or more ECC steps followed 
by a modulation coding step where appropriate 
features are incorporated into the bit pattern. The 
ECC step is designed to protect the data against 
random and burst errors, and the modulation step 
organizes the data to be stored so as to maximize 
the storage density and reliability. In the ECC step 
additional bits, the so-called check bits, are gener- 
ated and added to the stream of user data in order 
to create an appropriate level of redundancy in the 
overall bit sequence. Both the source data and the 
data emerging from the first (ECC) encoder are 
unconstrained, that is, a randomly selected bit in 
the data stream may be either a “1” or a “O” with 
equal probability and arbitrarily long sequences of 
all “ones” and “zeros” may appear. 

In addition to the error correction, binary 
sequences are encoded by a modulation cod- 
ing whereby the bit-patterns to be recorded are 
expanded by certain additional features. These 
enable the generation of a clocking signal for 
the electronic waveform and also provide more 
efficient usage of the storage area available on 
the disk. The modulation step involves mapping 
of small blocks from the error-correction coded 
sequence into larger blocks known as modulation 
code words. The data emerging after the modula- 
tion encoding step are usually d, k constrained, 
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Figure 13.14 Channel data flow in an optical storage system. 
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that is, binary sequences are constrained in sucha 
way that there must be at least d but no more that 
k zeros between two ones. Each binary segment 
consists therefore of a one that is followed by at 
least d but no more than k zeros. The data encoded 
in such a way is run-length limited (RLL) and it 
is referred to as channel data because it becomes 
then converted into electrical waveforms used 
to control the recording process. The CD opti- 
cal disk technologies rely on a block modula- 
tion code known as eight-to-fourteen modulation 
(EFM) that expands blocks containing 8 data bits 
to 14-bit channel blocks. This modulation code is 
also applied in DVD systems in its advanced ver- 
sion, known as EFMPlus. 

During retrieval, the RLL channel data are 
reconstructed by the detector and processed by 
a modulation decoder which gives the error- 
correction coded binary data as output. The 
modulation decoder must correctly determine 
the logical ECC code word boundaries within the 
stream of channel data to enable subsequent ECC 
decoding to take place. The ECC decoder processes 
the data leaving the demodulator to detect and cor- 
rect any data error type for that it is designed to 
recognize and eliminate. Optical storage systems 
use typically Reed-Solomon (RS) codes for ECC. 
These are block codes, that is, encoded data consist 
of code words, or blocks, that contain a fixed num- 
ber of bits. Encoding entails organizing the blocks 
of binary data into a succession of multibit symbols 
called information symbols, computing a number 
of additional symbols (of the same length) called 
parity checks and appending the parity checks to 
the information symbol to form a code word. RS 
codes employed in optical data storage systems 
use eight-bit information symbols, that is, they are 
designed to operate on bytes instead of bits. An RS 
code will correct up to a certain number of errone- 
ous symbols in a given code word, and for correct- 
ing a specific number of erroneous bytes, it needs 
twice as many parity check bytes. For example, one 
of the two ECC used in CD audio systems is the RS 
code, which can correct up to 2 bytes of error in a 
24-byte-long block of user data, with the addition 
of four parity bytes. 

Information to be recorded on a disk is orga- 
nized into uniformly sized blocks. Each of these 
blocks is written onto a portion of the storage 
medium that is referred to as a sector. Each track 
on the disk is then divided into a number of sectors 


that contain the user data and any ECC parity 
information related to it, that is, calculated from 
it. Several different track formats find application 
in practical systems. As mentioned before, tracks 
can be realized either as concentric rings or a sin- 
gle continuous spiral whereby optical disks usually 
rely on the spiral format. For digital data storage, 
each track consists of sectors that are defined as 
small segments containing a single block of data. 
Spiral tracks are more suitable for writing of large 
data files without interruption while concentric 
rings better support multiple operation mode 
when different operations such as write, erase, 
verify, etc., are performed simultaneously in differ- 
ent tracks. The block of digital data stored within 
a sector has a fixed length, which is usually either 
512 or 1024 bytes. Each sector has its own address, 
the so-called header that specifies the storage loca- 
tion of a given sector on the disk. 

The storage location is given by the track num- 
ber and azimuthal position in the track at which 
the sector will be written. The information that 
constitutes a sector is usually written onto the 
medium in two parts. The first part of the sector 
is the sector header, which consists of special pat- 
terns known as sector marks together with sector 
address data. The headers of all sectors are prere- 
corded, that is, they are placed on the disk either 
when it is manufactured or when it is formatted for 
use. The additional space used by the codes and by 
the header within each sector constitutes the over- 
head, which may take between 10% and 30% of 
disk’s raw capacity depending on application. 

Prior to retrieving the data from the recorded 
storage medium, the optical detection head must 
access the medium and find the data that are 
requested by the storage system controller. The 
optical head must move to a particular radial posi- 
tion on the disk and it must find the track and 
sector that contain the requested data. Especially, 
in the case of removable disk drives, the system 
checks other important information about the 
disk, for example, the sector size being used on 
the disk, the amplitude and polarity of signals 
obtained from prerecorded sectors and headers, 
etc. For reliable recording and readout, the design 
of a storage medium implements a defined record- 
ing format. The written data are embedded in this 
format, which entails certain system information 
to be prerecorded at specific locations on the disk. 
The recording format also provides that the user 
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Figure 13.15 NRZ (left) and NRZI (right) conversion of binary sequences into electronic wave fronts. 


data are written in its appropriate sector type at 
particular locations on the disk. 

To conform to the recording format, the physi- 
cal structure of the data to be stored on the disk 
must satisfy several requirements. In Figure 13.15, 
two possible schemes for conversion of binary 
sequences into electrical waveforms are illus- 
trated. In the so-called non-return-to-zero (NRZ) 
scheme, each bit is allotted one unit of time during 
which the voltage is either high or low, depend- 
ing on whether the bit is 1 or 0. In the ideal case, 
recording with an NRZ waveform will result in 
identical marks that have the same length propor- 
tional to one channel bit time. Neighboring data 
marks can have different center-to-center spacings, 
whereby the center of a mark represents a channel 
binary “one.” The modified version of this scheme 
known as NRZI (non-return-to-zero-inverted) 
conforms much better to the EFM as here a 1 cor- 
responds to a transition while a 0 is represented 


by no transitions at all. An NRZI waveform will 
induce data marks and intervening spaces that 
have variable discrete lengths. Here, the appear- 
ance of a binary “one” corresponds to an edge of a 
recorded mark and both the mark length and the 
spacing between two successive marks are given by 
the number of binary “zeros” between two ones. 
Using such a scheme, a single data mark can con- 
tain more bits than only one as in a simple coding 
scheme where each pit is allotted one bit. 

In summary, the optical resolution limit, 
known as the “Abbe barrier,” has dictated prog- 
ress and development among three generations of 
optical disk technology. The approach to higher 
data densities was straightforward—reducing the 
size of data marks by shortening the wavelength 
and by increasing the NA. Table 13.3 summarizes 
the optical specification, physical data structure, 
and storage performance of three optical disk stan- 
dards. In its third generation, based on a 405 nm 


Table 13.3 Three generations optical disk technology: optical specification, physical data structure, and 


storage performance 


cD 
Wavelength 780nm 
NA 0.45 
Minimum pit size 0.83 pm 
Minimum mark length 0.83 pm 
Track pitch 1.6um 
Storage density 1 bit/ pm? 
Data rate 1 Mbps 
Storage capacity 700 MB 


DVD Blu-ray 
650nm 405 nm 

0.6 0.85 
0.4m 200 nm 
0.441m 150 nm 
0.74um 320 nm 

5 bits/pm? 20 bits/p1m? 

5 Mbps 50-200 Mbps 
4.7 GB 27 GB 
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Figure 13.16 Novel approaches in optical storage to overcome the limitations of the surface storage 
by pits in CD/DVD as well as MO disk technology, including optical storage beyond the resolution 
limit by NFR and SIL, frequency-/time-domain optical storage by spectral hole burning and photon 
echo memories, multilayer storage within transparent materials—two-photon and fluorescent 
memories, page-oriented holographic memories, and bit-oriented microholographic disk. 


laser, optical disk storage definitively encounters 
the physical limits imposed by diffraction of light. 
Meanwhile, new techniques that use more than the 
surface of a flat disk are under investigation. 


13.4 NOVEL APPROACHES IN 
OPTICAL STORAGE 


The acceleration of processor speeds and the evo- 
lution of new multimedia and Internet applica- 
tions are creating an almost insatiable demand for 
high-performance data storage solutions. Storage 
requirements are growing at an exponential rate 
encouraged by immense technological advances, 
which have been achieved in recent decades. 
An ever-increasing amount of digital informa- 
tion is to be stored online, near-line, and off-line 
combining magnetic, MO, and optical systems. 
Rapidly growing demands are distributed through 
the data storage hierarchy, where diverse tech- 
nologies are combined in complimentary way to 
satisfy specific requirements in different applica- 
tion environments. The usual computer applica- 
tions are served mainly by hard disk drives but 
emerging consumer applications that combine 
audio, video, 3D image, and computer data files 
are creating an important category with sub- 
stantially different requirements. Optical disk 
technology has established itself as a mainstream 
product provider for audio, video, and computer 


storage. The extraordinary success of recordable 
and rewritable disk formats based on CD and 
DVD technology has opened new prospects but 
also new requirements. Advanced storage of digi- 
tal contents requires both higher storage capacity 
and fast data transfer. Although the optical disk 
technology with its three generations satisfy stor- 
age demands of the entertainment and content 
distribution industry, novel application areas 
make essentially new technologies necessary. 

In the future, optical data storage is expected to 
follow two directions to improve capacity and per- 
formance of disks that are currently available. The 
straightforward way predicts the further increase 
of the areal storage density by surpassing the limit 
imposed by the diffraction of light. Storage tech- 
nologies that use only the surface of a medium 
for writing and reading are constrained to this 
direction. On the other hand, optical information 
storage uses laser-material interaction effects for 
recording and retrieval so that an entire spectrum 
of different optical phenomena can be applied to 
realize an optical memory. Developments in the 
field of nonlinear optical materials that exhibit 
strong laser-induced changes of their optical prop- 
erties enable various novel approaches to become 
practically realizable. Using nonlinear optical 
effects, advanced technological solutions for opti- 
cal storage may take advantage of new spatial and 
spectral dimensions (Figure 13.16). 


13.4 Novel approaches in optical storage 527 


13.4.1 Beyond the resolution limit 


The traditional approach for increasing the areal 
density that has driven progress in data stor- 
age is to decrease the bit size. In optical storage, 
the attainable data density is largely determined 
by the size of the focused laser spot. A power- 
ful way to surpass the density limit imposed by 
the diffraction of light in optical data storage 
is the usage of near-field optical recording. The 
creation and detection of pit marks smaller than 
predicted by the diffraction barrier can be real- 
ized by numerous near-field optical techniques. 
For conventional optical systems, the achiev- 
able spot size governed by the diffraction law 
is <A/2NA. The resolution limit, known as the 
“Abbe barrier,” was empirically discovered and 
named for the German physicist, Dr. Ernst Abbe, 
best known for his work in optics in the 1860s. 

Recent progress in near-field optics has 
resulted in effective spot sizes smaller than 1/20 
of the wavelength of light. To achieve such a high 
resolution, an aperture smaller than the resolu- 
tion limit is placed between the light source and 
the medium. Light passing the aperture consists 
of propagating and evanescent waves; the smaller 
the aperture, the larger the fraction of evanescent 
field. The evanescent wave intensity decreases 
exponentially outside of the aperture; therefore, 
when the aperture-to-sample distance decreases, 
the evanescent power increases, and the resolution 
improves. If the aperture-to-medium distance is 
much less than the wavelength, the resolution will 
be determined by the aperture size rather than by 
the diffraction limit. Diverse techniques for near- 
field optical recording have been proposed mak- 
ing readout of subwavelength structures possible. 
However, a serious disadvantage for most of NFR 
techniques using small aperture is low optical 
efficiency. Although near-field optical recording 
can provide extraordinarily high areal densities, 
it is difficult to satisfy requirements on high data 
transfer while maintaining a working spacing of 
less than a wavelength. 

Scanning near-field optical microscopy 
(SNOM) makes it possible to overcome the dif- 
fraction limit of conventional far-field optical sys- 
tems by placing the pickup head very near (about 
50 nm above) the media. In near-field micros- 
copy, optical resolution beyond the diffraction 


limit is achieved by scanning a surface with the 
evanescent field behind a nanometer aperture. 
A small distance is necessary because the light 
field is confined only in the near field of the aper- 
ture. The technique can produce spots as small as 
40nm in diameter and conceptually can achieve 
areal densities in the order of 100 GB in.-?. SNOM 
technology provides high areal densities but until 
now readout speed is low because the scanning 
process is very slow due to the small light power 
behind the aperture. Furthermore, the probe 
must be in near contact with the medium, mak- 
ing it difficult to prevent head crashes and support 
removable media. Another technique uses a met- 
alized tapered optical fiber, in the end of which 
is a small aperture [3]. The tip of a fiber, which 
is smaller than the wavelength of the recording 
light, is positioned within 10nm to the sample. 
This approach has been already used to write and 
detect 60nm diameter MO domains. However, 
the tapered fiber approach also suffers from very 
low optical efficiency. 

A technique that might solve the trade-off 
between an extremely high resolution and practi- 
cal system implementation is based on the solid 
immersion lens (SIL). The principle of the SIL is 
that by focusing light inside a high refractive index 
glass where the propagation speed is slow, the spot 
size can be reduced below the minimum achievable 
spot size in air [4,5]. The SIL reduces the actual spot 
size by both refracting the light rays at the sphere 
surface and by having an increased index of refrac- 
tion within the lens. The hemispherical glass of 
refractive index n receives the rays of light at nor- 
mal incidence to its surface (Figure 13.17). These 
are focused at the center of the hemisphere to form 
a diffraction-limited spot that is smaller by a factor 
n compared to what would have been in the absence 
of the SIL. That becomes obvious if we consider the 
minimum achievable spot size which is given by 


A 


dimin = 0.61 4g 
nsin® 


where 0 is the aperture angle of the focusing lens. The 
SIL allows the aperture angle to be increased also. An 
increase of NA from 0.6 is typical for conventional 
optical disk systems to 0.95, and a refractive index 
change from 1.0 to 2.2 would result in a spot size of 
0.2 um at the flat surface of a SIL for light at 670nm. 
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Figure 13.17 SIL for near-field optical storage. The light is focused internally in a semispherical lens 


onto the flat surface. 


Although SILs cannot produce spot sizes 
as small as tip fiber apertures can, they have 
the advantage of a substantially higher optical 
throughput. Another advantage is that an SIL 
can be with ease integrated in any conventional 
system configuration as an addition to the objec- 
tive lens. The application of the SIL also requires 
an extremely short working distance of the lens 
to the recording layer—about 100 nm—as well 
as a very thin protective layer. Flying heads as 
in hard disk storage can be used to provide such 
a distance, which is maintained over a disk that 
rotates rapidly. The SIL combined with short- 
wavelength lasers should enable the technique 
to reach areal densities of more than 10 GB in.?. 
To date, implementations of NFR with SIL are 
planned in MO storage. The first NFR products 
announced will have a storage capacity of 20 GB 
per disk. 


13.4.2 Frequency-/time-domain 
optical storage 


Looking for solutions to overcome the restrictions 
of two-dimensional optical storage systems, vari- 
ous approaches have been explored that use one 
further dimension in spatial, spectral, or time 
domain. Novel technologies, such as holographic 
storage, two-photon or fluorescent memories, per- 
sistent spectral hole burning (PSHB), photon echo 
memories, etc., are at various stages of develop- 
ment. Opening a new dimension in addition to 
the two-dimensional surface of a storage medium, 
they have the potential to improve tremendously 
both capacity and data transfer rates of optical 
storage systems. 


Frequency- or time-domain optical storage 
techniques adhere to two-dimensional media but 
open one additional dimension in frequency or 
time domain. PSHB takes a step ahead of one- 
bit-per-spot memories, allowing multiple bits to 
be written, erased, and rewritten in a single loca- 
tion. High densities become possible because the 
diffraction limit does not limit PSHB memories to 
the recording of a single bit per spot, as it does in 
conventional optical data storage. In a PSHB mate- 
rial, it is possible to discriminate many different 
spectral addresses within a single 3 volume. 

Frequency-domain optical storage based on 
PSHB involves burning “holes” in a material’s 
absorption band [6,7]. The ideal material has 
many narrow, individual absorption lines that 
form a broad absorption band. Practical record- 
ing in PSHB media may be done as an extension of 
conventional holography with the difference that, 
instead of a single wavelength, a large number of 
independent spectral channels may be used. This 
number ranges from 10‘ to more than 10’, depend- 
ing on the material. For writing information, a 
frequency-tunable laser is focused on a single spot 
scanning down in wavelength to induce transition 
from one stable state to another in an absorbing 
center. As a result, there will be holes at certain 
frequencies that correspond to the presence of the 
written bits. PSHB using the frequency domain 
promises storage densities up to 10° bits pm, 
which could be achieved with 10° absorbing cen- 
ters, that is, spectral holes per diffraction limited 
laser spot. 

Another PSHB storage method, the so-called 
photon echo optical memories, is based on time- 
domain storage [8,9]. Time-domain hole burning 
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also uses spectral holes for data storage, but relies 
on coherent optical transient phenomena. Time- 
domain storage is realized by illuminating an 
inhomogeneously broadened material by two 
temporally separated resonant optical pulses. The 
first (reference) pulse creates an optical coherence 
in the material. The second pulse is temporally 
encoded with data and interferes with the optical 
coherence created by the reference pulse resulting 
in a frequency-dependent population grating. The 
recorded information is read out by illuminating 
the storage material with a read pulse (identical 
to the reference pulse) which generates a coherent 
optical signal having the same temporal profile as 
the data pulse. 

The maximum number of bits that can be stored 
in a single spot using PSHB is given by the ratio 
of the inhomogeneous Aw,, and homogenous 
A, absorption line width of the storage material. 
This ratio can range up to 10’ in some materials. 
Because of the large number of frequency channels 
available, high storage densities may be possible 
with reasonable laser spot sizes. Even though both 
of these methods have advantage of increased stor- 
age density (2Mbit/laser spot), their application 
capability is significantly limited by the operating 
temperatures which should be kept extremely low 
around liquid helium temperature. In particular, 
present research efforts concentrate on achieving 
room temperature hole burning with novel materi- 
als, but there are still a number of technical chal- 
lenges to overcome before PSHB becomes viable 
for data storage. 


13.5 THREE-DIMENSIONAL 
OPTICAL INFORMATION 
STORAGE 


In the present optical storage systems, one- 
dimensional serial information is stored in a two- 
dimensional medium. In three-dimensional optical 
memories, three independent coordinates specify 
the location of information. Multilayer [10-14], 
holographic [15-17], microholographic [18-23], and 
multidimensional [24-26] approaches are expected 
to exploit the entire volume of a storage medium. 
In addition to the spatial dimensions, optical mul- 
tiplexing techniques allow superimposing multiple 
states that differ in wavelength, angle of incidence, 
phase, or polarization. 


Optical storage in the form of holographic 
volume gratings has been investigated during the 
past three decades as a straightforward approach 
to realize three-dimensional high-density mem- 
ories. Beside diverse holographic techniques, 
alternative solutions for three-dimensional opti- 
cal memories are also under investigation. These 
include the extension of present disk systems to a 
layered format but also various new concepts of 
multilayer optical memories. In this case, the third 
dimension is introduced by recording the data in 
multiple layers through the thickness of a volumet- 
ric storage medium. 


13.5.1 Multilayer optical information 
storage 


The simplest way to use the third dimension of a 
storage medium is multilayer storage. Using mul- 
tiple data layers instead of one, the overall storage 
capacity will grow linearly with the number of 
layers. Multilayer recording is a simple approach 
to higher density in optical data storage. Storing 
the data in multiple layers the third dimension of 
a disk, its depth, becomes useable for the optical 
disk technology and the overall data capacity lin- 
early grows with the number of layers. A standard 
optical drive can easily address different depth 
positions while confocal filtering widely reduces 
interlayer cross talk. The success of dual-layer 
DVD-ROM has attracted interest but in conven- 
tional optical systems based on readout from the 
reflective layer, the multilayer approach has only a 
moderate potential to increase the storage capac- 
ity. It implies the costly production of multiple 
layer disks with rapidly decreasing tilt and flatness 
tolerances. Also there is a fundamental physical 
trade-off between the recording layer reflection (to 
generate the readout signal generation) and trans- 
mission (to optically access each data layer of a 
multilayer stack). The number of layers per side of 
a disk is limited strongly by higher optical power 
requirements, interlayer cross talk, and aberrations 
that appear while focusing on several layers at dif- 
ferent depths simultaneously. 

Optical recording in many layers primarily 
requires a homogeneous, low-absorption record- 
ing material while reflection mode is favored for 
confocal implementations of a standard opti- 
cal pickup. Combined with other recording 
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techniques, multilayer approach can become more 
attractive. In the case of two-photon or fluorescent 
memories that use transparent materials as storage 
media, the number of layers can become very large. 
Such quasi-three-dimensional optical memories 
use the volume of a storage medium by recording 
the data as binary planes stacked in three dimen- 
sions. The data are stored by discrete bits in the 
plane, but also through the volume (Figure 13.18). 
Following the experimental advances in media 
and system concepts made in the last years, optical 
recording by two-photon excitation in photochro- 
mic as well as photorefractive media became very 
attractive as an alternative for three-dimensional 
optical memories. The modulation of the record- 
ing material is usually localized by using focused 
Gaussian beams. The laser beam induces a photo- 
chemical change in the focal range. Selectivity in 
addressing individual bit locations, that is, volume 
elements, is typically increased by nonlinearities of 
the laser-material interaction. Diverse nonlinear 
optical effects can be used for recording but the 
requirement for strong multiphoton absorption 
limits their practicability. Typical media sensitiv- 
ity requires a strong laser-driven photon injection 
to initiate a chemical change in the medium. Upon 
excitation, a photochemical reaction leads to a 
permanent or reversible change in photochromic 
compounds of the medium. Chemical mechanisms 
such as photobleaching, photoisomerization, 
polymerization, and decomposition, have been 


investigated for nonlinear optical recording. 
Depending on the photochemical reaction, the 
optically detectable change allow for permanent, 
reversible or “gray scale” recording where multiple 
gray levels can be addressed in a single data mark 
volume element. 

Among nonlinear optical phenomena, two- 
photon absorption is particularly an attractive 
effect for multilayered optical recording. Two- 
photon excitation refers to the simultaneous 
absorption of two photons, whereby the excitation 
rate for this process is proportional to the square 
of the writing light intensity. Therefore, the excita- 
tion remains confined to the focal volume corre- 
sponding to the intensity distribution of a focused 
Gaussian beam. The basis of a two-photon record- 
ing system is the simultaneous absorption of two 
photons whose combined energy is equal to the 
energy difference between initial and final states of 
the recording material. This simultaneous absorp- 
tion results in a structural phase transition that is 
reversible and detectable by measuring the fluo- 
rescence of the material. The read beam is unab- 
sorbed and passes through the unwritten areas of 
the material while the recorded data marks will 
cause the absorption of the readout beam exciting 
the fluorescence at a longer wavelength. 

A variety of materials have been proposed 
for two-photon recording. The most important 
material requirements concern the photochro- 
mism, that is, the ability to change the chemical 
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Figure 13.18 3D optical data storage disk. The disk medium is highly transparent so that the laser 
light can address many layers of information. Nonlinearity in the optical medium response localize 
the light-matter interaction to the high-intensity focal region. 
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structure under light excitation, fluorescence in 
one of two chemical states, stability of both states 
at room temperature, etc. Although two-photon 
excitation is the strongest multiphoton absorption 
mechanisms, it is still a very weak optical effect. 
Therefore, research efforts focus on chromophore 
components possessing high cross section for 
two-photon absorption. 

A typical system involves two beams that are 
called data beam and address beam as shown in 
Figure 13.19. The data beam at 532 nm is modu- 
lated with a spatial light modulator (SLM) and 
focused at a particular plane within the medium. 
The addressing beam at 1064 nm provides the sec- 
ond photon required for the two-photon excita- 
tion. The data are written in the overlap region of 
the two beams and then read out by fluorescence 
when excited by single photons absorbed within 
the written bit volume. Hereby, the data beam is 
blocked and the read beam at 532 nm is focused 
to reconstruct the selected data page within the 
volume. The readout plane is then imaged onto a 
charge-coupled device (CCD). The spot size is lim- 
ited by the recording wavelengths through diffrac- 
tion. The approach promises not only ultra-high 
effective areal density but also parallel access to the 
stored data [27,28]. 

Besides the necessity for relatively high excita- 
tion energy, another critical issue in the realization 
of fluorescence-based multilayer memories is opti- 
cal selectivity of readout. The read beam addressing 
an individual data mark simultaneously illuminates 
many other bits located at different layers. The read 
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beam light interaction with the medium can alternate 
or completely erase the written data. In addition, 
the mostly linear interaction involves a number of 
data marks and contaminates the readout. Optical 
resolution in 3D can be achieved by nonlinearity 
(e.g., two-photon microscopy) or by spatial filter- 
ing of the response (e.g., confocal laser microscopy). 
Detection of refractive index differences between the 
written marks and surrounding medium provides 
nondestructive readout but requires costly equip- 
ment such as phase-contrast microscope or confocal 
reflection microscope. Alternatively, nondestructiv- 
ity can be achieved by using different absorption 
bands for recording and reading. 

Advances in material science and technology 
allow the design and development of novel mate- 
rials for optical recording. Tailored fabrication 
of nanostructured optical materials enables new 
recording and readout techniques. Based on mul- 
tilayer memory concept, five-dimensional opti- 
cal storage has been demonstrated in a recording 
material consisting of a plastic layer doped with 
plasmonic gold nanorods [26]. The longitudinal 
surface plasmon resonance (SPR) of gold nanorods 
is exploited to achieve a wavelength and polar- 
ization selectivity, whereas multilayer recording 
allows addressing three spatial dimensions. The 
readout mechanism is SPR mediated two-photon 
luminescence, which exhibits a high wavelength 
and angular selectivity necessary for crosstalk free 
readout. 

Beside two-photon excited fluorescence, further 
optical techniques have successfully been used for 
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Figure 13.19 3D two-photon multilayer recording. 2D binary data planes recorded by two-photon 
absorption are stacked in the depth of a transparent storage medium. Readout relies on 


laser-excited fluorescence. 
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3D multilayer data storage. These include micro- 
holography and 3D submicron-resolved structur- 
ing of the medium by high-intensity laser pulses. 


13.5.2 Holographic data storage 


Three-dimensional optical information storage by 
volume holograms has been proposed first in 1963. 
One of the unique characteristics of optical volume 
storage is the very high bit packing density that can 
be attained. The ultimate upper limit of the stor- 
age density in three-dimensional storage media is 
of the order of 1/A3, compared with 1/2? for surface 
of two-dimensional optical storage media. This 
results in 10'2-10" bits cm™, although the practical 
limit set by other parameters of the optical system 
and by the constraints of the recording material 
may be lower than this. 

Three-dimensional optical storage systems may 
generally be classified as bit oriented and page ori- 
ented. In holographic page-oriented memories, 
the information associated with stored bits is dis- 
tributed throughout the whole volume of the stor- 
age medium. In bit-oriented memories, each bit 
occupies a specific location in three-dimensional 
space. Various approaches to realize a three- 
dimensional optical memory by bit-oriented stor- 
age have already been presented not only including 
holographic but also nonholographic. Such storage 
methods are based in general on a point-like or 
bit-by-bit three-dimensional recording by creating 
small data marks within the medium. Hereby, data 
marks represent single bits and are defined by large 
contrast in some optical property of the storage 
material. Using a nonlinear optical response of the 
material, the optical interaction can be confined to 
a focal volume. With a diffraction-limited, focused 
light beam, the physical size of bits can become as 
small as the wavelength of the laser beam in all 
three dimensions. Owing to the submicrometer or 
dimensions of a single bit, bit-oriented optical data 
storage requires very strict tolerances for the focus- 
ing optics and recording alignment. 

In contrast to three-dimensional multilay- 
ered optical memories, in holographic storage, 
the information is recorded through volume. 
Recording is accomplished by interfering two 
coherent laser beams, the information-bearing 
signal beam and the reference or address beam. 
The resulting intensity pattern is then stored in a 
photosensitive material by inducing a grating-like 


modulation of its optical properties such as refrac- 
tive index or absorption coefficient. The data are 
reconstructed by diffracting the address beam at 
the induced grating. A unique characteristics of 
thick holographic gratings is the Bragg selectiv- 
ity, which allows many holograms to be stored 
overlapping by applying appropriate multiplexing 
methods. Compared to the conventional recording 
schemes, multiplexed optical recording provides 
an entirely new approach to higher information 
densities. Multiple individually addressable holo- 
graphic gratings can be stored in the same volume, 
representing multiplexed data bits or pages. 


13.5.3 Page-oriented holographic 
memories 


Holographic memories usually store and recall the 
data in page format, that is, as two-dimensional bit 
arrays, which offers the way to realize high data 
rates and fast access. Combined with multiplex- 
ing, the inherent parallelism of holographic stor- 
age can provide a huge increase in both capacity 
and speed. For more than 30 years, holography has 
been considered as a storage approach that can 
change standards and prospects for optical storage 
media in a revolutionary manner. Depending on 
a number of supporting technologies, holographic 
memories became realizable with advances in pho- 
tonics technology, particularly with improvements 
in liquid crystal modulators, CCDs, semiconduc- 
tor detectors, and laser sources. Ongoing research 
efforts have led to impressive advances [17]. The 
first completed working platforms demonstrated 
high storage densities of more than 300 bits/pm?, 
but they are still far from commercialization. 

In contrast to the conventional optical record- 
ing where an individual data bit is stored as 
localized change in some optical property of two- 
dimensional storage media, holographic recording 
allows to store the data page-wise in the volume 
of the material. Instead of storing one single bit at 
each location, large data pages can be recorded and 
read out at once. The information to be stored is 
first digitized and then loaded onto an SLM as a 
two-dimensional pattern of binary ones and zeros. 
The SLM imprints that binary data page to the sig- 
nal beam. The data are recorded by intersecting the 
signal beam with a reference beam inside the stor- 
age medium (Figure 13.20). The three-dimensional 
interference pattern induces a corresponding 
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Figure 13.20 Page-oriented holographic storage. A two-dimensional data pattern created by SLM 
is stored by interfering signal and reference beam to record the hologram. One whole page is 
written at once, many pages overlap in the same volume. One page from many is read out with the 


corresponding reference beam. 


spatial modulation of the refractive index of the 
recording material. Consequently, the data are 
stored as a refractive index grating representing a 
phase volume hologram. 

As in page-oriented holographic storage, each 
data bit is distributed in three dimensions through 
the recording medium, there is no direct correla- 
tion between the data bit and a single volume ele- 
ment in the recorded structure. The stored data are 
retrieved by introducing the same reference beam 
used to record it, and read out by projecting the 
reconstructed signal beam onto the output detec- 
tor array for optic-to-electronic conversion. 

Due to the Bragg selectivity of holographic vol- 
ume gratings, it is possible to store several holo- 
grams overlapping in the same volume element 
by either changing the angle or the wavelength 
of the reference beam. Such overlapping record- 
ing of multiple holograms in one single position 
is referred to as holographic multiplexing. In a 
multiplexing scheme, holographic structures are 
mixed together whereby the Bragg effect allows 
retrieval of an individual data page while minimiz- 
ing cross talk from other pages stored in the same 
volume. The diffraction efficiency varies according 
to the mismatches in angle or wavelength between 
recording and readout. Deviations from the Bragg 


condition lead to a rapid decrease in diffraction 
efficiency, which makes a selective reconstruction 
of multiple holographic gratings possible. 

Various multiplexing methods have been pro- 
posed in diverse holographic storage systems. 
Wavelength and angle multiplexing result directly 
from the Bragg-selective character of thick volume 
holograms. The addressing mechanism here is the 
wavelength or angle of incidence of the reference 
beam. Both methods allow many holograms to 
be recorded in the same position but their practi- 
cal impact is limited. Angle multiplexing requires 
complex optics, and the crucial component for 
efficient wavelength multiplexing is a laser light 
source that should be tunable in a sufficiently 
wide range. Such lasers are available but complex 
so that a significant increase of storage density by 
wavelength multiplexing only is difficult to realize 
in practical systems. New methods such as peris- 
trophic or shift multiplexing have been developed 
for holographic storage systems which use a disk- 
shaped medium instead of a photorefractive crys- 
tal cube. Peristrophic multiplexing is based on the 
rotation of the plane wave reference beam around 
the optical axis; during the readout, the recon- 
structed holograms follow the motion of the refer- 
ence beam. The address of an individual hologram 


534 Optical information storage and recovery 


is the relative rotational position of the storage 
medium. Shift multiplexing relies on a spherical 
wave as the reference beam while the signal beam 
is still a plane wave. Such spherical holograms can 
be shifted relative to the reference and stored next 
to another with a shift distance below 10pm. The 
relative displacements are small enough that holo- 
grams in subsequent locations overlap significantly. 

Volume holography is a powerful approach 
for digital storage systems with high densities 
(2100 bits/m?) and fast transfer rate (=Gbitps). 
However, the practical realization of holographic 
memories suffers from the lack of suitable storage 
media. Both the performance and viability of sys- 
tems under development are significantly limited 
by the characteristics of the available materials. To 
date, the requirement of adequate storage materi- 
als has been one of the most crucial aspects in the 
development of holographic memories. Indeed, 
there are very rigorous demands on storage materi- 
als, which should be satisfied to realize holographic 
memories as competitive, reliable optical storage 
systems of improved performance. The search for 
an optimum material to be used in holographic 
data storage does not appear to be finished yet. An 
ideal recording material should be of high optical 
quality, it should be able to hold the recorded data 
for a long time, and, for commercial applications 
and should be very reliable and not too expensive. 
Considering the physical processes, one can define 
a number of parameters to be controlled. The most 
important for the viability of holographic storage 
are high resolution (>3000 lines/mm), high pho- 
tosensitivity, large dynamic range (ie., diffrac- 
tion efficiency of multiplexed data pages), archival 
storage time, low absorption, and low scatter. In 
particular, a large dynamic range of the storage 
material is necessary to allow multiplex recording 
of many holograms in the same volume. 

Many kinds of materials have been investigated 
as holographic storage media. With sufficient 
material development efforts, the necessary opti- 
cal quality has been achieved for both inorganic 
photorefractive crystals and organic photopolymer 
media. Photorefractive crystals such as lithium 
niobate, barium titanate, and strontium barium 
titanate were used previously in holographic sys- 
tems. In the last years, a new class of photosensi- 
tive polymers has been introduced to satisfy the 
demand on adequate materials for holographic 
storage. Depending on the recording material, 


different optical system architectures have been 
developed for holographic memories. Holographic 
storage media can be classified in two categories: 
thin photosensitive organic media and thick inor- 
ganic photorefractive crystals. Thin media (a few 
hundreds micrometers thick) are most sui for 
transmission type architecture using a variety of 
shift or phase multiplexing techniques, while angu- 
lar multiplexing in various modifications is usually 
applied in thick media (about centimeter thick). 
Typical system architecture in this case is based on 
the 90° geometry, whereas thin phototable poly- 
mer layers are often used in a disk-based configu- 
ration. In this concept, digital holographic pages 
are stored on a disk-shaped medium and organized 
in tracks similar to those on conventional optical 
disks. The disk can rotate continuously and it can 
also move across tracks to allow the optical head 
to access the entire area of the medium. The stor- 
age medium is typically an organic photopolymer 
layer sandwiched between two glass plates. 


13.5.4 Bit-oriented holographic 
storage 


An alternative to page-oriented holographic 
memories is three-dimensional, bit-oriented opti- 
cal storage on a holographic disk medium. The 
microholographic approach combines the bit- 
wise storage of CD/DVD and holographic volume 
recording, which makes it possible to advance the 
capabilities of conventional disk technologies by 
implementing spatial and wavelength multiplexing 
[18,19]. Microholography expands surface storage 
into three dimensions by storing the data as micro- 
scopic volume gratings instead of pits. Holographic 
recording is realized within a system that is in its 
main features very similar to the recordable CD 
or DVD systems. The technology provides volume 
storage of information while the areal structure of 
the stored data remains comparable with pits on a 
DVD. The microholographic storage concept ben- 
efits from both technologies: The bit-oriented stor- 
age allows for using many solutions of the highly 
developed CD/DVD technology. In addition, holo- 
graphic recording offers a path to overcome the 
limitations of this technology, which are related to 
its two-dimensional nature. 

In contrast to binary page-wise holographic 
data storage, the microholographic approach 
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capitalizes on its fundamental compatibility with 
the established optical disk technology. The data 
are stored holographically in three dimensions but 
bit-wise in tracks and layers similar to those of a 
standard optical disk. The reflectivity of a photo- 
sensitive material, typically a holographic record- 
ing photopolymer, is locally varied by recording 
submicron-sized reflection gratings. Cross talk, 
typical in page-wise recording, is eliminated by the 
bit-wise nature of the recording and readout pro- 
cess: At any point in time, the focused light beam 
illuminates only one microholographic bit feature. 
Multiplexing techniques open additional paths 
to storage densities beyond the resolution limit 
imposed on 2D optical data storage. 

For data recording, microscopic reflection grat- 
ings are holographically induced to vary the reflec- 
tivity of the disk by diffraction (Figure 13.21). The 
laser beam is focused into the photosensitive layer 
and reflected back with the mirror. The interfer- 
ence pattern of the incident and reflected beam 
induces a grating-like modulation of the refractive 
index of the storage medium. To retrieve the stored 
data, the original signal beam is reconstructed by 
reflection of the read beam at the induced grat- 
ings. Recording with sharply focused laser beams 
results in localized volume storage. The micrograt- 
ings can be packed very densely and arranged in 
tracks similar to those on a CD. When the disk 
is rotating, microgratings of variable length are 
induced dynamically whereby grating fringes are 
extended in the motion direction. 

The storage system (Figure 13.22) is very similar 
to the conventional optical drives: A focused laser 
beam is used for writing and reading, the data are 
stored bit-wise in tracks on the rotating disk, and 
similar systems for automatic focusing and track 
following are needed to control the position of the 
laser beam focus on the rotating disk. All these 
common aspects simplify the practical realization 
of the microholographic system as many compo- 
nents developed for the CD/DVD technology can 
be directly used or adapted for this purpose. The 
main difference here is the reflecting unit under- 
neath the disk that is needed for writing. During 
readout, this unit, in the simplest realization an 
aspherical mirror, can be tilted or removed so that 
only reflection from the gratings will be detected. 

The physical structure of the stored data is simi- 
lar to the pit-land structure on a CD/DVD as stripe- 
shaped microgratings are written dynamically 


while the grating length corresponds to a coded 
bitstream. Multiplexing is integrated parallel into 
the coding scheme in order to maximize writing 
and reading speed. Each wavelength/layer channel 
corresponds to standard data channels in CD/DVD 
systems so that recording and error-correction 
code algorithms evolved in conventional tech- 
nologies can be implemented. Being fundamen- 
tally a matching of the optical disk technology 
and holographic volume recording, the microho- 
lographic approach concomitantly offers reflective 
bit features and confocal optical pickup design, as 
well. In addition, photopolymer materials used as 
recording media allow a simple disk design with 
a monolithic photopolymer layer sandwiched 
between two substrates. Photopolymers developed 
for holographic storage possess the characteristics 
essential for multilayer recording including large 
thickness of up to 1mm, low absorption for writ- 
ing, sufficient refractive index change, and negli- 
gible absorption for reading. 

Multilayer recording is a simple method of spa- 
tial multiplexing that relies on the depth localiza- 
tion and selectivity of holographic microgratings. 
Although multilayer recording in conventional 
optical disk technology requires a stack of physi- 
cally distinct layers, the microholographic method 
allows a simple disk design consisting of a single 
homogeneous photopolymer layer. As photopoly- 
mers are substantially optically transparent, many 
microholographic layers can be stored through the 
depth of a disk. Starting from data densities com- 
parable to DVD or Blu-ray, the microholographic 
multilayer approach targets the terabyte capacity 
range. At the same time, it allows a cost-effective 
and downward compatible technology imple- 
mentation as the drive system has most optical 
and optoelectronic components in common with 
a standard optical drive. Multiple data layers are 
addressed by simple confocal movement to differ- 
ent depths within the photopolymer layer. 

In addition to multilayer recording, the appli- 
cation of wavelength multiplexing would allow a 
linear increase of the storage density and also of 
the write/read rate with the number of wavelength 
used. Wavelength multiplexing can be realized by 
simultaneous recording of several gratings in the 
same position with write beams of different wave- 
lengths. In this case, a complex periodical grating 
structure is induced that contains all single-color 
gratings. Due to the wavelength selectivity, each 
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Figure 13.21 Microholographic recording in retroreflector configuration. (Top) The optical recording 
configuration is based on a single beam path: The “second” write beam is created by retroreflection 
to fully overlap with the incident write beam. Two identical objectives with high NA are adjusted to 
image the focal points of the incident and reflected beam at the same storage location. The grating 
formation takes place in the joint focal region of the two beams when a photosensitive polymer is 
exposed to their interference. (Middle) The interference pattern is plotted in the logarithmic scale to 
display the wave fronts. (Bottom) Driven by the linearity of the photoresponse, the index modulation 
mirrors the intensity distribution. With the given optical specification for high-density recording in 
the violet spectral range (A=405 nm, NA=0.75), microgratings effectively consist of less than 10 
grating fringes. 
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Figure 13.22 Optical write/read system for multilayer microholographic data storage. Essential 
features of the optical configuration primarily are single-beam path, diffraction-limited high-NA 


focusing, retroreflection, and confocal filtering. 


laser beam with a certain wavelength detects only 
the corresponding grating during readout. The 
data rate is increased since all bits contained in 
one multiplex grating can be recorded and read in 
parallel. 

The experimental results show that resolution- 
limited optical storage is also possible in 3D with 
submicron-sized volume gratings representing 
individual bit features [23]. The storage perfor- 
mance is governed by the interaction between 
the focused laser beam and photopolymer mate- 
rial. The micrometer-scaled depth localization of 
microgratings is the basis for multilayer recording. 
Depth multiplexing of microgratings recorded has 
been demonstrated by recording up to 75 layers 
spaced by 4um [29]. These recordings achieved 
without any SA compensation evidence the poten- 
tial of microholographic data storage to capital- 
ize on multilayer recording for data capacities far 
beyond the state of the art. Furthermore, manu- 
facturing technology can provide low-cost, remov- 
able media, and downward compatible systems. 
The key issues in the design and development of 
the optical system are 3D microlocalization of 
recording, selectivity, and sensitivity of readout. 


Much research fails to recognize the importance 
and implications of the storage medium for sys- 
tem design. The microholographic disks may be 
successful as removable storage media that satisfy 
high-capacity demands in specific areas including 
data banks, archiving and security systems, image 
processing, and multimedia applications. 


13.6 POTENTIAL IMPACT OF 
NOVEL TECHNOLOGIES 


Optical systems for recording and retrieval of 
digital information represent a rapidly developing 
field with a huge potential to encompass entirely 
new applications and to provide solutions to prob- 
lems arising from these applications. Continuous 
advances and discoveries in related technologies, 
devices, and materials have opened an entire spec- 
trum of optical effects and materials that can be 
used to provide writing and reading mechanisms 
in an optical data storage system. 

Trends toward rewritability and higher stor- 
age capacity have moved optical storage into 
competition with high-end magnetic storage. The 
key difference between these two technologies is 
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the removability of optical media but also their 
excellent robustness, archival lifetime, and very 
low cost. An additional advantage of optical tech- 
nology is the stability of written data, a feature that 
makes optical media suitable for archival lifetimes. 
An optical disk can be removed after recording 
and read out in any compatible drive which enables 
data storage separately from the main computer 
system or network. Typical applications range 
from archival storage, including software distri- 
bution, digital photographs and imaging, movies, 
and other video materials. 

For archival storage, many disks are organized 
in an optical library system capable of storing and 
managing large amounts of data. Library systems 
are usually constructed as jukeboxes compris- 
ing hundreds of disks to provide high capacities 
for long-term storage. An important advantage of 
optical archiving systems is that the data are stored 
off-line, which releases computers or networks but 
also provides data security and retrieval even if the 
network is irreparably destroyed. Optical systems 
are already widely accepted in enterprise and insti- 
tutional storage with applications ranging from 
extending existing server capacities to publish- 
ing and image storage. One of the most important 
applications for optical archival storage is docu- 
ment and image management—where documents 
and images such as receipts, X-rays, photographs, 
and other records are stored in the digital form on 
optical disks. Plurality of disks is then arranged in 
a database to facilitate rapid retrieval. 

Optical disk technology across three genera- 
tions has brought optical data storage to perfection 
at the physical limits of resolution and diffraction. 
CD, DVD, and Blu-ray rely on identical record- 
ing and readout processes while the differences 
originate from the format specifications. Despite 
impressive advances and continuous increases in 
storage density, the existing bit-wise, serial-access 
optical storage is far away from realizing the full 
potential of optical technology. With opening 
computer applications such as three-dimensional 
imaging, video mail and server applications 
including, for example, digital libraries, satellite 
imagery, medical document, and image archiving, 
optical storage should ensure capacities exceeding 
a terabyte per disk. Numerous techniques pro- 
vide the ability to achieve high storage capacities 
by a more effective use of the volume of a storage 
medium and/or by taking advantage of additional 


degrees of freedom such as the recording/reading 
wavelength. Moreover, the inherent parallelism of 
optics offers the possibility to record and retrieve 
large data files with extremely high data rates com- 
pared to those achievable by electronics. 

Three-dimensional optical storage by volume 
holography or two-photon recording hold prom- 
ises for high-capacity, high-speed systems. In 
addition, microholographic disks or fluorescent 
multilayer disks that store the data bit-wise as 
length-coded marks can also satisfy the require- 
ments for downward compatibility and low-cost 
media. Figure 13.23 gives a comparison of storage 
densities achievable in different technologies. A 
crucial aspect for the reliability of all these sys- 
tems is the storage material itself. Many types of 
materials have been investigated in recent years 
as optical storage media including inorganic pho- 
torefractive crystals, organic photopolymers, and 
biological systems such as protein bacteriorho- 
dopsin or DNA polymers. Progress in the last 
couple of years has been impressive, particularly 
in the field of photosensitive polymers that offer 
a wide variety of possible recording mechanisms 
including both write-once and rewritable media. 
In particular, a new class of photopolymer materi- 
als has been introduced for holographic storage. 
Optimization and further development of photo- 
sensitive polymeric media will be key to the suc- 
cess of this and other advanced optical storage 
technologies. 

Page-oriented holographic memories hold the 
top of the table, promising terabyte devices and 
Gbit/s data rates, but it is questionable if they 
would be able to compete with the existing opti- 
cal disks in daily life. Holographic storage can 
find important niche applications in professional 
storage, backup and archiving systems, and data 
banks, where large data files have to be stored and 
recalled with fast access. The possibility of asso- 
ciative retrieval enables holographic memories to 
be used as data search engines, that is, content- 
addressable database servers or large web servers. 

In small end-user systems, the requirements 
will rather be governed by the convergence of 
entertainment and computing. From this point 
of view, bit-oriented optical storage offers more 
realistic solutions for the next generation. With 
the availability of adequate storage media, the 
microholographic disk technology might provide 
the successor to the blue DVD generation. The 
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Figure 13.23 Potential impact of novel optical technologies in comparison with achievable areal 
densities in conventional optical storage systems and hard disk systems. 


recording technique can improve performance 
significantly, reaching with ease the threshold of 
100 GB per disk. Such advanced optical disks are 
asked for many applications and are particularly 
attractive for three-dimensional high-definition 
imaging, video and multimedia storage. 

Although recent advances toward commercial 
devices are impressive, it remains to be seen which 
technology will be successful in providing the next 
generation optical storage media. A wide variety 
of materials as well as recording and readout tech- 
niques are under development. The requirements 
of foremost importance in optical information 
storage are high performance in terms of capacity 
and data rates, removability, and compatibility. The 
demand on downward compatibility is to ensure 
the use of universal drives, but also reliability 
and low-cost media. Meeting these requirements 
simultaneously is a major challenge to develop- 
ers of novel optical technologies. Competition 
combined with an immense market potential and 
appetite for storage keeps the field exciting. 
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Optical technology has been developed for highly 
effective transport of information, either as very 
high speed temporal streams, e.g., in optical 
fibers or in free-space, or as in high-frame-rate 
two-dimensional (2D) image displays. There is, 
therefore, interest in performing routing, signal- 
processing and computing functions directly on 
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such optical data streams. The development of 
various optical modulation, display, and storage 
techniques allows the investigation of process- 
ing concepts. The attraction of optical process- 
ing techniques is the promise for parallel routing 
and processing of data in the multiple dimensions 
of space, time, and wavelength at possible optical 
data rates. For example, in the temporal domain a 
1nm wide optical band at a wavelength of 1500 nm 
has a bandwidth of approximately 100GHz, 
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and temporal light modulators with 100GHz 
bandwidth have also been demonstrated for optical 
fiber systems [1,2]. However, notional optical pro- 
cessing techniques can be envisioned that handle 
many such narrow-wavelength bands in parallel, 
and also operate in a combined spatiotemporal 
domain. Employing all domains simultaneously, it 
is theoretically possible to perform spatiotemporal 
routing and processing at an enormously high 
throughput in the four dimensions x, y, t, and A. 
Throughput of 10 samples/s would result from 
simple examples based on feasible modulation and 
display capabilities. In one case a 100 GHz tempo- 
ral modulators can be combined with wavelength- 
selective devices to provide several hundred 1nm 
wide channels at the operating wavelengths of 
existing photodetectors and light sources. A sec- 
ond example would consider that 2D spatial light 
modulator (SLM) devices can be constructed to 
have >10’ pixels/frame (see Chapter 6) and that 
material developments allow optical frame update 
rates on the order of 1MHz [3]. Unfortunately, 
although an optical processor operates on data in 
optical form, it is presently not possible to equate 
these maximal modulation and display rates to 
the expected information-processing throughput 
rates of such processors. There are penalties on the 
throughput due to necessary data pre-processing 
and post-processing in any information-process- 
ing system. These include the need to format and 
condition the input data to a processor, to com- 
pensate for shortcomings of any analogue signals 
(e.g., nonuniformities in space and time), and 
perhaps most importantly, to examine the proces- 
sor’s output data and extract the useful informa- 
tion. The latter is often an iterative process and 
requires fusion with other data processing results. 
An optical processor’s speed advantage could 
be largely negated unless all processing opera- 
tions can be performed at speeds commensurate 
with modulation and display rates. Thus, equally 
important considerations are the need to identify 
those operations that can be effectively performed 
optically, and the need to develop optical process- 
ing architectures that minimize the penalties on 
optical throughput. Because of these consider- 
ations optical information-processing approaches 
have covered a wide range of topics. Therefore, 
we first provide a brief review of the various par- 
adigms that have been investigated in optical 
processing. 


14.1.1 Analogue optical processing 


In the oldest paradigm, optical data in analogue 
form can be manipulated to perform useful func- 
tions. The classic implementation of an analogue 
spatiotemporal processing function is the use of a 
simple lens to produce, at the back focal plane of a 
lens, the complex Fourier transform (both phase 
and amplitude) of optical data at the front focal 
plane [4]. In the most common and simplest con- 
figuration, shown in Figure 14.1 a one-dimensional 
(1D) object with complex transmission, 

t(xo)=a(xo) exp[—j27b(x9)] (14.1) 
where a and b are the amplitude and phase value 
at pixel location x,, is positioned at a distance d in 
front of a lens of focal length f,. Illuminating the 
object with coherent light of wavelength i, one 
obtains at one focal length distance behind the lens 
the amplitude distribution [4] 


Aexp[(j 2/Af,)(1-d/E,)x?] 


Ur(xe) = iA, 
1 


| t(x)exp[-j(27 /(Af,) (xox) ]dx (14.2) 


0 


where L is the spatial extent of t(x,), A is a constant, 
and the subscripts 0 and f are used to denote the 
object and output space, respectively. When 


fi=d (14.3) 
; Fourier transform 
Object plane 
t(xo) 
Xo *f 


|}+——¢—_+ as ——+| 


Figure 14.1 Optical arrangement for Fourier 
transformation with a single lens. 
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aside from the finite spatial limits of the integral, a 
Fourier transform of t(x,) results (to within a con- 
stant multiplicative factor). The 1D example given 
in Equation 14.2 employs a cylindrical lens. Use of 
a spherical lens results in a 2D Fourier transform, 
but since the x and y variables in the exponen- 
tial term of the Fourier transform are separable, 
a spherical lens can be used even for the 1D data 
in Figure 14.1. Alternative lens configurations can 
produce the Fourier transform but require two 
instead of one lens, or produce the correct ampli- 
tude of the Fourier transform but with a curved 
phase front [4]. 

The core idea of Fourier transformation by 
a lens is the basis for many demonstrations of 
single-function optical processors. Electro-optic 
and acousto-optic modulation devices have been 
developed for optical input and output at the focal 
planes. To exploit this powerful concept, other 
mathematical integral transforms have often been 
re-cast as Fourier transformation problems, in 
particular the matched filter operation, or equiva- 
lently the correlation integral 


R= | s(ossie+ t)dt. (14.42) 


According to the convolution theorem, Equation 
14.4a is mathematically equivalent to the Fourier 
transform of the product of the instantaneous 
Fourier transforms of the two signals, S,(@) and 
S,*(@), ie., 


R(t)= S\()S;(@)exp(jat)dw. (14.4b) 


Alternative transform kernels to the Fourier 
Transform have been developed, and implemented 
using computer-generated holograms rather than 
lenses [5]. Processing applications involving mas- 
sive amounts of linear processing have been suc- 
cessfully addressed with prototype analogue 
signal-processing systems that perform temporal 
spectral analysis [6] and correlation [7,8]. However, 
the optical processing performed in these systems 
implementations are linear operations involving 
only multiplication and addition. These analogue 
processors therefore address an application niche 
in signal processing, and because they are ana- 
logue, must contend with accuracy and dynamic 


range concerns. Single-purpose analogue com- 
putation engines, described in the references, 
have been embedded into conventional electronic 
processing systems to accelerate specific signal- 
processing tasks, but not to perform general com- 
putations. It has often been the case that insertion 
of such engines leads to bottlenecks due to the opti- 
cal-to-electronic and electronic-to-optical conver- 
sions, and the incompatibility of optical processor 
speeds with electronic computer limits such as 
memory access time. An approach that potentially 
relieves these bottlenecks involves “all-optical” 
systems whereby several analogue optical process- 
ing modules are cascaded, either via free-space or 
guided-wave optical paths or interconnections. At 
the end of the processing cascade the output data 
rate is presumably reduced to a manageable band- 
width for high-speed electronics. However, one 
problem with all-optical schemes is the need to 
store intermediate processor results and to access 
memory for subsequent processing. Advanced 
optical memories could be employed to enable all- 
optical schemes. Section 14.2.4.3.3 and Chapter 
3.4 discuss advanced optical storage concepts such 
as page-oriented memories and holographic 3D 
memories, using SLMs and materials such as pho- 
torefractives, as described in Section 14.2.3.2. 


14.1.2 Numerical optical processing 


To obtain more general applicability of analogue 
optical processors, various encoding techniques 
have been investigated to improve the dynamic 
range and accuracy of such processors, and to 
overcome limitations of a processor that performs 
only multiplication and addition. Logarithmic 
encoding has been employed to compress dynamic 
range requirements into a smaller analogue volt- 
age range, and to address the difficulty of per- 
forming the division operation optically, i.e, by 
converting division to an addition/subtraction 
problem. Numerical optical processing for poten- 
tial implementation of an optical computer has 
also been considered. Approaches such as resi- 
due arithmetic [9] and digital multiplication by 
analogue convolution (DMAC) [10,11] have been 
explored extensively. However, all such approaches 
involve many nonlinear, logic operations that 
have to be performed optically to avoid numer- 
ous conversions between the optical and electrical 
domains. Consequently, many all-optical (optical 
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IN, optical OUT) switch devices have been inves- 
tigated, for performing logic operations, such as 
nonlinear Fabry-Perot etalons and the self-electro- 
optic effect device (SEED) which is based on the 
quantum-confined Stark effect [12]. Significantly 
reduced power consumption in performing these 
nonlinear operations has been achieved with SEED 
devices [13] compared to early switches. However, 
no approach has, to date, resulted in an optical 
numerical computer competitive with electronic 
numerical computers. Fundamental arguments 
have been forwarded about the disadvantages of 
performing numerical operations optically, such as 
the large minimum-power requirement to perform 
nonlinear operations optically compared to elec- 
tronically, even with SEED devices [14], the limits 
on integration density in such processors due to 
lower limits on the size of optical processors due to 
the wavelength of light [15], and the lower signal- 
to-noise of optical signals compared to electrical 
signals because the fundamental shot-noise power 
limit of optical signals is larger (for the same band- 
width) than the Johnson (or thermal) noise limit 
for electrical signals. 


14.1.3 Optical interconnections/ 
optics in computers 


An alternative paradigm for optical implementa- 
tion of computational algorithms (vice the more 
familiar arithmetic formulation) is to recast par- 
ticular signal and image processing algorithms 
in terms of routing of optical data among various 
nodes according to a particular interconnection 
scheme and with various weighting factors for data 
that recombine at nodes. This paradigm is similar 
to that used in neural-net formulations of process- 
ing problems [16]. The data nodes are “neurons”. 
At these nodes only multiplication and addition 
are required, the former for applying weighting 
factors, and the latter at the recombination nodes, 
respectively. A simple threshold operation at the 
recombination node(s) residing at the output plane 
is often required. The advantage of using optics to 
perform the routing function is based on the free- 
space propagation characteristics of optical beams. 
Different optical channels (at reasonable power 
levels, so nonlinear effects can be ignored) can be 
routed in free space without interfering. Unlike 
physical wires, optical channels can overlap within 
the same space. 


A data-routing paradigm can be used to 
describe the basic Fourier transform operation 
by a lens or a hologram, as has been described 
above and illustrated in Figure 14.1. The Fourier 
Transform is effected as a global optical “inter- 
connection”; the interconnections are global since 
every input datum (or pixel) is connected to every 
output (transform) datum, and the input weight- 
ing factors are the amplitude and phase values at 
the pixel locations at front focal plane. A more 
general formulation involves both global and non- 
global interconnection, such as in neural-network 
formulations where layers of “neurons”, or simple 
processors that perform addition and multiplica- 
tion of data, are interconnected. The multiplicative 
weighting factors applied at various neurons can 
be adaptively adjusted to solve signal-and image- 
processing problems [16]. Optics has been explored 
for implementation of specific neural net algo- 
rithms [17,18]. The dynamic range and accuracy 
requirement for the additions and multiplications 
in neural net algorithms is generally not high, and 
therefore can be performed adequately with ana- 
logue data in an optical implementation. However, 
while optics has potential to implement massively 
parallel neural networks that might be difficult for 
electronic implementation, the maturity of neural 
network theory at this time has not yet progressed 
to the point where neural processor performance 
is limited by the size and interconnectivity pos- 
sible with electronically implemented neural nets. 
Hence, the intrinsically large optical interconnec- 
tion capability is not yet needed. Therefore, the 
alternate development of optical routing and pro- 
cessing capabilities has been towards enhancing 
the capabilities of computers and telecommuni- 
cation networks (see Chapter 2). All-optical digi- 
tal switching fabrics for interconnection of large 
fiber-optic networks have been addressed with 
arrays of SEED devices [19] that provide the logic 
operations needed for switching. While switching 
fabrics have been successfully demonstrated [20], 
the scale of fiber-optic telecommunication switch- 
ing needs have not yet called for such ultra-high- 
bandwidth fabrics. Hence, optical interconnection 
has concentrated on use with within electronic 
computers, vice optical computing, and towards 
interconnecting large arrays of sensors and their 
associated electronic processors. 

One avenue of research for using optics 
in computers has been to explore simply the 
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interconnection among and within all-electronic 
computing elements in conventional and novel 
multiprocessor architectures. A second avenue 
has been to optically perform necessary switch- 
ing, routing, and pre-processing operations for 
individual fiber links and networks of comput- 
ing elements. Many investigations have addressed 
advanced concepts for interconnection and storage 
to implement novel multiprocessor and networked 
computing architectures [21]. Expectations have 
been driven by the fact that optical communica- 
tions and optical storage have been the two most 
successful commercial optical technologies. 
However, these commercial technologies usually 
function exactly as in corresponding electronic 
subsystems. Therefore, significant research is 
required to adapt commercial technologies to fully 
exploit the interconnect potential of optics. 

Optical interconnection schemes for comput- 
ers range from computer-to-computer, box-to-box, 
card-to-card backplane, and finally chip-to-chip 
within a circuit card. The first of these already exists 
in several forms commercially; hence, research has 
concentrated on the latter applications. Guided- 
wave (also known as lightwave) interconnections 
are used for high throughput, low power and low 
bit error rate. The application is generally in novel 
multiprocessor computer configurations that focus 
on reducing or eliminating bottlenecks in conven- 
tional computers such as relating to memory access 
time. Free-space optical interconnections can be 
used advantageously as the number of processors 
increases, and they allow three-dimensional (3D) 
multiprocessor configurations [22]. For circuit- 
card and chip interconnections free-space opti- 
cal beams do not have the limitations of physical 
wires, so circuit cards and elements may be laid out 
with more degrees of freedom. 


14.1.4 Interconnection processing/ 
in-fiber processing 


In addition to interconnecting network nodes 
for computer or communication networks, one 
can consider processing data residing within the 
interconnect path. Such processing is generally 
feasible only for fiber or guided-wave intercon- 
nections. The data may be either in digital or 
analogue form, the former for conventional digi- 
tal computer and telecommunication systems, 
and the latter for arrays of fiber-optic sensors. 


While the type of processing will vary depend- 
ing on the nature of the optical fiber systems, the 
processing load can be expected to scale with the 
bandwidth of the data conveyed. With optical 
modulation rates now in excess of 100 GHz (see 
Section 4.2.1), commensurate processing speeds 
are required. Faster processing will allow more 
flexibility and capability in fiber systems, such as 
routing and switching a larger number of digital 
data channels. Use of conventional all-electronic 
approaches imposes a need to perform optical-to- 
electronic and electronic-to-optical conversions. 
Such conversions add complexity, and high-speed 
electronics will tend to be power hungry and may 
not provide the throughput required, e.g., due to 
electronic processor latencies. Hence, it is attrac- 
tive to perform processing directly on the optical 
data stream at the transmission rate, e.g., in-fiber 
processing (see Section 14.2.4.3.3). The possibility 
of manipulating the information stream within 
an optical fiber has been made possible by (1) the 
development of optical fiber amplifiers, (2) the 
capability to build long fiber delay lines needed for 
short-duration buffer storage and for implementa- 
tion of tapped delay lines, and (3) fiber couplers for 
tapping into delay lines and forming devices such 
as interferometers. 

High-bandwidth digital and analogue data on 
fiber interconnects are often comprised of a large 
number of lower-bandwidth channels. There is 
need to multiplex and de-multiplex these channels. 
Digital systems may eventually require optical 
implementation of functions now performed elec- 
tronically; these include clock recovery and read- 
ing of packet headers to enable packet switching 
and routing. However, if optical de-multiplexing 
is possible, electronic means for clock recov- 
ery will usually be available at the lower channel 
bandwidths. Routing of de-multiplexed channels 
requires reading of digital packet header infor- 
mation. As has been noted above, it has been dif- 
ficult for optics to implement a digital computer; 
thus it is presently difficult to optically perform the 
logic needed to read packet headers and perform 
packet switching. However, it appears attractive 
to optically switch/rout high-bandwidth, fully- 
multiplexed, light streams, since these can usually 
be switched at rates much lower than the data band- 
width, and therefore avoids optical-to-electronic 
conversion at the signal bandwidth. Optical fiber 
networks (see Chapter 2) also can possess unique 
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characteristics that require corresponding new 
processing functions. A major unique aspect is 
the wavelength-division multiplexing (WDM) 
capability of most optical networks, which then 
requires functions such as wavelength selection 
and amplitude equalization among all the wave- 
length channels ina WDM system. 

Optical processing involving any of the vari- 
ous paradigms described above is practical only 
if materials and technology exist to construct 
devices that effectively implement the desired 
processing operations. The early literature gen- 
erally showed architecture developments that 
required device performance exceeding capa- 
bilities at the time. Experience has shown that 
the former must work in conjunction with the 
development of the latter. Therefore, in the fol- 
lowing we describe materials and devices that 
have been important to the development of opti- 
cal processing techniques. Specific applications 
are described immediately after descriptions of 
device performance. 


14.2 OPTICAL DEVICES AND 
PROCESSING APPLICATIONS 


The needs in optical processing for optical modu- 
lation, display, storage and routing are shared with 
many applications, covered in other chapters in 
this Handbook (see Chapters 6, and 13). However, 
optical processors must not only efficiently modu- 
late information onto optical beams, but must also 
rapidly manipulate the optical data to perform 
useful information processing. The needs for large 
temporal bandwidths and high-frame-rate 1D and 
2D spatiotemporal optical modulation are clear, 
and are covered in Sections 14.2.1, 14.2.2 and 14.2.4 
respectively, below. In addition, wavelength multi- 
plexing can be performed in conjunction with both 
temporal and spatiotemporal processing, and vari- 
ous wavelengths can be selected or rejected out of 
either temporal or multidimensional data streams. 
Various wavelength-processing techniques are dis- 
cussed in Section 14.2.3. 


14.2.1 Temporal modulation and 
processing 


Processing rate can be maximized via either 
increase in single-channel modulation rate or 
increase in number of modulated channels. 


High-speed single-channel modulation can be 
done either by direct modulation of a semicon- 
ductor laser/light emitting diode (LED) or by 
using modulation devices external to the laser. 
The latter affords more flexibility to address 
a range of applications, although it is bulkier 
than the former. The former is attractive for 
their simplicity and high electrical-to-optical 
efficiency. However, modulated LED output 
power is limited, and there is a theoretical limit 
of approximately 30 GHz for high-bandwidth 
modulation with laser diodes, due to relaxation 
oscillations of charge carrier density in the laser 
cavity [23]. 

Temporal modulation with external devices 
achieve the highest speeds and lowest power in 
optically-guided lightwave structures such as fiber 
and channel waveguides. Either phase or intensity 
modulation devices can be employed. 

The basic guided-wave structures for external 
phase modulation are the channel waveguide 
and the waveguide splitter where a channel 
waveguide branches out into two guides in 
the form of a “Y”. Two waveguide splitters can 
be combined to form the guided-wave version 
Mach-Zehnder interferometer, as shown in 
Figure 14.2. Application of a voltage, V, across a 
channel waveguide fabricated in an electro-optic 
material such as LiNbO, will alter the optical path 
length, or equivalently, the phase of light passing 
through the waveguide due to the electro-optic 
effect [24]. If desired, the phase-modulated light 
can be converted to intensity-modulated light 
by using the guided-wave Mach-Zehnder and, 
most commonly, modulating the light in one of 
the two guides. High-speed modulation requires 
the voltage signal to be applied to a transmission 
line running parallel to the optical waveguide. 
The transmission line, which replaces the elec- 
trodes in Figure 14.2, is designed so the instan- 
taneous voltage signal of the transmission line 
travels at close to the same velocity as the guided 
optical wave, greatly increasing modulation 
efficiency. This design approach has produced 
intensity-modulation devices with speeds and 
bandwidths of up to 100 GHz at less than 6 V 
drive signal [2]. 

The Mach-Zehnder modulator can be used 
for analogue intensity modulation. The output 
intensity as a function of V is a nonlinear but well- 
known function: 
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Figure 14.2 Guided-wave Mach-Zehnder interferometer. 


I,/Inp=1+{sinn(V /Va)}, (14.5) 
where Vr is the voltage to induce a x phase shift in 
the light beam, I, is the incident light beam inten- 
sity and I, is the output intensity. The sine-squared 
function can lead to spurious responses in a broad- 
band signal. The production of spurious third- 
order signals can be seen from a Taylor-series 
expansion of the modulated quadrature-point 
signal 


1tsin(1V (t)/V;) 


As 


1t(nV(tVV;.) 


+{(1/6)(17V(t)/V;,) thigher order terms}. (14.6) 


Hence, if there are two frequency components f, 
and f, in the broad band signal, third-order inter- 
modulation signals at 2f, — f, and 2f, —f, will appear 
within the band [25]. The effects of the sine-squared 
transfer function can be minimized by operating 
two Mach-Zehnder interferometers in parallel or 
back-to-back [26,27], but generally at an increase 
of optical insertion loss. 

Fiber-optic versions of a Mach-Zehnder inter- 
ferometer can be constructed, as shown in Figure 
14.3a. Fused fiber couplers are at both ends of the 
interferometer legs to split and recombine the light 
beams from the single-mode fiber. Since silica 
fiber is not an electro-optic material, an additional 
component such as a piezoelectric cylinder is used 
to impose phase shifts onto the light in one leg 


(Figure 14.3a) by increasing/decreasing the length 
of the fiber. Use of a piezoelectric element maxi- 
mizes the possible phase change; however, it also 
limits the modulation speed of such devices. 
Intensity modulation can be achieved using elec- 
troabsorption in III-V materials such as GaAs and 
its ternary and quaternary compounds with In, Al, 
and P [28]. In electroabsorption the optical density 
of materials whose bandgap is closely matched to a 
laser wavelength changes with applied voltage. In 
a device such as the electro-optic Mach-Zehnder 
device, the modulation voltage changes the optical 
density (imaginary part of the index of refraction), 
vice the optical phase (real part of the refractive 
index) as described above. Use of multiple quan- 
tum-well (MQW) materials, with an engineered 
exciton-line band edge, allows devices with faster 
operation at lower voltages and with higher signal- 
to-noise than the classical Franz-Keldysh effect in 
materials like bulk GaAs. MQW devices consist of a 
stack of ultra-thin, ~10-nm thick, layers of an alter- 
nating composition of III-V materials. Alternating 
layers of GaAs and AlGaAs are used for modula- 
tion at 850nm wavelength. Alternating layers of 
quaternary materials such as InAlGaAs/InGaAs 
are used for operation at 1550nm. Excitons pro- 
duced by illumination of the MQW material have 
a much sharper and narrower exciton-absorption 
band than in bulk non-quantum-well material. 
Application of the electric field moves the exciton 
absorption line towards the red, causing change 
in the absorption at a fixed laser wavelength. 
Maximum optical contrast requires operation at 
the specific wavelength close to that of the bound 
exciton line in the MQW structure. The contrast 
change can also be greatly increased by placing the 
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MQW stack within a Fabry-Perot cavity [29], so 
the incident light makes multiple passes through 
the stack. For maximum modulation speed a 
transmission line electrode must be used to main- 
tain synchronism between the modulating signal 
and the optical signal, as described above for the 
Mach-Zehnder phase modulator. Since the modu- 
lation is due to shifting of a band edge, the modu- 
lation is highly nonlinear with respect to drive 
voltage. Hence, this type of modulator has been 
primarily used for digital modulation. 

It is important to remember that speed, linear- 
ity, and efficiency are not the only parameters that 
need to be considered for temporal modulators. 
Other parameters that can be important include 
the capability to handle high optical power, and 
the device response to variables in the optical beam 
properties such as polarization state and wave- 
length. Hence, while the discussion above has con- 
centrated on guided-wave devices, depending on 


application one may need to consider slower and 
larger temporal modulators such as acousto-optic 
devices and micro-mechanical devices such as 
described in Sections 14.2.2.1 and 14.2.2.2 below. 


14.2.1.1 TEMPORAL PROCESSING 
APPLICATIONS IN FIBER OPTIC 
SYSTEMS 


14.2.1.1.1_ Microwave optics and 
beamforming 


Mach-Zehnder waveguide interferometer devices 
are crucial in the area of microwave optics. The 
most basic goal in microwave optics is trans- 
mission of high-bandwidth analogue signals. 
Microwave signals are modulated onto an opti- 
cal carrier for transmission by a fiber-optic line; 
the advantage of optical transmission is the lower 
size and weight compared to the use of electrical 
transmission lines and microwave waveguides. 
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However, care must be taken in order to maximize 
analogue dynamic range, such as by minimizing 
third-order intermodulation signals, as discussed 
above for Equations 14.5 and 14.6. Areas of appli- 
cation for microwave-optic transmission include 


e Antenna remoting for radar and communica- 
tions systems, to minimize the weight of trans- 
mission lines when antennas must be at some 
distance from the receiver/transmitter, such as 
at the top of a mast 

e In wireless networks, for feed lines from base 
stations to the cellular-network antennas 


In addition to signal transmission, coherent opti- 
cal devices can be constructed for antenna-array 
applications such as beamforming. In simplest 
terms, beamforming is the maximization of the 
transmit power of an array of antennas (for a 
transmitting array, or receive power for a receiv- 
ing array) at particular look angles and frequen- 
cies. Minimization of receive sensitivity can also 
be an objective for rejection of spatially distrib- 
uted interference. Maximization/minimization 
of transmit/receive power can be achieved by 
applying the correct phases to the antenna signals 
to produce constructive/destructive interference 
in the desired directions. For microwave-optic 
devices, the optical phases can be adjusted to pro- 
duce the desired beams. The attraction of opti- 
cal beamforming at microwave frequencies is the 
present difficulty in performing the necessary 
operations digitally at this high frequency range. 
To illustrate the beamforming operation we con- 
sider the simple case of a 1D linear array of N 
equally-spaced antenna elements. These elements 
can be either transmitters or receivers of a micro- 
wave signal at frequency fo. 

The phase difference of the signals from/by 
adjacent array elements at look angle a measured 
from the perpendicular to the line connecting the 
array elements is 


exp[j2mfo(d sina)/(v)] 


where d is the array-element spacing, and v is the 
signal velocity. The phase shift at the nth element 
relative to the first is therefore 


exp[j21fo(nd sina) /(v)] 


If the signal strengths at the antenna elements x, 
are sampled at a given instant of time and summed, 
one obtains (with wavelength i = f,/v) 


N-l 


b= > x,expl j2m(nd sinoyi(2)] 


n=0 


(14.7) 


To obtain signal gain, each term must be multiplied 
by a phase factor. For reasons soon to be apparent, 
the phase factor is chosen as 


exp[—j2mnk/N] 


The new summation is 


N-1 
b= ee exp[j2m(nd sina)(A) ]exp[—j2mnk/N] 
n=0 


(14.8) 


which is maximized when the exponential factors 
in every term cancel to produce unity, i-e., when 


k =(Ndsina)/v 


The quantity k is thus a phase corresponding to a 
beam at look angle a. The choice of phase factor 
for producing the beam b, is be seen to be equiva- 
lent to a spatial discrete Fourier transform (DFT) 
operation on the signal samples received from/by 
the transmit/receive array, respectively. 

The above beamforming example applies best 
to a continuous-wave signal with small fractional 
bandwidth centered at microwave frequency f,. 
If the microwave signal is short, e.g., an impulse 
function, as commonly for radar, the above beam- 
forming algorithm is not optimum, since the signal 
has large fractional bandwidth Af/f). In this large 
bandwidth case, time delays rather than phase 
shifts need to be applied to the array elements to 
produce a beam B(f). 


N-1 


B(t)= )a,x,(t-t,) 


n=0 


(14.9) 


where T,, is the delay applied to the nth element 
in an array of size N, and a, is a weighting func- 
tion for shaping the angular characteristics of the 
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beam, e.g., half-intensity mainlobe width and sid- 
elobe levels [30]. The required time delays between 
adjacent elements is given by 


Tru —-T, =(dsina)/v. (14.10) 


A fiber delay line for each array element can be 
used to implement the required delays, but this 
approach lacks the flexibility to easily change the 
beam direction. A novel alternate approach is to 
use different optical wavelengths for signal trans- 
mission to each array element. The wavelength- 
dispersion characteristics of the optical fiber, 
where different wavelengths will have different 
time delays over the same length of fiber [31], then 
contribute to the time delays between adjacent 
array elements. 

The 1D example generalizes to unequal spac- 
ings between array elements and to 2D arrays in a 
straightforward manner. 


14.2.1.1.2 Optical sensors 


The Mach-Zehnder device, either fiber or wave- 
guide version, is also an important class of device 
for sensors (see Chapter 11). For example, one leg 
of a fiber Mach-Zehnder is coated with a material 
that will change the length of the fiber in response 
to a specific external stimulus such as stress/strain, 
temperature, magnetic field, etc. Both the wave- 
guide and fiber-optic Mach-Zehnder interferom- 
eters can perform a number of temporal operations 
on the optical signal: phase shifting (Figure 14.3a), 
mixing of a modulated signal with another (Figure 
14.3b—note that the modulated input signal must 
be applied with a bias), switching of the polarity of 
signals (Figure 14.3c), spectral filtering using either 
a fiber Fabry-Perot or a Bragg grating (Figure 
14.3d), and sampling or time filtering of high-band- 
width signals (Figure 14.3e). Note the necessity for 
optical amplification, and its implementation in the 
optical domain with fiber amplifiers, as shown in 
Figure 14.3e, where the fiber is doped with material 
such as erbium and the probe beam can also acti- 
vate the amplification. When a similar fiber-optic 
Mach-Zehnder is used for demultiplexing and 
demodulation of sensor array data, higher accuracy 
is achieved because of compensation for the non- 
linear (1 + sine) transfer characteristic of a single 
Mach-Zehnder device, as already discussed above 
for Equations 14.5 and 14.6. 


14.2.1.1.3 Optical interconnect for 
computers 


Parallel high bandwidth optical channels, either 
guided-wave or fiber-optic, have been explored 
for optical interconnection applications [32], 
thereby multiplying the throughput rates that can 
be achieved on a single channel. However, simple 
aggregation of a multiplicity of single-channel 
optical-link hardware quickly becomes impracti- 
cal, especially in applications where volume and 
power must be limited, e.g., within a computer. 
Fiber-optic “ribbon” cabling, consisting of a num- 
ber of closely spaced fibers, is thus usually chosen 
to interconnect a number of high-speed proces- 
sors or boards. Challenges in producing such opti- 
cal interconnect hardware include the monolithic 
integration of arrays of lasers, modulators, and 
detectors. Achieving the requisite density for laser 
arrays that can be coupled to fibers has necessi- 
tated the development of vertical-cavity surface- 
emitting laser arrays (VCSELs) with low power 
threshold for lasing and high electrical-to-optical 
efficiency [33]. The required laser efficiency has 
been achieved through use of quantum well mate- 
rials that reduce the number of allowed excited 
states that must be optically pumped [34]. 


14.2.1.1.4 Wavelength processing in 
fiber optic systems 


WDM technology has been a major development 
in optical fiber telecommunications systems. 
Figure 14.3d illustrates how wavelength chan- 
nels can be passed or rejected using, respectively, 
fiber devices such as the fiber Fabry-Perot filter, a 
mechanical device (Chapter 8.5), and fiber Bragg 
gratings, refractive index gratings written perpen- 
dicular to the long dimension of the fiber via the 
photorefractive effect. While WDM technology 
is primarily used in long-haul telecommunica- 
tions to increase capacity without commensurate 
increase in physical plant, WDM technology can 
also be applied to short distance optical intercon- 
nects to provide an additional degree of freedom. 
For example, in ribbon fiber interconnects, each 
fiber can have a separate wavelength. The wave- 
length of a channel can be used as an identifier 
for data routing, reducing the need, or even 
avoiding the need to read the packet header (a dif- 
ficulty for optics mentioned in Section 14.1.4). A 
key enabler of WDM optical interconnects is the 
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monolithic multiple wavelength surface emitting 
laser array [33], which provides either a unique 
wavelength for each laser, or fewer but redundant 
wavelengths. 


14.2.1.2 FREE-SPACE OPTICAL APPLICATION 


Nonguided-wave temporal modulators can be 
applied to free-space communications, distributed 
computing architectures, laser radar, and laser 
designators. In the latter the use of high band- 
width coding is often needed for increased detec- 
tion margin or range, using correlation methods, 
mathematically described in Section 14.2.2.1. For 
free-space optical communications bulk electro- 
optic modulators often require high drive powers 
and have low contrast. However, a novel quantum- 
well modulator device holds promise for both 
high speed and high contrast [35]. The modula- 
tor is positioned on the entry/exit facet of a corner 
cube retro-reflector. Electrical signals applied to 
the retro-reflector result in modulation of an inci- 
dent laser beam. Data rates of up to 10 Mbps over 
several meters at bit error rates of 10-6 have been 
demonstrated; the approach has the potential for 
hundreds of megabits/second at power consump- 
tions below 100 mW. Use of two retro-reflectors 
allows a two-way link to be established. 


14.2.2 One-dimensional spatial light 
modulators and applications 


14.2.2.1 ACOUSTO-OPTIC DEVICES 


Optical modulation of data in a 1D spatial format 
allows parallel processing of blocks of temporal 
data. The most effective 1D light modulators have 
been acousto-optic, and this technology has played 
an important role in the demonstration of opti- 
cal processing architectures. The basic construc- 
tion of an acousto-optic device is shown in Figure 
14.4. A high-bandwidth rf drive signal is applied 
to an acoustic, piezoelectric, transducer that has 
been bonded to one end of the acousto-optic cell, 
using acoustic impedance-matching materials. The 
resultant acoustic wave is a replica of the rf drive 
signal. The rarefactions and compressions of the 
acoustic wave produce corresponding refractive- 
index changes due to the elasto-optic effect. The 
cell thus contains a travelling phase grating cor- 
responding to the acoustic wave. The main features 
of acousto-optic diffraction important to optical 
processing are now summarized. 


Undiffracted light 


Figure 14.4 Construction of acousto-optic cell 
and Bragg cell geometry for data multiplication. 


e The maximum diffraction efficiency, into 
a single order, occurs when the difference 
between the momentum vectors of the incident 
and diffracted light is equal to the acoustic- 
wave momentum vector. For an isotropic mate- 
rial momentum matching occurs when light is 
incident to the acoustic wavefront at the Bragg 
angle 0,, defined by 


sin®z =A /(nA)=Af(2nv) (14.11) 


where A, f, and v are the acoustic-wave wave- 
length, frequency and velocity, respectively, A is 
the optical wavelength of the monochromatic 
incident light, and n is the index of refraction 
of the medium at A. To diffract light predomi- 
nantly into only a single diffraction order one 
must examine the quantity Q, 


_ 2MAZ 


a= nd?” 


For Q > 7, analysis shows that the first dif- 
fraction order contains >90% of the diffracted 
light [36]. 

e The diffraction efficiency is given by a nonlin- 
ear sine-squared relationship, that arises from 
considering the diffracted (I,) and undiffracted 
(I,) light as two modes of a coupled mode 
system [37]. 


T/Ip =Cp sin? {Mo (1? /227)(L/ H) CP}? 
(14.12) 
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where P,, is the rf drive power, L/H is the ratio 
of width-to-height for the acoustic transducer, 
C, and C,,are constants, and M, is a figure-of- 
merit for diffraction efficiency that depends 
only on material parameters, and is given by 


M,=(n°p’)/(pv>) (14.13) 


where nis the refractive index, p is the elasto- 
optic coefficient, p is the density and v is the 
acoustic velocity of the material. The sine- 
squared relationship, Equation 14.12, leads 

to similar considerations as for the Mach- 
Zehnder devices discussed earlier, Equation 
14.5, such as limitation on wideband dynamic 
range due to third-order intermodulation 
products. Also, I,/I, less than 3% in order for 
the relationship between I, and P,, to be linear 
to better than 1%. 


Extensive literature exists on materials and design 
of acousto-optic devices [38,39]. The figure-of 
merit M,, Equation 14.12, provides guidance on 
choice of optimum AO device material, especially 
dependence on refractive index n. Device develop- 
ment has converged on LiNbO, for operation at 
high frequencies and modest time-bandwidth (up 
to 4 GHz, ~100 TBW), TeO, for lower frequency 
application but large time-bandwidth (<100 MHz, 
several thousand TBW), and GaP for intermediate 
bandwidths and time-bandwidths (up to 1 GHz, 
several hundred TBW) [6]. 

The major aspects of acousto-optic devices are 
summarized below: 


e For the isotropic case, the diffracted light is 
deflected by an angle 20,. The more general 
nonisotropic diffraction case is shown in 
Figure 14.5, compared with the isotropic case. 
In this nonisotropic case, the incident light is 
polarized as an extraordinary ray with momen- 
tum vector K; and the diffracted light is of 
opposite polarization from that of the incident 
light, K{ The angular relationship among the 
incident and diffracted light beams and the 
acoustic vector k, is determined by the necessity 
for each beam to reside on its respective optical- 
index ellipsoid, and is shown in Figure 14.5. 

e The information in the cell is constantly 
updated due to the travelling wave nature of 
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Wave-vector diagram for noncolliner acousto-optic filter. 


Figure 14.5 Anisotropic Bragg diffraction 
geometry. 


the acoustic signal. If the cell is driven by an 
electrical signal r(¢), the information displayed 
along the cell direction, x, is given by r(t — x/v), 
as shown in Figure 14.4. 

e Complex (both amplitude and phase) data on 
an incident light beam, s(x, #), that spatially 
fills the cell aperture will be diffracted by the 
acoustic grating and therefore contain the 
product of s(x, t) and the complex data on the 
acoustic wave, r(t — x/v). The amount of spatial 
data contained on the modulated beam is 
equivalent to the number of resolvable deflec- 
tion position for the optical beam and is the so- 
called time-bandwidth (TBW) product, equal 
to the product of the length of the cell, in units 
of time, and the temporal bandwidth of the 
drive signal [8]. This result is easily derivable 
from Equation 14.11 and the angular optical- 
diffraction limit due to the finite spatial extent 
of the incident light beam. 

e Ifthe continual temporal update of data in the 
cell, represented by r(t — x/v), is not desired, 

a short pulse of light can be used to “freeze 
frame” the instantaneous contents of the cell. 
Alternatively, the incident light beam can be 
focused into a small diameter spot within the 
cell, resulting in a temporal modulator [40]; 
the bandwidth of such a temporal modula- 
tor is determined by the traversal time of 

the acoustic wave through the spot of light, 
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and initiation of modulation is determined 
by the delay caused by transit of the acoustic 
wave from the transducer to the optical spot 
location. 

e The center frequency of the diffracted light is 
shifted relative to the incident light frequency 
by the carrier frequency of the acoustic signal. 
The frequency shift arises from energy and 
momentum conservation considerations in dif- 
fraction by a moving grating. Either frequency 
upshift or downshift is possible, depending on 
the momentum direction of the incident light 
relative to the acoustic beam momentum direc- 
tion. Acousto-optic diffraction is an important 
means of producing small frequency shifts 
onto the ~100 THz optical carrier. 


14.2.2.1.1 Multi-dimensional devices 
using acousto-optics 

To better exploit the two spatial dimensions of 
optics, acousto-optic devices have been extended 
to 2D architectures either by, constructing an array 
of 1D devices in a single crystal, or by constructing 
a device with acoustic transducers on orthogonal 
edges of a crystal. In either of these approaches a 
large, high quality crystal is required. For multi- 
channel acousto-optic devices, a large number of 
channels is desirable, but in the absence of electri- 
cal crosstalk, the major fundamental limitation is 
acoustic diffraction. Acoustic diffraction depends 
not only on the dimensions of the transducer, but 
also on material. Ideally, the near-field acoustic 
wavefront is planar over the aperture of the trans- 
ducer, and diffraction effects are observed in the 
far-field. Hence, it is desirable to maintain a near- 
field condition over as long a propagation distance 
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as possible. In an isotropic medium, the transition 
from near to far field occurs at a distance from the 
transducer of approximately 
D=H?/8L (14.14) 
where H is the vertical dimension of the transducer 
and L is the acoustic wavelength. However, for an 
anisotropic medium, D can be increased by an 
additional factor of c= (1 — 2b)". The quantity b is 
the coefficient of the q? term in a power series rep- 
resentation of the acoustic slowness surface [41], 


K,(qa)=K,(1t+bqit+dqi+L...) (14.15) 


where q, is the acoustic-wave direction relative to 
the normal to the transducer. For shear mode TeO, 
the quantity c is 0.02 so that the acoustic spreading 
is very rapid. Because of the severe acoustic spread- 
ing in shear mode TeO,, multichannel devices have 
been constructed only of longitudinal mode TeO, 
where the spreading factor c is 2. A 32-channel 
TeO, device at a center frequency of 250 MHz has 
been demonstrated [42,43]. 

The second 2D approach uses orthogonally- 
propagating acoustic waves. Using two separate 
1D cells orthogonal to each other, a set of anamor- 
phic optics passes light from one cell to the other, 
where the data in the first cell are focused and 
passes through every point on the second cell. This 
scheme is illustrated in Figure 14.6. A much more 
compact alternative to individual 1D devices is to 
employ a single large square crystal with transduc- 
ers along two edge facets and light propagating 
perpendicular to the square aperture. However, 
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Figure 14.6 Triple-product processor using orthogonal acousto-optic Bragg cells. 
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this approach is feasible only in cubic materials. 
TeO, satisfies this criterion, and shear-mode TeO, 
devices have been demonstrated. However, if the 
two acoustic waves propagate within the same vol- 
ume of crystal, nonlinear mixing of the signals will 
occur at lower power levels; this is particularly true 
for TeO,, which exhibits acoustic nonlinearities at 
relatively low power levels. Thus, devices are made 
with the acoustic transducers offset from each 
other along the optical-beam direction. The trans- 
ducer dimension along the optical path and the 
offset distance must be minimized to prevent loss 
of resolution or to avoid use of optics with large 
depth of focus. 


14.2.2.1.2 Acousto-optic applications 
Analogue processors 


One-dimensional processing. The most basic appli- 
cation of the acousto-optic cell is to place informa- 
tion at the front focal plane of a Fourier Transform 
lens. The spatial spectrum of the travelling wave 
is then displayed at the back focal plane. Since 
acousto-optic devices have been constructed into 
the microwave frequency range, up to 4 GHz [6], 
an rf spectrum analyzer may be constructed as 
shown in Figure 14.7. A collimated optical beam 
with intensity profile a(x) illuminates an acousto- 
optic cell P, that is driven by the signal s(t). The 
signal s(t) must first be mixed with a carrier sig- 
nal cos (2zf.t) to produce the drive signal f(f) at the 
acoustic frequency f.. The complex spectrum of f(f) 
appears at the back focal plane P, of the Fourier 
transform lens. The spectral components with fre- 
quency upshift are represented by F,(a, f), and with 
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Figure 14.7 Acousto-optical spectrum analyzer. 


frequency downshift by F_(a, #). The Bragg dif- 
fraction geometry determines whether F, or F_ is 
displayed. The number of resolvable spectral posi- 
tions is equal to the TBW product, as described 
above. The spectral update rate is approximately 
equal to the length of the illuminated portion of 
the Bragg cell, typically in the sub-microsecond to 
ten-microsecond range. The photodetector array 
at the back focal plane must therefore have corre- 
sponding output frame rate. The spectral informa- 
tion must also be measured with high fidelity and 
dynamic range, placing additional demands on the 
dynamic range performance of the photodetector 
array [44]. For a power spectrum analyzer only the 
amplitude A(«) is measured by square-law photo- 
detector elements. If the phase information must 
be preserved, each photodetector element must 
be illuminated with a reference optical beam and 
the bandwidth of each element must support the 
difference frequency between signal and reference 
beam. 

The back focal plane information of a spectrum 
analyzer can be cascaded into a second Fourier 
Transform lens, producing an optical image of the 
spatial information. Such an optical image can be 
multiplied with another information array by locat- 
ing a second acousto-optic cell at the image plane, as 
shown in Figure 14.8. The travelling-wave nature of 
the acoustic-cell data has been advantageously used 
to implement the correlation integral, Equation 
14.4. A final lens performs the Fourier Transform of 
the product of the functions shown in Figure 14.8, 
and the output of a small detector at the center of the 
back focal plane is the correlation integral 
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Figure 14.8 Space-integrating acousto-optical correlator. 


R(t)= fre + x/v)s(t—x/v)dx = [Festa —2t)du. 
(14.16) 


The arrangement of Figure 14.8 has been used 
to perform high-speed correlation according to 
Equation 14.16 [7]. The integration can be thought 
of as performed by the focusing action of the 
lens, hence the correlation architecture is termed 
“space-integrating”. Processing of a long-duration 
signal with the space-integrating architecture 
requires formatting of input data into frames of 
size, or length (when inserted into the cell), less 
than that of the illuminated portions of the cells. 
This formatting adds to the burden for high speed 
pre-processing in this processor. 

An alternative to spatial integration using 
a lens, as shown in Figures 14.7 and 14.8, is to 
employ time-integration in the photodetection 
process [45]. The product terms within the kernel 
of integral transforms such as Equations 14.4 and 
14.16 can be obtained by successive diffraction by 
acousto-optic cells of modulated light. The dura- 
tion of signals is not limited by the size of the cells 
and photodetector array, but by the feasible inte- 
gration time. The photodetector array does not 
need to be read until its charge capacity is reached, 
and digitization of the output allows further signal 
integration time. Output data rates in these “time- 
integrating” architectures are much lower than 
with the space-integrating architectures. However, 
only a small fraction of the resultant correlation 
function is produced, and some a priori knowledge 
of the location of the region of interest is required. 
Otherwise, the amount of output data does not 
decrease. 

Multi-dimensional processing. A  multiplic- 
ity of 1D spectrum analyzer and correlation pro- 
cessors can be implemented in parallel using the 


multichannel acousto-optic devices described in 
Section 14.2.2.1.1. The main application of these 
multi-channel processors is to process data from 
arrays of sensors. Array functions are performed 
in addition to the 1D analysis; these functions 
include 


e Direction-finding in conjunction with spec- 
tral analysis [44] using an array of rf receive 
antennas. Each array element drives a sepa- 
rate channel of a multi-channel acousto-optic 
device where the acoustic transducers are 
relatively spaced exactly as for the array anten- 
nas. By coherently illuminating the multiplicity 
of channels, the phase differences among the 
various channels result in additional deflection 
orthogonal to the spectrum-analysis direction 
that provides information on the angle-of- 
arrival of the signal at the antenna array. 

e Generalized multi-dimensional array beam- 
forming [46]. The acousto- optical channels 
impose phase factors or time delays on array 
signals to produce directional transmit or 
receive beams, using a straightforward exten- 
sion of the formalism described in Section 
14.2.1.1 for a 1D array. 


A more general extension of Fourier transform 
processing into two dimensions using acousto- 
optic input devices has been explored. With the 
orthogonal-cell arrangement shown in Figure 14.6, 
one can perform (1) 2D beam deflection in free 
space, for applications such as optical interconnec- 
tion and raster addressing in displays, and (2) 2D 
processing where the processing kernel consists of 
two 1D factors. For the latter, the two orthogonal 
acousto-optic devices are combined with a tempo- 
rally-modulated light source and a 2D integrating 
photodetector array to produce what is known as a 
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triple-product processor (TPP) [47]. As illustrated 
in Figure 14.6, the time-integrating version of the 
TPP output has the form 


kT 
gltn)= [AOA (tm) h(e+u,)de (14.17) 


where u, and u, are the coordinates of the output 
plane, T is the maximum integration time of the 
product signal onto the 2D photodetector array, 
and kis a constant <1 depending on the duration of 
the input signals. By employing various functions 
t> f» and f,, the TPP, Equation 14.17, can imple- 
ment a number of important signal-processing 
operations at extremely high speed: 


e Time-frequency transformation, such as 
Ambiguity processing (also known as range- 
Doppler processing) in radar and sonar [48]. 
The product of the functions f() and £,(t+ u,) 
constitute the cross-correlation kernel, as in 
Equation 14.4, to provide range determina- 
tion. Function f, is the Fourier phase factor for 
compensating for any Doppler effects on the 
cross-correlation function, thereby resulting in 
the 2D ambiguity surface. Additional 2D time- 
frequency transforms that can be performed 
with the TPP include the Wigner function, 
instantaneous power spectrum, and cyclosta- 
tionary function [46]. 

e Fourier transformation of very large 1D blocks 
of data in 2D format, also known as folded 
spectrum analysis [47]. Since each cell of the 
TPP can have TBW of about 1000, a signal can 
be spectrally analyzed into approximately one 
million narrow frequency bins. 

e@ Matrix operations [49]. Matrix operations are 
employed, not for numerical processing, but 
to allow use of linear algebraic techniques to 
formulate an optical implementation of vari- 
ous signal-processing operations for the TPP. 
For example, analogue multiplication of three 
matrices has been implemented by cascading 
the TPP into a 2D SLM (covered in Section 
14.2.4). In particular, the triple-matrix prod- 
uct allows the implementation of similarity 
transforms to be performed; these transforms, 
which diagonalize an input matrix, perform 
the important data-reduction function that 


permits lower-bandwidth results to be passed 
to subsequent processing. 

e Synthetic aperture radar (SAR) image forma- 
tion [50]. SAR image formation is historically 
important as one of the first successful applica- 
tions of optical signal processing. The original 
processors as input film onto which the radar 
return signals were written. For near-real time 
image formation the film can be replaced by 
a 2D SLM (discussed in Section 14.2.4 and 
Chapter 6). However, the use of acousto-optic 
cells allows more-rapid generation of SAR 
imagery over either film or present 2D SLMs. 
The original method where the radar return 
signals were written onto film and the devel- 
oped film used as the input into an optical sys- 
tem. The ability to use 1D acousto-optic cells to 
produce 2D SAR imagery follows from recog- 
nition that the SAR image-formation equation 
is separable into factors involving integration 
over only the range variable n, followed by inte- 
gration over only the azimuth variable € The 
equation governing the formation of an image 
point at coordinates (x, y) is given by 


Al A2 


F(n, i(m/ADy Wd 
AeWE | [' (n,S)expLi(a/AD, (n+ y)" Jan} 


0 0 Expl{(M/ADy(y))(E+ x)” }d& 
(14.18) 


where f(n, €) represents the signal from point 
reflector on the ground, and D, and D,(y) are 
parameters or functions related to signal cur- 
vature (SAR systems transmit a train of short 
impulses; the resultant radar returns are chirp, 
or quadratic phase functions, but the chirp 
rates are different for the n and € directions). 
In the limits of integration A1 is proportional 
to the integrated number of radar pulses in 
the azimuth direction, and A2 determines the 
duration of the radar pulse train in the range 
dimension. 


The analogue processors described in this sec- 
tion have found only niche application to date. 
However, a number of prototype acousto-optical 
processors have exhibited some advantageous 
features. The advantages generally lie in the size, 
weight and power consumption advantages over 
a comparable all-electronic implementation of 
equivalent processing power. For example, analogue 
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data multiplication using acousto-optic technol- 
ogy at eight-bit precision can have a 350 x power 
consumption advantage over digital multipli- 
ers [51]. Specific examples of compact processors 
include a 6 in.> rf spectrum analyzer [6], andaSAR 
image-formation processor in <1 ft? [52]. But the 
need for significant high speed pre-and post-pro- 
cessing electronics (as described in Sections 14.1 
and 14.1.1) results in significant disincentive to 
use optical processing unless the size, weight and 
power advantages are paramount. Without such 
advantages even the early use of optics for SAR 
image formation has been largely replaced with 
digital electronic processing. An added measure 
of advantage could be obtained for optical proces- 
sors if many of the electronic functions can be per- 
formed optically or if, at the output of an optical 
processor, the amount of data for subsequent elec- 
tronic processing can be reduced. If a data-reduc- 
tion algorithm can be implemented, such as the 
similarity transform mentioned above, one would 
not need to read out an entire 2D array or could 
use a linear array; however, this situation is clearly 
specific only to a special class of data processing. 
An alternative is to construct a “smart” photode- 
tector array that selects only the region of interest 
for readout, such as a local peak [15]. Increasing 
the variety of operations that can be performed 
opto-electronically would also allow intelligent 
algorithms to be implemented. Many all-electronic 
approaches can be intelligently formulated to 
reduce the computation load required for a given 
processing task; use of the fast Fourier transform 
(FFT) algorithm to greatly reduce computation 
in digital electronic processing is an example that 
cannot yet be duplicated in optics. 


14.2.2.2 ONE-DIMENSIONAL ELECTRO- 
OPTIC MODULATOR AND 
APPLICATIONS 


An electro-optic modulator is a potential alterna- 
tive to the acousto-optic device. A phase diffrac- 
tion grating is produced by the electro-optic effect 
rather than a pressure wave. Such modulators con- 
structed of LiNbO, have been demonstrated for 
high-speed printing applications [53,54]. However, 
in the most common configuration the electro- 
optic modulator requires many parallel addressing 
lines. Hence, there is significant difficulty in gen- 
erating a multiplicity of phase gratings. Further, 
significant effort is required to format data for 


parallel addressing. However, careful design of 
the electrical addressing (the intrinsic speed of the 
electro-optic effect can be sub-picosecond) can 
result in a device that creates a grating much more 
rapidly than acousto-optic devices. An alternative 
approach to parallel electrodes is to use a high- 
speed shift register to address the electro-optic 
material. This approach is architecturally similar 
to that of acousto-optics. Use of a shift register has 
advantages such as capability to vary clocking rate. 


14.2.2.3 ONE-DIMENSIONAL MECHANICAL 
MODULATORS AND APPLICATION 


Spatial modulation can be performed using micro- 
electromechanical structures (MEMS). Devices 
are constructed entirely of silicon for both the 
addressing circuitry and the modulation struc- 
ture. The micromechanical modulator design con- 
sist of support structures and cantilevered beams 
[55] or torsion beams extending from the struc- 
tures; these are all fabricated out of silicon by an 
etching process [56]. A silicon circuit below the 
micromechanical structure provides addressing 
and activation voltages that deflect the structure. 
The deflection changes the angular position of the 
reflected light beam, producing intensity modula- 
tion in the readout system. The early devices were 
called deformable membrane devices (DMDs), and 
this terminology persists although no longer lim- 
ited to the original membrane MEMS approach. 

The importance of MEMS devices has been in 
their application to all-optical switching for fiber- 
optic telecommunication systems, where switch 
speed is not important. The advantage is that the 
data remain in optical form during the switching 
operation. 


14.2.3 Wavelength processing 


The bandwidth of optics can be exploited more 
fully by using a broad band of the optical spectrum 
and by subdividing an optical signal into many 
narrow-wavelength data channels and processing 
each channel individually. Processing in the wave- 
length dimension has already been introduced in 
Section 14.2.1.2 with respect to the importance 
of WDM fiber-optic systems. However, process- 
ing in the wavelength dimension can be applied 
to both temporal and multi-dimensional data. 
Wavelength-selective acousto-optic and electro- 
optic devices can be applied to spectroscopic 
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systems, to channelize temporal data, and to per- 
form 1D, 2D and 3D imagery analysis and display. 


14.2.3.1 ACOUSTO-OPTICS 


The separation of a broadband optical signal into 
narrow wavelength channels can be performed 
using acousto-optics. Many optical modulators are 
sensitive to the wavelength of operation. However, 
acousto-optic modulators are unique in that, with- 
out a large degree of modification, they can be 
used as flexible elements that can select a number 
of wavelengths simultaneously. The phase gratings 
produced by the acoustic wave will, as any grating, 
select out specific wavelengths at the appropriate 
angle of incidence for Bragg interaction. The grat- 
ing period, corresponding to specific wavelength 
to be selected, is generated by driving the acousto- 
optic device with the appropriate rf frequency 
according to Equation 14.6. This spectral filtering 
action is notable in that by driving the device with 
a composite rf signal a multiplicity of wavelengths 
may be selected simultaneously, unlike optical 
glass and interference filters and resonant tunable 
structures such as Fabry-Perot cavities. 
The resolving power of a grating, 


R=AA/X 


is equal to the number of grating lines. Hence, for 
an acousto-optic tunable filter (AOTF), depending 
on whether the incident optical beam is parallel or 
perpendicular to the grating lines, the resolving 
power is then 


R=D/A or R=L/A, respectively (14.19) 


where D is the extent of the illuminated region 
(parallel case, beam transverse to the acoustic 
wave) and L is the length over which the optical 
and acoustic wave overlap (perpendicular case, 
beam collinear with the acoustic wave). 

AOTFs generally operate in the anisotropic dif- 
fraction mode, as shown in Figure 14.5, so that the 
filtered output light is orthogonally polarized to 
the input, i-e., the input beam must also be polar- 
ized to allow high contrast isolation between input 
and output optical beams by using crossed polar- 
ization. Good AOTF design for high spectral reso- 
lution must maximize 


R=L(An)/A 


where An is the maximum difference in refrac- 
tive indices for the birefringent material [57]. The 
rf frequencies required for specific wavelengths 
is governed by phase matching conditions for 
anisotropic diffraction [41,58]. Popular materials 
for AOTFs are LiNbO, and TeO, for the visible to 
mid-wave IR, Tl,AsSe, for near-to long-wave IR, 
and quartz (SiO,) and MgF for the UV. Both bulk 
and integrated-optic devices are possible. The lat- 
ter are naturally compatible with optical fiber, and 
utilize the polarization anisotropy in thin-film 
optical waveguides in addition to material anisot- 
ropy [59]. 

The major opportunity for AOTF in informa- 
tion-processing applications is in their potential 
employment in WDM techniques and systems. 
For conventional fiber optic systems guided-wave 
AOTFs have been developed to filter the various 
WDM channels at moderate, microsecond, speeds 
[59]. However, significant interest has been gener- 
ated in the use of AOTFs to flatten power across 
the WDM spectral band in long-distance commu- 
nications networks. Since the erbium-doped fiber 
amplifiers used in long-distance networks do not 
amplify uniformly across the WDM band, and 
networks need to be dynamically reconfigurable, a 
rapid means is required to pre-condition the chan- 
nel powers such that all channel power levels are 
always equal following the fiber amplifier, allowing 
proper functioning of the WDM network [60]. In 
the future, just as in fiber-optic communications, 
developments in optical interconnections are 
expected to encompass WDM techniques. AOTFs 
presently represent a viable wavelength switch for 
initial development. 

The multiple-wavelength capability of AOTFs 
allows numerous other applications including 


e Color generation and correction in multi- 
dimensional laser displays [61], such as for laser 
light shows. 3D displays systems have been 
developed using the rapid wavelength tunabil- 
ity and scanning capability of acousto-optic 
devices in conjunction with commensurate 
optical-beam scanning within a 3D medium 
such as with a rapidly spinning turbine blade. 

e Scene imaging in many spectral bands, ie., 
creation of image “cubes” for multi- and 
hyperspectral techniques for remote sensing 
in commercial, astronomical, and military 
applications. 
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e Wavelength-selective microscopic probing for 
research in biological processes, such as by 
selectively quenching fluorescence in biological 
samples with specific wavelengths [62]. 


Some of the present development issues in the use 
of AOTFs include, materials and devices for opera- 
tion further into the UV and IR bands, power 
requirements due to the necessity for the rf drive, 
spectral sidelobe levels which limit channel selec- 
tivity in WDM networks, and imaging quality 
through AOTF devices. Finally, the Bragg diffrac- 
tion requirement often limits the angular accep- 
tance angle for imaging, and the use of rectilinear 
shapes for the AOTF can introduce astigmatism 
into the imaging system. 


14.2.3.2 ELECTRO-OPTICS 


Electro-optic devices can produce phase gratings 
analogous to those in acousto-optic devices, as 
previously mentioned in Section 14.2.2.2. Electro- 
optic devices have also been considered for wave- 
length filtering. By laying an array of electrodes 
onto a guided-wave structure, optical wavelength 
filtering can be performed analogous to the 
integrated-optic. However, it is desirable to have 
devices that are faster than either the electro-optic 
and acousto-optic devices discussed so far. One 
approach is to employ a bulk device with volume 
holograms as the wavelength-sensitive element. 
Volume diffraction gratings can be produced 
optically in a number of materials using the photore- 
fractive effect [63,64]. The photorefractive effect is a 
manifestation of the electro-optic effect on a micro- 
scopic scale. Internal space-charge electric fields, 
E,,, arise from photocharge generation, followed 
by separation of the charges via photoconduction 
(transport of charge from illuminated to unillumi- 
nated regions) or via application of an external field, 
E,,. Gratings are produced from an interference pat- 
tern obtained with two intersecting coherent optical 
beams. However, a rapid means is needed for either 
changing the grating period or changing to a differ- 
ent grating. Presently, sub-microsecond photorefrac- 
tive processes are possible in quantum-well materials 
[3], but packet switching requires even faster speeds. 
A recent development, entitled electrohologra- 
phy has potential to achieve nanosecond switching 
[65]. The quadratic electro-optic effect is used in 
paraelectric crystals such as KTa,Nb,_.O, (KTN). 
In addition to a space-charge field E,., a dc field, 


sc? 


E,,, is applied, leading to a cross term E,. £dc. Until 
E,, is applied, the information contained in the 
hologram is latent, the diffracted light contain- 
ing only bias (E?,,) and unipolar (E?,.) terms. In 
principle, such switchable gratings would allow 
processing operations required in packet switches 
(e.g., multicasting, power management, data moni- 
toring) to be performed at close to the data bit rate, 
entirely in the optical domain. 


14.2.4 Two-dimensional spatial light 
modulators 


2D SLMs are the most critical devices required for 
full and efficient exploitation of the inherent paral- 
lel-processing and interconnection capabilities of 
optics. As discussed above, and shown in Figure 
14.6, 1D acousto-optic devices have successfully 
performed a number of important 2D signal- 
processing operations. However, that approach 
will work only for problems that can be separated 
into two 1D factors. Many image processing opera- 
tions cannot be so factored. 

Although Chapter 6 discusses SLM devices and 
technology, this section reviews necessary and 
desired performance characteristics for information 
processing. Devices will be cited without repeating 
descriptions already provided in Chapter 6. We first 
briefly review the major types of SLM. 


14.2.4.1 TWO-DIMENSIONAL SLM DEVICES 
CATEGORIES 


The first level of categorization is whether the 
device is electrically-addressed (E-SLM) or opti- 
cally-addressed (O-SLM). The next level of catego- 
rization is by modulation mechanism or material. 
Within each category both O-SLMs and E-SLMs 
are possible. 


e Electro-optic. These use traditional linear 
electro-optic materials such as LiNbO,, KD*P, 
and bismuth silicon oxide, and quadratic 
electro-optic material such as PLZT. There is 
usually a strong trade-off between frame rate 
and resolution. 

e Liquid crystal. There are two major classes. 
Twisted nematics, as used in commercial 
displays. Large arrays are available (~107 
pixels), but frame rate is limited by the 
natural relaxation time of the liquid crys- 
tal, of the order of 10 ms. 
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Ferroelectric liquid crystals (FLCs) possess 
a permanent electric dipole. Both write and 
erase speed can be increased in proportion 
to drive signal level. Frame rate can there- 
fore be high, limited by addressing cir- 
cuitry. The most mature devices are binary 
contrast only. Analogue devices with large 
grey-scale capability are less mature 

© MQW structures. These devices employ the 
quantum confined Stark effect in stacks of 
ultra-thin, ~ 10-nm thick, layers of IH-V mate- 
rials, generally of an alternating composition. 
Quantum well materials have intrinsically very 
fast response times, and have been described 
in Section 14.2.1. Large 2D arrays with high- 
speed readout have been produced. Monolithic 
integration with III-V, e.g., GaAs, electronic 
circuitry is a desired goal. However, integration 
with silicon circuitry is more common. 

e MEMS. Previously introduced in Section 
14.2.2.3, large 2D arrays have been developed, 
e.g., >750 x 500 pixels for display purposes, 
but the DMD has generally fewer pixels but 
higher frame rate. These are strictly E-SLMs 
and therefore the addressing circuitry limits 
the frame rate. However, MEMS arrays can be 
constructed of silicon, so the modulator and 
addressing circuitry can be monolithically 
integrated. 


14.2.4.2 SLM FUNCTIONS AND GOALS FOR 
OPTICAL PROCESSING 


A large variety of SLM devices were categorized 
in Section 14.2.4.1. However, to understand the 
desired functioning of an SLM a general construct 
can be used, describing an SLM as a “sandwich” 
of structures (Figure 14.9): a modulating mate- 
rial between control and conversion structures. 
The conversion structures handle the input to and 
output from the SLM, the “write” beam and the 
“read” beam, respectively, and may consist of pho- 
tosensitive material for an O-SLM, or an array of 
electrodes for an E-SLM. The control structure is 
the third port of this “three-port” device. Figure 
14.9 can be used to illustrate the variants of SLMs 
and the basic functions they provide for optical 
processing: 


e Electrical-to-optical transduction. The write 
image is in electrical form, e.g., the SLM input 
is via an electron beam, as in a cathode-ray 


Read 
image 


Write 
image 


; Modulated 
output image 


Photoconductor Modulating material 


Figure 14.9 Generic sandwich construction for a 
2D SLM. 


tube display, or via write lines on an electrical 
circuit. The input information is modulated 
onto an optical read beam (no read image 
data). 

© Optical-to-electrical transduction. If the write 
image is in optical form, there is no read image, 
and the Modulated Output is electrical, then 
the SLM acts as a classical photodetector array. 

e Light conversion. The write image may be on 
an optical beam of either incoherent light or 
coherent light of a particular wavelength. The 
read image is a readout beam of coherent light 
of another wavelength onto which the write 
image is modulated to produce an optical 
modulated output beam. For incoherent optical 
input, the image data is converted from inco- 
herent to coherent light; in the coherent input 
case the image data can be converted from 
one wavelength to another. Additionally, the 
polarization state of the output can be changed 
relative to the input polarization. 

e Imagery projection. The write image data may 
be either electrical or optical. If the read image 
is a readout beam, an optical beam of suf- 
ficient intensity then that beam may be used 
for projection display of the write image data 
modulated onto it. 

e Image processing. An SLM can be used as an 
input and processing device for image-pro- 
cessing systems that perform operations such 
as transformations (e.g., Fourier transform), 
image correlation, nonlinear operations such 
as thresholding and level slicing, and linear 
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re-mapping of images. The write image data 
may be either electrical or optical. The read 
image and the modulated output can then 

be either optical or electrical, respectively 
(depending on the write image). Some image 
processing may be performed first by the SLM, 
but the modulated output will be into either a 
further optical or electronic image-processing 
system. 

e Image amplification. The read image is a 
readout beam that is modulated to produce an 
optical modulated output. The optical power in 
the read beam may be larger than in the write 
beam, or the SLM can amplify the write beam 
signal. 

e Analogue multiplication, addition and subtrac- 
tion of data arrays. Data on the write beam can 
interact with data on the read beam using any 
of a variety of mechanisms to perform these 
operations. 

e Storage of data arrays. The modulating mate- 
rial may retain the information impressed onto 
it for some time. During the retention time a 
modulated output is obtained. If the SLM is 
optically read out, the modulating material is 
an optical memory. 

e Input-to-output isolation and gain element for 
2D data arrays, akin to a transistor for 1D tem- 
poral data. This three-port behavior is particu- 
larly important for optical interconnection and 
cascading optical elements in an all-optical 
architecture, since with a three-port device the 
modulated output levels are independent of the 
strength of the write image signal. 

e Analogue phase conjugator for adaptive optical 
systems. Write and read images and modulated 
output are all optical; the SLM performs the 
phase conjugation operation. 


For optical information processing systems utiliz- 
ing 2D SLMs to be effective, SLM devices need to 
exhibit a minimum throughput (frame rate times 
pixels/frame) on par with that already demon- 
strated with acousto-optic devices, roughly 10° 
samples/s. However, this level of throughput can 
be partitioned between frame rate and frame size, 
as in the following two examples. 


e A liquid crystal display of 107 pixels and 30 
frames/s would approach the 10° samples/s 
goal. 


@ MQW modulator with 6.5 x 104 pixels (256 x 
256 device) at 10° frames/s would exceed the 
goal. Such a modulator would likely be opti- 
cally addressed, to avoid use of ultra-high 
speed electronic addressing circuits. 


Depending on application it is likely that 10° pixels/ 
frame would be desirable for processing of large 
images, and >10° framing rate would be needed for 
applications such as 3D displays and rapid search 
of volume holographic databases (see Section 
14.2.4.3.3). In addition to these two parameters, 
additional requirements are 


e Large dynamic range and good image quality. 
A uniform 1000:1 contrast ratio would satisfy 
most requirements. 

© Low power consumption. A power consumption 
figure of merit would have units of mW cm 
kHz". This figure of merit would relate also 
to the size and resolution of the SLM. To keep 
power consumption low for a given number of 
pixels, the size of the pixels needs to be mini- 
mized. Pixellation corresponding to 50 line 
pairs/mm, easily within the state-of-the-art of 
most fabrication technologies, would minimize 
the focal length of the optics needed for relay- 
ing information to/from the SLM, and overall 
minimize optical system size and power. 

e Data storage time. Flexibility to address a vari- 
ety of applications (Section 14.2.4.3) is desir- 
able, e.g., a fast frame rate SLM with storage 
times of seconds to almost indefinite. It may be 
necessary to keep the same data displayed in an 
SLM indefinitely as one rapidly processes imag- 
ery or searches databases. Another example, 
would be where the SLM responds to slowly- 
changing phenomena, such as environmental 
conditions, in an adaptive optics application. 


At the extreme, the data are time invariant; except 
for write-time considerations, use of film as an 
SLM would be acceptable. Time-invariant applica- 
tions include feature extractors and fixed filters for 
image processing. 


14.2.4.3 APPLICATIONS OF 2D SLM DEVICES 
14.2.4.3.1 Image processing 


Next to the use of optical processing in syn- 
thetic aperture radar image formation, the 2D 
optical processor with the longest history is the 
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Fourier-transform based image correlator. This 
processor has arguably seen the most develop- 
ment over a number of years, with interest for 
high speed automatic target recognition systems. 
But performance has been determined by available 
SLM devices and photodetector arrays. Because 
of a lower requirement on space-bandwidth than 
alternate architectures, and ability to use direct 
phase encoding for the input and filter function, 
the preferred correlator architecture is a modifica- 
tion of the arrangement for temporal correlation, 
Figure 14.8. Known as the VanderLugt correlator, 
2D SLMs are used at the input image plane f(x, y) 
and the Fourier-filter plane, and a 2D photodetec- 
tor is used to output the correlation function (at 
the location of the second acousto-optic cell in 
Figure 14.8, and the last lens and point photodetec- 
tor are eliminated). Correlation of one image f(x, y) 
with another s(x, y) is performed as multiplication 
in Fourier space followed by inverse Fourier trans- 
formation. The key to use of this architecture is the 
scheme for encoding both phase and amplitude of 
the Fourier-plane filter onto an SLM device or film 
[66]. Variants of the VanderLugt filter have been 
explored to match the capabilities of existing SLMs 
and to improve throughput. These variants gener- 
ally take advantage of the well-known fact in signal 
processing that clipping the amplitude of a signal 
does not greatly degrade the detection margin in 
the correlation process, only a 1 dB penalty typi- 
cally. Hence, binary amplitude Fourier filters have 
been used. A further extension has been to addi- 
tionally binarize the phase values in the filter, ie., a 
binary phase only filter where the amplitude trans- 
mittances of +1 correspond to phase shifts of 0 and 
m. Various high-frame-rate SLMs with limited or 
no grey scale capability are therefore viable can- 
didates for a high-speed correlation system such 
as the FLC SLM, the DMD SLM, and the electro- 
optic PLZT-on-silicon SLM. A compact correla- 
tor operating at 500 Hz using FLC technology has 
been packaged in a volume of 0.6’ 1.0’ x 0.4’ [15]. 
Many issues in producing viable optical proces- 
sors have been addressed for optical correlators. 
The performance and size of an optical correlator 
is found to depend on SLM pixel pitch and size, 
and device flatness and uniformity. For example, 
the level of detail of the Fourier-plane filter will 
increase with number of pixels in the SLM and 
thus increase the ability to differentiate among 
various objects. Also, the correlator system size 


scales linearly with the number of pixels and qua- 
dratically with pixel pitch [15]. Thus, smaller pixel 
pitch in larger more-uniform frames are needed. 
However, equally important are the performance 
parameters of the photodetector array. Since it is 
often only necessary to detect the location of the 
peak of the correlation function, many proposals 
have been made for smart photodetector arrays 
that can report correlation results as peak/no 
peak at rates commensurate with the SLM rate. If 
peak detection is insufficient for the application, 
then faster image processing will be required to 
extract correlation plane features. Whether such 
processing can be done in another stage of pro- 
cessing remains a future challenge. Unless many 
of these issues are addressed, optical systems 
will remain noncompetitive with all-electronic 
implementations. 

Adaptive techniques are a way to improve the 
performance of optical processing systems, and 
SLM technology can been applied to such tech- 
niques. One technique is to perform phase con- 
jugation on an optical wavefront to restore the 
wavefront to its original state by removing aber- 
rations and distortions introduced by the environ- 
ment or imperfections in an optical system. Phase 
conjugation requires the storage of detailed inter- 
ference fringe patterns produced by the aberrated 
wavefront and an unaberrated reference beam. 
A phase-conjugate mirror can be implemented 
with an SLM constructed of a photorefractive 
material [67]. Progress to date has centered on 
development of the required material parameters. 
High speed of response and efficiency are required 
from the photorefractive material. Photorefractive 
quantum-well material is capable of sub-microsec- 
ond response, and the efficiency has been increased 
by constructing pixels consisting of etalons [3]. 
Phase conjugate efficiency of 100% with high net 
gain has been achieved with polymer photorefrac- 
tive materials [68]. 


14.2.4.3.2 Optical interconnection 


Free-space: crossbar switch. Free-space optical 
interconnection can be described in matrix for- 
mulation that is easily related to capabilities in 
optical systems. Consider an array of optical input 
and output data channels, such as fiber-optic. If the 
array of input channels is represented as a vector 
u, any element or combination of elements can be 
routed to any element of the output array or vector 
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v using an appropriately constructed matrix M in 
the equation 

v= Mu (14.20) 
SLMs are needed in this concept because of several 
desirable characteristics they can provide. First, 
when interconnecting among a number of differ- 
ent communications channels an SLM provides 
isolation between input and outputs. Second, the 
routing process is naturally lossy, because each 
element of v must fan out onto a row of M. Optical- 
to-optical SLMs can then provide the amplifica- 
tion gain to compensate for these optical losses. 


14.2.4.3.3 Optical storage 


Effective processing systems need to have access 
to memory. It is natural to explore possible use 
of optical processing hardware with optical stor- 
age technology. One function of an SLM is stor- 
age of data, as noted in Section 14.2.4.2. An SLM 
can store the results from one stage of process- 
ing, and present those results to the next stage. 
However, the amount of information that can be 
stored on an SLM is limited to the number of pix- 
els in the device. Optical devices that store large 
amounts of data are, however, of interest in optical 
information processing, e.g., large databases may 
need to be accessed rapidly and repeatedly, such as 
for cross-correlation. Conventional high-density 
optical storage technology has been reviewed in 
Chapter 13. However, conventional optical disk 
storage technology has not been found useful for 
optical processing without significant modification 
or additional electronics. One nonconventional 
approach stores a library of analogue holograms 
on a rotating photopolymer disk for subsequent 
correlation. Large numbers of holograms can be 
stored on a single disk, and rapid scanning of the 
library is possible with standard mechanical rotary 
drives. Another disk approach is to store conven- 
tional digital data in a 2D page format. Such holo- 
graphic disk systems have been demonstrated that 
perform at 1 Gbit s' sustained and 10 Gbits s7 
peak transfer rates; along with the holography, is 
electronic processing that includes data modula- 
tion coding, interleaving, and error correction toa 
bit error rate of 10-3 [69]. 

Volume holographic storage has a more natu- 
ral optical interface using 2D SLMs to transfer 
data from previous and to subsequent stages of 


2D optical processing. Volume storage also would 
have a much higher storage capacity than disk, 
with a theoretical storage limit of ~10 Tbits cm™ 
(~volume/A? where A is approximately the small- 
est linear dimension for storing one bit). In com- 
parison, present single-layer optical disks have areal 
storage density of 15 Gbit s, which can be extended 
with multilayer disks, the increase depending on the 
number of layers and the laser wavelength. Among 
the issues in cascaded optical SLM and storage 
elements is rapid addressing of and access to data 
arrays, and the proper registration of data arrays. 
The latter issue has been addressed with electronic 
postprocessing techniques [70]. 


14.3 SUMMARY 


The area of optical information processing has 
explored many avenues of research. Areas of appli- 
cation are still developing, mainly as devices capa- 
bilities increase and new capabilities emerge. Many 
lessons have been learned on the proper applica- 
tion of optical capabilities. 


e Using optics for general numeric computations 
is not presently desirable, because many of 
the digital operations are intrinsically nonlin- 
ear. Necessary nonlinear optical devices are 
still large compared to electronic integrated 
circuits, and also inefficient, i.e., power hungry. 
This does not mean the architectures that 
have been developed are without value. Such 
architectures were developed mainly to cleverly 
exploit the parallel-processing capabilities of 
optics, and may be applicable to building spe- 
cialized processors [71]. 

e The advent of optical communications has led 
to a near-term processing application for fiber- 
optic interconnects between high-performance 
electronic digital processors and sensor arrays, 
whereby simple operations are performed on 
the optical data within fibers, such as multi- 
plexing/demultiplexing and signal condition- 
ing. Fiber interconnects are a straightforward 
insertion of optics, mainly to address input- 
output data-bandwidth bottlenecks that can 
occur with high-speed digital processing 
elements. But developments in fiber-optic 
technology have important implications for 
optical processing; these include the develop- 
ment of wavelength diversity techniques, and 
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of all-optical techniques, presently used to 
cascade fiber-optical subsystems in a network 
to avoid costly conversions between the optical 
and electrical domains. However, with fiber 
optics the full free-space 3D interconnection 
capability of optics is not fully exploited. Full 
exploitation of free-space 3D interconnection 
technology will require the use of wavelength 
diversity and all-optical techniques employing 
cascaded 2D SLMs. There will also be concur- 
rent development of novel multi-processor 
computing architectures. Candidate archi- 
tectures are likely to involve neural network 
parallel-distributed-processing concepts, where 
the interconnection scheme can be integral to 
the processing. 

Analogue optical signal processors, particu- 
larly for Fourier-transform-based operations 
such as spectral analysis and correlation, have 
been developed because they can perform 
linear operations, at a given rate, in imple- 
mentations with smaller size-weight-power 
product than all-electronic implementations. 
Avoidance of power-hungry analogue-to- 
digital conversion by processing directly in 
the analogue domain and concentrating on 
applications in the microwave-frequency 
region make such processors a true comple- 
ment to digital processors. Most development 
of compact systems has been done with 1D 
acousto-optic modulators, due to the matu- 
rity of acousto-optic technology. However, 

the highest performance should be obtained 
using 2D SLMs addressing intrinsically 2D 
problems such as pattern recognition and 3D 
optical interconnection. Use of wavelength- 
diversity techniques should further increase 
performance. Various 2D SLMs are capable of 
implementing functions needed for all-optical 
architectures, especially optical amplification, 
data storage, and data thresholding. All-optical 
systems are needed to address the fundamental 
problem with present analogue processors, by 
optically performing the present electronic 
pre-and post-processing of data into/out of 
optical processors. Such processing includes 
dynamic range re-scaling and data format- 
ting, various nonlinear operations, and signal 
amplification and data encoding schemes. 
These needs can be met potentially with devel- 
opments in optical-to-optical SLMs, adaptive 


optics, and volume holography. Throughput 
improvements of 2D devices are still required, 
since present throughput rates are lower than 
for the acousto-optic devices. Hence, initial 
implementations may still be restricted to 1D 
and quasi-multi-dimensional architectures. 


A feasible vision for the future for all application 
areas therefore involves all-optical architectures. 
Optical device technology exists for implementing 
subsystems that perform optical input, processing, 
and storage functions. The challenge is to develop 
efficient information and data flow between such 
subsystems in the optical implementations of pre- 
and post-processing functions that are done elec- 
tronically for present optical systems. 
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15.1 INTRODUCTION 


Optical spectroscopy is an invaluable tool for the 
characterization of many physical and chemical 
components and processes. It can provide complex 
and quantitative data and use it to precisely charac- 
terize a wide variety of physical objects or chemical 
analytes. Instruments can not only achieve much 
higher spectral resolution compared to eyes, but 
can also cover a much wider wavelength range and, 
if desired, even provide time- or distance-resolved 
results. A major advantage of optical methods of 
analysis is that they are usually nondestructive, 
noninvasive, and quick in operation. In optical spec- 
troscopy, electromagnetic radiation in the range 
of 10-1015 Hz is most commonly used, covering 
infrared (IR), visible (VIS), and ultraviolet (UV) 
radiation. Optoelectronic devices detect the interac- 
tions of this electromagnetic radiation with matter. 

This section will review the main methods of 
spectroscopy, concentrating mainly on covering 
general concepts of spectroscopy and introduce 
the technology behind some applications being 
covered in later chapters. 

The passage of electromagnetic radiation can be 
influenced by matter in processes such as refrac- 
tion, polarization change, specular reflection, 
scattering, or absorbance [1]. In most cases, the 
magnitude of these effects depends significantly 
on optical wavelength or photon energy. 

The technique of absorbance spectroscopy, 
or spectrophotometry, monitors attenuation of 
a beam of light by matter and is one of the most 
commonly applied methods in spectrometry, par- 
ticularly in the IR, VIS, and UV regions. 

Measurement of light lost from a beam due 
to scattering (nephelometry or turbidity mea- 
surement) [2] is also often used. Apart from the 
more common light-scattering mechanisms, vis. 
Rayleigh [3], Rayleigh-Gans, Mie and geometric 
scattering, other elastic scattering (i.e., scattering 
with no change of wavelength) can arise due to the 
effects of diffraction from periodic structures (i.e., 
like X-ray scattering in crystals). 
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Inelastic scattering involves a change in opti- 
cal wavelength (change in photon energy occurs 
inversely) and can arise, for example, in Raman [4] 
or Brillouin scattering, where a photon-phonon 
interaction process occurs. 

After absorption of light, matter can dissipate 
the absorbed energy by several means, for example 
from an excited electronic state to lower vibra- 
tional or rotational states, i.e., to produce phonons 
or heat. The absorbed radiation can also be rera- 
diated, resulting in luminescence behavior (this 
behavior can be subdivided into fluorescence and 
phosphorescence). In addition, after excitation 
with particularly energetic particles, matter can 
reemit strong radiation, a behavior made use of 
X-ray emission spectroscopy and atom emission 
spectroscopy. 

In some cases (for example, in flames), ther- 
mally excited electrons in the hot gaseous materi- 
als can exhibit direct radiation of light by energy 
transitions from the excited states. 

In addition to those that were discussed 
in Volume I, Chapter 1 (Introduction to 
Optoelectronics), processes relevant to spectros- 
copy will be briefly reviewed below [5-8]. The 
focus of this discussion will be particularly on 
the UV/VIS/NIR spectral region, especially when 
we describe quantitative aspects. The theoreti- 
cal principles of atomic states and dispersion will 
be discussed, ending with some description of 
photophysics and aspects of polarization states in 
waveguides. Furthermore important spectroscopic 
methods and types of spectrometers will be men- 
tioned together with a brief description of some 
interesting case studies of optical sensor systems. 


15.2 THEORETICAL PRINCIPLES 


15.2.1 Properties of electromagnetic 
radiation 


Electromagnetic radiation is characterized by 
its amplitude, frequency, state-of-polarization, 
phase, and in the case of modulated light, the time 
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dependence of its amplitude, phase, or frequency. 
Depending on the physical effects to be described, 
electromagnetic radiation can best be considered 
as either a particle or a wave, a behavior called de 
Broglie dualism. The choice of the wave descrip- 
tion, however, is best used to understand inter- 
ference and refraction effects [9,10]. The impact 
momentum, p, is proportional to the inverse of 
the wavelength, A, of the radiation according to 
Equation 15.1: 


a (15.1) 
P a : 
This can be given, in terms of effective mass (kg), 
m, and velocity (ms~), v, by 


p=mv, (15.2) 
where p is the momentum (kg m s”), and h is 
Planck’s constant=6.63 x 10-*4J s. Propagation of 
electromagnetic radiation can be explained by the 
well-known Maxwell’s equations, which predict 
that, in a propagating E-M wave, the electric and 
the magnetic field vectors lie perpendicular to each 
other and are perpendicular to the direction of 
propagation (see Figure 15.1). Considering the par- 
ticulate (or quantized) nature of the light, the radi- 
ation is composed of a stream of photons and the 
energy of each is proportional to the frequency in 
the wave model, so is therefore reciprocally depen- 
dent on the optical wavelength (see Equation 15.3). 


_ he 


E= hy =—-=hew. (15.3) 


Here, c=3X10*m s* is the velocity of light in vac- 
uum, and ¥ is the wave number in cm. 

There is a distinct lack of standardization in 
the units most spectroscopists use, depending 
on the field they work in. The unit of optical fre- 
quency that is most commonly used by scientists 
working in the IR region or using Raman scat- 
tering is the wave number. In the field of highly 
energetic radiation, eV (electron volts) is usually 
used. In the UV, VIS, and NIR wavelength range, 
most scientists use units of wavelength, A, in nm, 
or, in the IR, units in pm. Yet again, many of 
these are not S.I. units, reflecting the fact that the 
reasons for the choice of the different units were 
historical in nature. 


(a) 


) i 


Figure 15.1 Electrical fields of a Hertz dipole. 
These are perpendicular to both the magnetic 
field and to the direction of propagation of the 
electromagnetic radiation (a). The time depen- 
dence of the electric field distribution is given 
in (b). The polar diagram of radiated intensity 
(power) is shown in (c). 


The approximate order-of-magnitude ranges 
of wavelength, frequency, wave number, and pho- 
ton energy that correspond to a number of differ- 
ent types of electromagnetic radiation are shown 
in Table 15.1 and Figure 15.2. The typical type of 
interaction that each type of electromagnetic radi- 
ation might have with matter is also shown, but 
this will be discussed later. As can be seen, the fre- 
quency and wavelength scales are logarithmic in 
the table, in order to cover a very wide range. 

Electromagnetic radiation in the UV, visible, 
and IR wavelength range is especially interesting in 
spectroscopy. For this reason, this particular spec- 
tral range has been “zoomed out” in Figure 15.2 
and we have indicated the colors associated with 
the different parts of the visible spectrum. In the 


Table 15.1 Spectral regions of electromagnetic radiation, giving the approximate order of magnitude of relevant wavelengths, frequencies, wave 
numbers and energies, plus indications of typical interactions that particular type of radiation might have with matter 


Spectroscopic 
technique 
Nuclear magnetic 
resonance (NMR) 
0.1-10m 
Electron spin 
resonance (ESR) 
0.1-10cm 
Microwaves 
0.1-10cm 
IR 0.78-103 pm, 
3000-106 nm 


VIS 380-780 nm, 
UV 200-400 nm, 
UVU 100-200 nm 


X-rays 0.01-10 nm 


Moessbauer 100 
keV, y absorption 
y radiation, >1 MeV 


Approximate 
order of the 
wavelength (nm) 


1010 


108 


10° 


104 


102 


Approximate 
order of the 
frequency (s~*) 
3x10’ 


3x10° 


3x10" 


3x 1018 


3x1015 


3x1017 


3x 1019 


3x10! 


Approximate 
order of the wave 
number (cm) 


10 


10-1 


10 


10 


105 


107 


10° 


1011 


Approximate magnitude of 
energy (in various units) 


(keal 
mol") 
3x 10-6 


3x10-4 


3x10 


3x 104 


3x 106 


3x 108 


(kJ 
mol-") 
1.2x10°5 


1.2x 10-3 


12 


1.2108 


1.2105 


1.2x 107 


1.2x 10° 


(eV) 
1.21077 


1.2x10-5 


1.2x 10-3 


1.2x10-' 


12x104 


1.2108 


1.2105 


1.2x107 


Type of interaction 
Excitation of magnetic 
transitions of nuclei 
(spin > . 0) 
Excitation of unpaired 
electrons 


Excitation of rotation of 
molecules 

Absorption by excitation of 
molecular vibrations. Raman 
scattering 

Excitation of electronic 
transitions, emission and 
atomic absorption 
processes. Raman scattering 

Excitation and transitions of 
electrons involving inner- 
orbit energy levels 

Resonant absorption by nuclei 


Nuclei transformation 


siskjeue dIdodso4j2eds Z/G 
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Figure 15.2 Depiction of the full electromagnetic spectrum of radiation. 


case of simple absorption photometry, a transmis- 
sion measurement is taken at only one wavelength 
(ie., monochromatic light is used), but in more 
general spectrophotometry, a transmission spec- 
trum is usually measured over a wider wavelength 
range, involving perhaps a superposition of many 
rotational, vibrational, and electronic interactions. 


15.2.2 Interaction between radiation 
and matter 


15.2.2.1 DISPERSION 


The most usual and important interaction with 
matter for use in spectroscopy involves the electric 
field vector. The periodic change in the electric field 
vector of an incident E-M wave can cause either a 
new orientation of a molecule or a displacement of 
electrical charges in a dielectric medium (matter). 
Both of these result in electrical polarization of 
the matter, the so-called dielectric effect process. 
If there is any variation of this effect with optical 
wavelength or frequency, the effect is called dis- 
persion [9], as it changes the velocity of the wave, 
depending on its wavelength. 

Radiation induces a type of Hertz dipole in 
a dielectric material, and if this material is not 
optically homogenous with its surroundings, it 
can result in radiation, or scattering, into the sur- 
roundings. The charge displacement in a dielectric 


by an electromagnetic wave can be explained in 
terms of a simple harmonic oscillator model, where 
an alternating dipole is induced in each molecule 
or microregion of the material. The total effect is 
proportional to the number of molecules or mac- 
romolecular particles affected, i.e., polarization is 
a volume-based (colligative) quantity. 

This Hertz dipole oscillates at the excitation fre- 
quency of the radiation and behaves like a radia- 
tion source, emitting radiation most strongly in a 
direction perpendicular to the direction of polar- 
ization, as was shown earlier in Figure 15.1c. When 
propagating through matter, the radiation main- 
tains its frequency, because of the conservation of 
photon energy hy, but decreases in wavelength and 
velocity. The velocity of propagation within matter 
is inversely proportional to the refractive index, n, 
given by the ratio between the electric field vectors 
of the radiation propagating in vacuum and matter: 


_ E(vac) 
&£ (mat) 


(15.4) 


According to Maxwell’s equation, the square of the 
optical refractive index (ie., the value of refrac- 
tive index in the optical range of the dispersion 
curve) is equivalent to the dielectric constant, 
which, in turn, can conveniently be described 
mathematically in terms of an equation with real 
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Figure 15.3 Depiction of optical dispersion, showing the real (refraction) and imaginary part (absorp- 
tion) of dielectric constant versus frequency on a logarithmic scale. The lower curve shows various 


absorption bands. 


and imaginary parts. This latter parameter can 
be graphically depicted, as shown in Figure 15.3. 
The imaginary part corresponds to absorption (or, 
more rarely, optical gain) in the medium. 

In addition to polarization of matter, as occurs 
with scattering, absorption of light can also occur. 
Except in condensed matter (ie., solids or liquids 
where spectral broadening occurs due to degen- 
eracy of energy levels), absorption is usually not of 
a broadband nature, i.e., it often occurs at specific 
wavelengths. For many materials, e.g., liquids in the 
microwave region, a reorientation and a relaxation 
effect can be observed, which changes the effective 
dielectric constants and causes absorption. 

A resonance (caused by displacement of charges) 
can typically be observed in the IR (vibration) and 
UV/VIS (electronic transition) spectral regions. 
This causes strong absorption bands (i.e., a larger 
imaginary part of the dielectric constant). In the 
case of gases, rotational resonance is also observ- 
able at microwave frequencies. 

To summarize the above, two fundamental 
types of interaction between electric fields and 
dielectric matter can therefore occur: the first is 
scattering and the second is absorption (or, more 
rarely, optical gain). These will both be discussed 


in more detail in the following sections and we 
shall start with the case of scattering where no 
absorption occurs and light energy is conserved. 
In contrast, inelastic scattering (Raman) will be 
discussed later. 


15.2.2.2 SCATTERING 


15.2.2.2.1 Scattering from single 
particles 


Within molecules, charges may either be sym- 
metrically (nonpolar) distributed or they may have 
an asymmetric distribution, making the molecule 
polar in nature. In nonpolar molecules, the nuclei 
and the electron density distribution have the same 
center of charge distribution. However, in an elec- 
trical field, positive and negative centers can be 
displaced relative to each other, forming a Hertz 
dipole acting as an electromagnetic transmitter. 
The excited system behaves elastically, emit- 
ting energy directly from a nonresonant state 
(sometimes called a virtual state), with no energy 
absorbed. Even the smallest of molecules show 
such scattering effects, but, for such Rayleigh scat- 
tering (scattering by molecules or particles hav- 
ing a diameter of order of 4/10 or less), the effect 
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is usually very small, as the intensity of scattered 
radiation increases in proportion to the sixth 
power of the length of the induced Hertz dipole 
[3,11]. For these small particles, the intensity of the 
radiation varies as the fourth power of the incident 
optical frequency (if the particle has low optical 
dispersion) and the light is scattered at essentially 
the same frequency as the incident beam. It is this 
strong dependence on wavelength that causes 
white light from the sun to be scattered mainly at 
short wavelengths, giving the blue appearance of 
sky on a clear day or the aurora/afterglow. 

The radial distribution of scattered radiation 
depends, in general, on the state of polarization of 
incident radiation, on the shape of the particle or 
molecule, its dimensions and on the wavelength of 
scattered radiation. Thus, the following effects can 
be observed: 


e An isotropic particle with a diameter smaller 
than 4/10, forms with polarized incident radia- 
tion a scattered intensity distribution (polar 
diagram) as shown in Figure 15.4a. Maximum 
energy is emitted in a direction perpendicular 
to the axis of the dipole, but zero energy is 
scattered in a direction along the axis of the 
dipole. 

@ Many very small particles (<A/10) have some 
anisotropy (e.g., randomly oriented, needle- 
like, or egg-shaped particles) and so exhibit 
many induced Hertz dipoles at different angles 
(depending on the orientation). Then, even for 
incident polarized light, the polar diagram of 
the scattering is no longer as in Figure 15.4a, 
but has a new shape as in Figure 15.4b, and 
there is now a nonzero scattering intensity 
along the direction of polarization of the 
incident light. This phenomenon is known as 
depolarization of scattering. 

e Randomly polarized (or unpolarized) radia- 
tion generates a set of random Hertz dipoles, 
whereby the intensity of radiation in the 
forward or backward direction is unchanged, 
but the radiation field at other angles (e.g., at 
90°) is a superposition of the wavelets from all 
these different polarizations, with of course the 
contribution falling to zero whenever the inci- 
dent polarization (or the induced Hertz dipole 
axis) is in the direction of observation. This 
gives a polar diagram with the greatest scat- 
tered intensity in the forward and backward 


Figure 15.4 Polar diagrams for various cases of 
Rayleigh and Mie scattering. The horizontal arrow 
shows the direction of incident light. Where 

the incident light is plane polarized, the vertical 
arrow indicates the polarization direction. The 
inclined arrow shows a scattered light direction 
and the intersection with the outer surface indi- 
cates the scattered intensity at this angle. The 
top left figure (a) shows classical Rayleigh scat- 
tering, with small particles, where a Hertz dipole 
is induced along the polarization direction. The 
bottom left figure (b) shows the same when the 
particles are small, but have a small degree of 
anisotropy. The top right figure (c) shows the 
case for small particles with randomly polarized 
(or unpolarized) incident light. The bottom right 
figure (d) depicts a typical Mie scattering polar 
diagram for larger particles. 


directions, but with half that intensity in a 
direction at right angles to the direction of 
incident radiation (see Figure 15.4c). 

e If we now consider slightly larger particles (e.g., 
being of the same order as, or even larger than, 
the optical wavelength), but ones where the 
particle refractive index is very close to that of 
the surrounding medium, the behavior differs 
again. In this case, scattering occurs from 
many small points at different locations within 
each particle, yet the shape of the incident wave 
front remains essentially unchanged during 
passage through the particle. Then the excita- 
tion light has a phase that is dependent on the 
distance through the particle (Rayleigh Gans 
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scattering condition). Interference between all 
the scattered wavelets from different positions 
will clearly influence the more complex radia- 
tion pattern, so, to obtain the far-field polar 
diagram, the Rayleigh scattering intensity from 
each has to be multiplied by a weighting func- 
tion, which depends on the geometrical form of 
the particle (comparable to Mie scattering) [3]. 
If the scattering center or molecule is much 
larger and of significantly higher refractive 
index compared with its surroundings, then 
more than one center of scattering within 

the molecules will arise. However, the greater 
refractive index difference may now be enough 
to cause relative phase delays and even refrac- 
tion of incident light, giving rise to significant 
wave front distortion. Thus, there is a phase 
shift, and in extreme cases, even a change in 
direction of the transmitted light, relative to 
light outside the particle. Under these con- 
ditions, the so-called Mie scattering [3,11] 
occurs. As with Rayleigh—-Gans scattering, the 
scattered intensity distribution is determined 
by the interference of many scattered wavelets 
from different sites in the particle, but the 
situation is now far more complex as the phase 
of excitation of each point can no longer be so 
easily determined. The polar diagram is, there- 
fore, generally complex and usually has many 
scattering lobes (see Figure 15.4d). 


Thus, the intensity distribution and interference 
pattern contain information regarding the size 
and the shape of the molecules [3]. It is worth 
mentioning briefly that there is a further type of 
scattering, from very large, but optically homoge- 
neous particles (raindrops are a typical example), 
several orders of magnitude greater than the wave- 
length of the light, where the scattering can be 
considered solely in terms of simple ray tracing, 
assuming Fresnel refraction and reflection at the 
optical interfaces. This form of scattering, which is 
observed in rainbows, is known as Fresnel or geo- 
metric scattering. 


15.2.2.2.2 Scattering from multiple 
particles 


Scattering from multiple particles depends strongly 
on whether the light is incoherent (broadband) or 
coherent (narrowband). If the light is incoherent, 
scattered power from each of the particles can 


simply be summed to give an overall polar far-field 
scattered intensity that is similar to that from one 
of the particles, but this is only if not too many 
strongly scattering particles are present. If the 
population density of scattering particles becomes 
very high (as can occur, for example, in milk or in 
white paints), then the light may suffer strong mul- 
tiscattering paths, i.e., substantial secondary, ter- 
tiary, or even higher order scattering occurs before 
light from the first excitation (illumination) point 
can reach the observation point. A medium that 
causes significant light to be lost by scattering from 
a direct optical beam passing through it is called a 
turbid medium. 

If an ensemble of particles is excited with highly 
coherent light (e.g., monochromatic laser light), 
the behavior is very different. Now the scattering 
from every small particle, or from every part of a 
larger particle, results in a monochromatic scat- 
tered wavelet being reemitted from each. Thus, at 
the point of remote observation, the electric fields 
of all these coherent wavelets will add coherently. 
This results in constructive, destructive, or inter- 
mediate levels of interference, depending on the 
relative phase of the light from the particles, which, 
in turn, depends on their relative positions relative 
to both the incident beam and to the direction or 
point of observation. These effects cause the famil- 
iar “speckle” effect when light from a visible laser 
is first scattered from a diffusely scattering object, 
such as a turbid liquid, or from a rough surface. 

In a solid material, or particle in solid sus- 
pension, the positions of the particles may be 
reasonably stable, so then true elastic scattering 
occurs and a relatively fixed speckle pattern can 
be observed in the far-field scattering diagram, 
provided, of course, that no significant mechani- 
cal movement, vibrations, or thermal expansions 
occur. In fluid (gas or liquid) suspensions, how- 
ever, the particle will generally undergo more sig- 
nificant and rapid movements due to Brownian 
motion, convection or turbulence, so the speckle 
pattern will move continuously with time. Because 
of the small Doppler shifts, the light scattering can 
be described as quasi-elastic. This time-dependent 
scattering behavior, which has come to prominence 
since the 1970s, can be used to deduce certain use- 
ful parameters, such as particle size information, 
using a method called photon correlation spec- 
troscopy (PCS). The light scattered from a mono- 
chromatic laser beam interferes at the detector 
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and gives rise to intensity changes (or to changes 
in photon-arrival-time statistics), which can, with 
knowledge of the temperature and viscosity of 
the fluid in which the particles are suspended, be 
related to the hydrodynamic radius of the particles 
using the Stokes-Einstein equation. The method 
works best with single-size (monodisperse) parti- 
cles, but more complex correlation functions, from 
suspensions of two or even three particle types/ 
sizes, can still be inverted using Laplace transfor- 
mation to obtain useful information. 


15.2.2.3 REFLECTION 


Reflection usually describes the process where a light 
beam is reradiated back into the incident medium 
at a boundary. If only single particles (or random 
ensembles of such particles) are present in gases 
or liquids, they can be considered to be disconti- 
nuities that will cause diffuse light scattering, as we 
discussed earlier. However, if atoms and molecules 
are in very highly ordered arrays, as is the case in 
flat surface layers of solid matter (especially crystals 
and metals, but even in some amorphous materials, 
such as glasses, having only small-scale disorder), 
then the many Hertz dipoles can show constructive 
interference in preferred directions. This gives rise 
to directional or specular reflection at these solid 
surfaces, as seen, for example, on silvered-glass mir- 
rors. This results in reflection at the same (yet oppo- 
site) angle to the normal to the surface as the initial 
or incident radiation [9,13]. If the reflection occurs 
due to refractive index differences (Fresnel reflec- 
tion), then the amplitude of this reflectance depends 
on the refractive indices of both the media. For the 
simplest case of normal incidence and nonabsorb- 
ing media, the reflection is described by a simplified 
form of the Fresnel equation: 


p-| 22] ; 
(2, + Dy) 


If the reflection occurs at a polished metal sur- 
face, then the resulting more complete reflection 
can be predicted by Maxwell’s equations, treating 
the metal as a conductive medium. If the metal is 
assumed to be a perfect electrical conductor, then 
the reflection is theoretically predicted both by 
Maxwell’s equations and by simple conservation 
of energy considerations to be 100%. For nonper- 
fect conductors, the reflection is <100%, and the 


(15.5) 


radiation actually extends within the material to 
a depth called the “skin depth.” The reflection can 
therefore be used to determine the material prop- 
erties of the metal. 


15.2.2.4 REFRACTION 

Refraction describes the effect of a boundary 
between two optical materials on a light beam 
passing through it. Radiation incident at an angle 
a on the interface between media with different 
refractive indices (n, and n,) is refracted at an 
angle B, where, according to Snell’s law: 


m, sin = n, sinB. (15.6) 


Radiation, initially incident in a lower refractive 
index medium, is refracted toward the normal 
to the surface, so the angle of refraction becomes 
smaller, whereas in the opposite direction it is 
refracted away from the normal. 


15.2.2.4.1 Total internal reflection and 
evanescent field 


Equation 15.6 suggests there is a critical angle at 
which the angle of refraction is 90°. If the angle 
of incidence increases further, then total internal 
reflection occurs, a behavior only possible when 
light in an optically dense medium strikes an 
interface with a less dense medium. If the incident 
angle is greater than the critical angle, a simple 
explanation suggests radiation does not exit the 
denser medium, but is reflected back into this 
medium, as in Figure 15.5. This effect has become 
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Figure 15.5 Behavior of light incident on a 
dielectric interface, demonstrating refraction and 
total internal reflection. 
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very important in recent years for optical or fiber- 
optical waveguides (see Chapters 1 and 7). 

We shall now move on to consider inelastic 
processes, where either light energy is lost totally 
or where some light is reemitted with a change in 
photon energy. 


15.2.2.5 ABSORPTION (THE LESS COMMON 
OPPOSITE OF THIS IS OPTICAL 
GAIN) 


Absorption is a loss of light in a material, involv- 
ing an internal energy exchange, resulting in the 
destruction of a photon. Optical gain is the oppo- 
site of this, where a light beam gains energy when 
traveling through a medium. Both mechanisms 
usually take place only at specific wavelengths, 
corresponding to a defined transition between 
two energy states. Such a transition occurs when 
the natural frequency of charge displacement 
in the material is in resonance with the frequency 
of the incident radiation. We shall mainly concen- 
trate on absorption in the following discussion. 
Absorption involves a process that leads toa new 
excited (either stable or metastable) energy state in 
the material. In the case of electronic transitions 
in an atom, this involves creation of a new electron 
density distribution. In the case of molecules, it is 
possible instead to excite a new resonant, vibra- 
tional or rotational, mode of the molecule (possibly 
in addition to electronic transition). An electronic 
transition, from the highest occupied molecular 
orbital (HOMO) to the lowest unoccupied molecu- 
lar orbital (LUMO), as described mathematically 
by Equation 15.3, is demonstrated in Figure 15.6. 
Note in Figure 15.6, the phases of the orbitals are 
also given, in order to explain the symmetry of the 
orbitals, which causes specific transitions shown 
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Figure 15.6 Bonding and antibonding orbitals 
(HOMO, LUMO) and their transition possibilities. 


by single, double, or triple arrows. These demon- 
strate the differences in intensity of the transition. 
According to transition rules, only transitions 
between even and odd states are permitted. 

These HOMO and LUMO states correspond to 
bonding or antibonding orbitals [12]. The energy 
required for the transition is provided by the radia- 
tion and the process is known as (induced) absorp- 
tion. Within this absorption band, anomalous 
dispersion is often observed (i.e., the refractive 
index increases with wavelength). Depending on 
the molecular environment and possible pathways 
of deactivation (see Table 15.2), the new excited state 
can exist for a time varying over the wide range of 
10-% to 10°’. From the Schrodinger equation, the 
corresponding energy states can be calculated as a 
set of eigenvalues, by using the electronic, vibra- 
tional, and rotational eigenfunctions and inserting 
the boundary conditions that are appropriate to 
the molecular structure. The relevant energy lev- 
els or states in the UV/VIS spectral region usually 
correspond to electronic levels, whereas in the IR 
area they correspond to the energies of molecular 
vibrational modes. 


15.2.2.5.1 Energy level diagrams 


In Figure 15.7, a Jablonski energy level diagram for 
a typical organic molecule is depicted. It shows the 
energy levels and allowable transitions. It shows 
both electronic and vibrational states, but rota- 
tional states have been omitted to keep the dia- 
gram simple. At room temperature, most of the 
molecules are resting in the lowest electronic and 
vibrational state, so resonance absorption usually 
takes place to various excited higher level vibra- 
tional levels or electronic states. To illustrate these, 
to the right of this energy level diagram, three 
types of resulting intensity spectra [absorbance 
(A), fluorescence (F), and phosphorescence (P)] are 
shown as a function of wave number. 

The relative strengths of absorptions depend on 
the momentum of the transitions and are deter- 
mined by a set of spectroscopic selection rules. 
Radiative transitions requiring a change of elec- 
tron spin are not allowed and, in organic mole- 
cules, anormal ground state has a singlet property. 
Thus, a normal absorption spectrum is for a transi- 
tion from S, to S,. Such a transition occurs within 
a very short time, typically ~10-'5s. Usually, only 
the first excited electronic state is important in 
spectroscopy because all higher states have very 
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Table 15.2 Possible deactivation process from the vibrational ground state of the first excited singlet 


state 


Process type Abbreviation 


Radiationless Te Thermal 


transition equilibration 


Ic Internal conversion 


Isc Intersystem crossing, 
intercombination 


Spontaneous F Fluorescence 
emission as 
a radiative 
process 
Photochemical P 


reactions 


Name of process 


Phosphorescence 


Process description 


Relaxation from a high vibrational level 
within the present electronic state. 
Lifetime depends on matrix. 

Inner deactivation (transfer of energy in 
torsional vibrations). 

Isoelectronic transition within the same 
energy level system from the vibrational 
ground state of a higher electronic state 
into the very high energy vibrational state 
of a lower electronic state. 

Isoelectronic transition into another energy 
level system (S © T), usually from the 
vibrational ground state in the 
electronically excited state; respective 
radiative transition is forbidden because of 
the spin inversion prohibition (except for 
heavy nuclei). Therefore, phosphorescence 
is a “forbidden” process. 

Without spin inversion from, e.g., S;—So 
(provided that lifetime of the 
electronically excited state is 10-®s) within 
singlet system. 

Out of triplet into singlet system (provided 
that lifetime is 10-*s) very low probability, 
only possible at low temperatures or in a 
matrix photo-induced reaction starting 
from the S, term leading to ionization, 
cleavage, or in a bimolecular step to a 
new compound or an isomer (trans—cis), 
provided that lifetime in excited state is 
relatively long. 


short lifetimes (relaxation times). For organic mol- 
ecules, usually paired spins are present (singlet 
states), whereas in inorganic transition metal com- 
plexes the ground state can be a triplet and other 
states with a large number of unpaired electrons 
may occur according to Hund’s rule. An impor- 
tant example of a triplet ground state molecule is 
oxygen. 


15.2.2.5.2 Electronic chromophores 


Unlike the case for IR spectra, where vibrational 
bands are very useful for the analysis of compo- 
nents, or of functional groups, the occupancy of, 
and energy between, the electronic levels (i-e., those 


which usually give rise to UV/VIS spectra) does 
not depend so much on the molecular structure or 
on interactions with the matrix. However, chang- 
ing the electronic ground or excited states can lead 
to a shift in the relative position of the electronic 
level or to a change in the degree of polarization. 
In general, UV/VIS spectra are less easily used to 
characterize chemical components. However, in 
some specific cases, for example, steroids, incre- 
mental rules can be determined that allow both 
determination of, and discrimination between, 
some of these molecules. 

Various transitions between electronic and 
vibrational levels are shown in the energy level 


580 Spectroscopic analysis 


Wavenumber. cm7!_ ———» 


+ Relative intensity 


Figure 15.7 Jablonski energy level diagram, showing electronic and vibrational energy levels, and the 
resulting spectra for different transitions and pathways of deactivation. 


diagram in Figure 15.7. The following abbrevia- 
tions are used: rd (radiationless transition), IC 
(isoenergetic internal conversion within one elec- 
tronic term scheme), ISC (isoelectronic intersys- 
tem crossing between term schemes). Depending 
on the electronic states involved, these transitions 
are called oo*, mx*, or nx* transitions (see Figure 
15.6). In aromatic compounds, electron-attracting 
or electron-repelling components, such as func- 
tional groups in the ortho-, para-, or meta-posi- 
tions of an aromatic ring, can affect these energy 
levels, and they may also be changed by solvent 
effects or with pH changes. 


15.2.2.5.3 Linewidth and line 
broadening effects 


Although the basic absorption process occurs at 
discrete wavelengths, there are many mechanisms 
that can effectively broaden the effective absorp- 
tion line [7]: 


e Velocity or movement effects, causing trans- 
lational Doppler shifts: They are usually only 
important for gas absorption lines. Single- 
direction wavelength shifts of this nature will 
always occur in any fluids that are moving 
rapidly (e.g., fast-moving gas jets), but the most 
usual effect is due to the rapid movement of gas 


molecules, which is defined by well-known gas 
laws and of course depends strongly on the gas 
temperature. 

Rotations of the molecule: They can occur 
according to the number of degrees of freedom 
of the molecule and will cause a set of discrete 
energy (frequency) shifts, depending on the 
quantized rotational levels. 

Interactions with other atoms in a molecule: 
Such interactions increase, for example, the 
number of degrees of freedom of vibrational 
and rotational transitions (e.g., number of pos- 
sible modes of vibration), so molecules con- 
taining more than about six atoms usually tend 
to lack a fine-line absorption band structure. 
Interaction with other atoms or molecules by 
collisions: A simple gas with atoms or mol- 
ecules having a fine line absorption structure at 
low pressure will exhibit increasingly broader 
lines as pressure is increased. This is due to an 
effect called collision broadening or pressure 
broadening and is due to the additional energy 
levels that arise due to collisions. If the pres- 
sure is increased sufficiently, any closely spaced 
sets (manifolds) of narrow absorption lines will 
eventually merge into a continuous absorp- 
tion band, just as occurs in condensed phases 
(liquids and solids). 
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e Interaction with nearby atoms or molecules via 
electric field effects (ligand fields): Eventually, 
in the condensed state, the absorption is 
almost invariably of a broadband nature. 

This is because the large number of possible 
energy levels in absorbing bands can produce 
a near continuum. Electronic energy levels, for 
example, are affected by the electronic fields 
of nearby molecules, a phenomenon known as 
ligand field interaction. 


15.2.2.5.4 Bandshifts of electronic levels 


Apart from line broadening, line shifts can occur. 
As shown in Figure 15.8, the aa* and na* transi- 
tions are influenced differently by polar solvents. If 
a cyclohexane solvent is replaced by methanol, the 
polarity increases considerably and, accordingly, 
the x* band is red shifted by several nanometers, 
a so-called bathochromic effect, and the nx* band 
is shifted to the blue (hypsochromic). Such band- 
shifts can be used to obtain information regarding 
the properties of the energy levels and transitions. 
A change in intensity of the absorption band is 
described as either hyperchromic or hypochromic, 
respectively, depending on whether there is an 
increase or decrease in intensity. 


15.2.2.5.5 Quantifying absorption levels 
in media 

The power, P(A), transmitted through a sample ina 

small wavelength interval at a center wavelength A, 

is given by Lambert’s law: 


Red shift 


loge 


Cyclohexan 
(C) 


ri Methanol 
/ (M) 


P(A) = Py(A)exp[-a(A)], (15.7) 
where P,(A) is the power entering the sample in this 
wavelength interval, a(A) is the attenuation coef- 
ficient of the material at wavelength A, and ¢ is the 
optical path length through the sample to the point 
at which P(A) is measured. This is only true for 
homogeneous nonturbid, nonfluorescent samples. 
The sample can be said to have a transmission T(A), 
at the wavelength i, where 
T(A)=Tout/To- (15.8) 
The transmission, T, is given by 0STS1; or 
T*100% in percent transmission. Alternatively, 


the sample can be said to have an absorbance 
A(A), where 


A(A) a logio[1/T(A)] = logio (Lo/Lout) 


= logy [Py(A)/P(A)]=0.430(A) 2. (15.9) 
The factor 0.43, which is the numerical value of 
log(e), has to be included to account for the use of 
log,) for decadic absorbance calculations, whereas 
natural (to base e) exponents are normally used for 
attenuation coefficients. 

An absorbance of 1 therefore implies only 10% 
transmission. One should take care in the use of 
the term “absorbance,” as defined above and as 
measured by most instruments, as this term seems 
to suggest the optical loss is only due to absorption, 


Figure 15.8 Illustration of band shifts caused by polar solvents. 
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whereas, in cases of turbid samples, it might actu- 
ally arise from a combination of absorption and 
scattering (and reflection at boundaries of cuvettes). 
According to the Lambert-Beer-Bouguer law, the 
value of A for an absorbing analyte is given by 


A= log(I)/I) = MCL, (15.10) 


where M is the (decadic) molar extinction coef- 
ficient, C is the molar concentration, and L is the 
optical path length. 

With merely measuring the intensity when using 
broadband radiation, there is a deviation from the 
above law whenever the absorption varies with 
wavelength. Thus, to avoid errors in calibration and/ 
or absorption coefficient measurement, monochro- 
matic radiation has to be used. Even at single wave- 
lengths, however, the law has the wrong dependence 
on concentration if there is clustering or association 
of molecules or if a concentration-dependent or a 
pH-dependent chemical change occurs. 


15.2.2.5.6 Frustrated total internal 
reflection, evanescent field 
absorption 


In contrast to what is usually taught in classi- 
cal optics, total internal reflection is not really 
“total” [12] at the interface between the media. A 
fuller examination, using either classical geomet- 
ric optics or quantum optics, predicts that part of 
the radiation will penetrate the interface and then, 
for a certain distance, be guided outside the opti- 
cally denser medium, leading to a lateral shift in 
the apparent point of reflection from the interface. 
This field that extends into the less dense medium 
is called the evanescent field. In the medium hav- 
ing the lower refractive index, it decreases expo- 
nentially away from the interface. 

Any absorbing medium within an evanescent- 
field penetration depth, d, of this field, can absorb 
the radiation and result in an attenuation of the 
otherwise total reflectance. The value of d (mea- 
sured from the interface) is given by Equation 15.11: 


rn 
2n Vn sin’? @,—n; 


d= (15.11) 


This penetration depth is typically of order of 
half the wavelength of guided radiation. Thus, 


absorption of the evanescent wave will reduce 
the intensity of the otherwise “totally inter- 
nally reflected” light via this electric field vector 
coupling. 

This effect is called frustrated total inter- 
nal reflection. It can be made practical use of 
for measurement of highly absorbing samples, 
which may absorb (or scatter) far too much 
light to be able to measure them by traditional 
methods. For example, complex samples such as 
paints, foodstuffs, or biological materials, which 
might absorb or scatter light very strongly, can 
be analyzed for their strongly absorbing com- 
ponents by placing them in contact with a high- 
index glass plate and measuring the internally 
reflected light. 


15.2.2.6 FLUORESCENCE 


Fluorescence is an inelastic energy relaxation pro- 
cess, which can follow after absorption of light in 
a material. In most materials, absorption of light 
(i.e., energy loss from a photon) merely causes heat- 
ing of the absorbing material, with all the absorbed 
energy being converted to internal kinetic energy 
(e.g., to excite molecular vibrations or phonons). 
However, in some materials, only part of the 
energy of an optically excited state is dissipated as 
heat. Accordingly, one or more photons of lower 
energy than the incident one are radiated. The flu- 
orescence therefore has lower photon energy (so is 
usually emitted at a longer wavelength) than that 
of the incident light (this is called a Stokes process). 
The emission of light by fluorescence has no pre- 
ferred direction and is said to be omnidirectional. 
It is also randomly polarized. Of course, a single 
photon fluorescent event must, by its nature, have 
the direction and polarization of the single pho- 
ton emitted, but over a period of time the average 
reradiated energy from many such events is omni- 
directional and so fluorescent light is randomly 
polarized. This aspect of fluorescence can be used, 
apart from wavelength and temporal persistence 
(decay lifetime, after pulsed illumination) dif- 
ferences, to distinguish it from Rayleigh and/or 
Raman scattered light, which is usually strongly 
polarized. 

Fluorescence detection is a valuable tech- 
nique in chemical sensing where it can be used 
to directly monitor certain fluorescent chemicals 
(e.g., polyaromatic hydrocarbons, mineral oil, and 
fluorescein tracer dye). However, by deliberately 
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introducing reactive fluorophores to act as a more- 
efficient chemical indicator, it can also be used to 
monitor reactions or reagents having no fluores- 
cence. These are too numerous to cover here, as 
optical techniques have been used [14]. Many of the 
optical sensing applications of these are covered 
in detail in the proceedings of a series of interna- 
tional “Europt(r)ode” congresses. There is also an 
excellent textbook “Optical Sensors,” edited by R. 
Narayanaswamy and O.S. Wolfbeis [15]. 


15.2.2.7 CHEMILUMINESCENCE AND 
BIOLUMINESCENCE 


Fluorescent light can arise as the result of chemical 
reactions, an effect known as chemiluminescence. 
The reaction must be of a type to leave electrons 
in excited states that can then decay radiatively, 
usually with the emission of visible photons. Such 
reactions are now very commonly seen in the 
plastic-tube-enclosed chemical “light sticks” that 
children are often given, but which also have more 
serious uses for emergency lighting or as a distress 
signal at sea or in mountains. These lights operate 
by breaking chemical-filled ampules, enabling the 
reactants to mix and produce the light by a chemi- 
luminescent reaction. 

A compound commonly used to produce 
green light is called luminol (C,H,N,O,), which is 
known by several other chemical names, includ- 
ing 5-amino-2,3-dihydro-1,4-phthalazine-dione. 
When luminol is added to a basic solution of oxi- 
dizing compounds, such as perborate, permanga- 
nate, hyperchlorite, iodine, or hydrogen peroxide, 
in the presence of a metallic-ion catalyst, such as 
iron, manganese, copper, nickel, or cobalt, it under- 
goes an oxidation reaction to produce excited elec- 
tronic states, which decay to give green light. The 
strongest response is usually seen with hydrogen 
peroxide, but very low concentrations of oxidizing 
agents such as ozone, chlorine, and nitrogen diox- 
ide can also be measured. Numerous biochemicals 
can also cause a light-emitting reaction and hence 
be detected. 

Nature has, however, used chemilumines- 
cence long before man, involving a phenomenon 
known as bioluminescence, where the chemicals 
are generated within biological organisms. Well- 
known examples of this include the light from 
glowworms and fireflies and the dull glow aris- 
ing from the bioluminescence of myriads of tiny 
organisms seen in agitated seawater, for example, 


in the wake of boats. Many deep-sea creatures 
use bioluminescence, either to make themselves 
visible or to confuse prey. The reactions are 
essentially of the same type as that with luminol, 
except that the origin of the chemicals is from 
biological reactions. 


15.2.2.8 RAMAN SPECTROSCOPY 


Raman spectroscopy [16] relies on a form of inelas- 
tic scattering, where the periodic electric field 
of the incident radiation again induces a dipole 
moment in a material, but, in this case, there is an 
interaction with material vibrations, or phonons, 
resulting in a change of photon energy. The effect 
depends on the polarizability « of the bond, at the 
equilibrium internuclear distance and on the varia- 
tion of this distance, x according to Equation 15.12 
below. This equation has three terms—the first 
represents the elastic Rayleigh scattering (scatter- 
ing at the same frequency as the incident radiation: 


00 transition). 
x] 
a= Qo AE ES 
Ox 


In the equation below, there are polarizability 
terms that have the effect of reducing or increas- 
ing the reradiated frequency, compared with that 
of the incident light, by an amount equal to the 
molecular vibrational or/and rotational frequency. 
This causes so-called Stokes (downward) and anti- 
Stokes (upward) shifts in the reradiated optical fre- 
quency. This light is referred to as Raman scattered 
light. 


(15.12) 


1 
P =A Ep cos2T Vot+— do. 
——_.———_ _ 2\dx 


Rayleigh scattering 


XyEo[cos 2m (Vp —V, )t+cos 27 (Vo +v,)t]. (15.13) 


The coupling of vibrational eigenfrequencies (v,) of 
the scattering molecule to the exciting frequency 
(v,) is sometimes called a photon-phonon interac- 
tion process. 

Raman spectroscopy occurs with selection rules 
that are different to those for normal vibrational- 
absorption IR spectroscopy. Because of this, many 
users regard analysis by Raman scattering and by 
conventional absorption-based IR spectroscopy to 
be complementary techniques. This can be very 
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convenient, but unfortunately, the Raman scatter- 
ing intensity is very weak, even when compared 
with Rayleigh scattering. Despite this, however, 
with the advent of many high-power lasers, it is in 
many cases now a preferred technique. Not only is 
it relatively free of problems that can arise due to 
turbidity or scattering from amorphous solids, it is 
now far more easy to achieve with low-cost visible 
or NIR semiconductor lasers, allowing compact 
instrumentation which has drastically improved 
in recent years [17]. 


15.2.2.9 MULTIPHOTON SPECTROSCOPIC 
PROCESSES 


Apart from the simple processes described above, 
there are many light absorbing and/or re-emission 
processes that involve more than one photon. It 
is beyond the present text to discuss multiphoton 
spectroscopy, but it is worth making a few short 
comments. Examples include two-photon absorp- 
tion, where it requires simultaneous involvement 
of two photons to excite an absorbing level (is very 
unlikely to occur, therefore a weak effect) or where 
the first photon excites the molecule to a meta- 
stable level and then the second photon excites it 
to a higher level from this intermediate level. Of 
course, many photons can be involved if an appro- 
priate set of metastable levels is present. 

We have also hardly discussed an optical gain 
process, which is the mathematical opposite of 
absorption. This occurs in media which have elec- 
tronic levels which have already been excited, often 
by light injection. This is rarely used in spectros- 
copy, apart from an intercavity measurements 
within lasers, but the process is often outside the 
gain region anyway, so will not be discussed fur- 
ther here. 


15.3 SPECTROSCOPIC METHODS 
AND INSTRUMENTATION 


Instrumentation has always been a very fast 
developing field, so only the basic methods will 
be discussed. The interested reader should con- 
sult specialist texts for more details of instrument 
design and manufacturers data for the latest per- 
formance figures. The fastest developing com- 
ponent areas are those of (1) low-noise detector 
arrays, both 1-D and 2-D (two-dimensional) types, 
and (2) compact high-power semiconductor laser 
sources for Raman and fluorescence excitation, 


Fiber optics 


Diode array 


Figure 15.9 Spectrometer detection unit for 
reflectometric measurements, containing a white 
light source and a CCD camera. (A mirror is used 
in the light source to increase launch efficiency 
by imaging the filament on itself to “fill in” gaps 
in the filament.) 


where performances are now achieved that were 
barely conceivable only 10 years ago. 

Just as an example of these, to illustrate the type 
of components needed, a compact fiber-compatible 
spectrometer source and detection unit with a 
broadband source and concave diffraction grating 
is shown in Figure 15.9. In most cases, xenon or 
halogen lamps are a convenient source of broad- 
band white light, which (often with the help of a 
rear reflector, to aid launch efficiency) is coupled 
into a fiber optic guide and then directed to the 
measurement probe. Here, transmitted or reflected 
light is collected and, via the coupler, guided back 
to a miniaturized spectrometer, which contains the 
diffraction grating and a photodiode array, such as 
a charge-coupled device (CCD) detector chip. 


15.3.1 Spectrometer components 


15.3.1.1 LIGHT SOURCES 


Whenever an intense monochromatic light 
source or an intense light source that can be 
scanned over a small spectral range is required, 
the source of choice is nearly always a laser. This 
is clearly the optimum type of source for PCS or 
for Raman spectroscopy, as a monochromatic 
source is required. However, broadband sources, 
such as incandescent lamps, are desired for most 
full-spectrum spectrophotometry, as intense nar- 
rowband sources, such as lasers or low-pressure 
mercury lamps, either cannot be used for observ- 
ing spectral variations over such a wide range or so 
many would be needed to cover the range that they 
would be prohibitively expensive. 
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Figure 15.10 Comparison of emission spectra of a few discharge lamps and arc lamp sources. 


Because most types of incandescent light 
sources were discussed in detail in Chapter 3, we 
shall cover only the most interesting spectral fea- 
tures of these broader band radiation sources, in 
particular types covering the important UV/VIS/ 
NIR spectral regions. Tungsten lamps are low- 
cost and stable sources, which give a useful output 
over the VIS/NIR region (0.4-3.0 pm). Electrically 
heated thermal sources, such as Nernst emitters or 
modern variants, are usually used to cover the IR 
region beyond 3 pm. 

High-pressure arc and discharge lamps are 
attractive sources in the visible and UV, as they 
can provide the desired broad spectral emission, 
but with significantly higher spectral intensity 
than tungsten lamps. In the visible and near IR 
region, the low cost and stable output of tungsten 
sources often means they are the preferred source. 
If operated at slightly reduced voltage, their life- 
time is excellent, too. However, discharge lamps 
are far more effective sources than tungsten in the 
blue and violet regions and are far better in the 
UV region. Thus, the type of discharge lamp most 
often used to cover these regions in commercial 
spectrophotometers is the deuterium lamp, as this 
is the source that can provide the desired broad- 
band performance at shorter optical wavelengths. 
Figure 15.10 provides a simple comparison of dis- 
charge lamps. 

It is worth mentioning one more broadband 
source, the xenon flashlamp, that is very useful 


for spectroscopy. The output spectrum of it is, of 
course, very similar to that of the xenon arc lamp 
shown above. However, because it is only operated 
in short pulses and at a very low duty cycle (maybe 
a 100 pS flash every 0.015), it can be driven at very 
high current density without destroying the elec- 
trodes. This provides an even brighter source than 
the continuously operated arc-lamp version, albeit 
for very short “on” periods. These lamp sources 
also cover the UV, VIS, and part of the NIR spec- 
trum. They are not only very compact and have low 
average power consumption, but can also provide 
a high peak energy output. This can give an optical 
signal at the detector that may be orders of mag- 
nitude higher than the thermal noise threshold of 
many optical receivers. They can, therefore, be used 
with optical systems, such as fluorimeters, where 
large energy losses often occur, yet still provide a 
good signal/noise ratio at the detector. Because of 
their pulsed output, they are also well suited for 
fluorescence lifetime measurements, provided the 
measured lifetimes are longer than the decay time 
of light output from the pulsed lamp. 


15.3.1.2 COMPONENTS FOR OPTICAL 
DISPERSION (WAVELENGTH 
SEPARATION AND FILTERING) 
15.3.1.2.1 Prisms 


Many older spectrometers used prisms constructed 
of optical glass or vitreous silica. These exhibit 
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Figure 15.11 Dispersion using a prism: radiation 
incident on the prism is refracted and partially 
dispersed as it is deflected in @ direction toward 
the base of the prism. An exit slit selects the 
wavelength from the “rainbow” of radiation. 
Usually, the red wavelength is less refracted than 
the blue one. 


a change of refractive index with wavelength, 
thereby giving wavelength dependent refraction, 
as shown in Figure 15.11. Collimating the input 
light and refocusing onto a slit behind the prism, it 
is possible to filter out “monochromatic” radiation. 

In the UV, vitreous silica prisms are still occa- 
sionally used as wavelength dispersion elements, 
since they transmit well and and their dispersion 
is higher in this region than in the visible, but, 
even despite this, prisms are now rarely used in 
spectrometers. Because the prism material must 
be transparent and have a significant optical dis- 
persion, greater problems with material selection 
arise in the mid-IR, so this was the historical rea- 
son why diffraction gratings were first used in this 
wavelength region. 


15.3.1.2.2 Diffraction gratings 


Diffraction gratings are based on coherent inter- 
ference of light after reflection from, or refraction 
through, an optical plate with multiple paral- 
lel grooves. The dispersion of a grating depends 
on the spacing of the grooves, according to the 
well-known Bragg condition that describes the 
diffraction angle, i.e., the angle at which there is 
constructive interference of the diffracted light. In 
the more common reflective diffraction gratings, 
the surface has parallel grooves, of a “saw-tooth” 
cross-section, and these grooves have reflective 
surfaces. Apart from a high surface reflectivity, 


the angle of the “saw-tooth” cross-section of the 
grooves to the plane of the surface is important to 
achieve good diffraction efficiency. The angle of 
the groove profile is called the blaze angle and this 
defines a blaze wavelength at which the diffracted 
power is maximized at the desired diffraction 
angle. The blaze angle can be chosen to maximize 
diffraction for the first order or for a higher order 
if desired (see Figure 15.12) [10]. 

Diffraction gratings were initially machined 
directly in a material that acted as the mirror 
reflector, and the greater ease of making gratings 
that had a greater intergroove spacing, mean- 
ing machining tolerances were less difficult, was 
another historical reason why gratings were first 
made for the IR region. 

Most diffraction gratings are now made by one 
of two different processes. The first process, to 
make “replica” gratings, involves pressing (mold- 
ing) of an epoxy-resin-coated glass substrate 
against a ruled (or “master”) grating that has been 
previously coated with an aluminum or gold film, 
with pretreatment to ensure this film will “release” 
easily from the surface of the master when the 
epoxy has set and the two substrates are separated. 
This process duplicates the contours when the rep- 
lica grating substrate, with the reflective metal film 
now attached to the outer surface of the resin, is 
pulled away from the master. The other production 
process, which is becoming ever more sophisti- 
cated, is to use photolithography to etch a grating 
in the material. The regular spacing is achieved by 
exposing photoresist using two converging paral- 
lel light beams to give a regular set of parallel and 
equally spaced interference fringes. These grat- 
ings can now be made by exposing the resist and 
etching the substrate in small steps, in a way that 
allows control of the blaze angle. These are called 
holographic gratings, because of the interference, 
or holography process that is used to expose the 
photoresist. 


15.3.1.2.3 Optical detectors 


Optical detectors and their associated amplifiers 
have been discussed in considerable detail in ear- 
lier sections, so there is no need to discuss them at 
length here. However, it is useful to comment that 
the most common single-element detectors used in 
wavelength-scanned spectrophotometers include 
photomultipliers (mainly for the UV range, below 
400 nm), silicon photodiodes (VIS and NIR range 
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Figure 15.12 Dispersion at a diffraction grating: on the bottom left, the angle of incidence, a, cor- 
responding to the strongest diffraction, depends on the groove spacing constant, g. The groove 
profile can be varied to give more light in a preferred direction and the grating is usually specified as 
having a blaze angle for a desired wavelength. In the upper part of the diagram, it can be seen that 
dispersion by the grating depends on the interference order. Higher orders suffer more dispersion, so 


hence provide better wavelength resolution. 


from 400 to 1000 nm), and photoconductive detec- 
tors (usually to cover above 1000 nm wavelength 
range). IR detectors are often cooled, using ther- 
moelectric Peltier devices or mechanical thermal- 
cycle engines, to improve their inherently worse 
noise performance. 


15.3.1.2.4 Optical detector arrays 

Detector arrays are most commonly used in the 
visible and NIR region, primarily based on silicon 
(400-1050 nm) or GaInAs (500-1800 nm). One of 
the simplest approaches is to use a linear array of 


discrete photodiodes, each with its own separate 
preamplifier. However, it is more common in the 
silicon (400-1050 nm) wavelength range to use 
self-scanned arrays. 

Advances in silicon CCD and other forms of 
self-scanned detector arrays have been tremen- 
dous in recent years, initially driven by develop- 
ments for camera technology, but with better 
performances being achieved for low-light-level 
cameras or for scientific uses (e.g., spectros- 
copy and astronomy). Individual pixel noise lev- 
els are now getting down to levels of below 0.01 
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photoelectrons per second in the most advanced 
cooled scientific versions. All of these types of 
low-noise self-scanned detectors effectively store 
photoelectrons (in the capacitor formed by the 
individual detector diode elements) until they are 
electronically scanned out in the read-out pro- 
cess. Note that detectors and detector arrays were 
reviewed extensively in earlier sections. 


15.3.2 Hardware instrumentation for 
spectroscopy 


15.3.2.1 SPECTROMETERS: INSTRUMENTS 
FOR MEASURING OPTICAL 
INTENSITY, AS A FUNCTION OF 
WAVELENGTH 


A spectrometer consists of a dispersive element, 
most commonly a diffraction grating, as described 
above, and a detection system. The latter is, most 
commonly, a silicon self-scanned CCD array (or a 
similar alternative), whenever visible or NIR (400- 
1050 nm) measurements are taken. 

The signal to noise of the spectral analysis and 
detection part of spectrometers is often enhanced 
by various optoelectronic means. One of the most 
common methods for VIS/NIR use is to employ 
multielement self-scanned detector arrays with 
a fixed-grating light-dispersing system, as was 
shown in Figure 15.9 above. 

A large proportion of these low-noise self- 
scanned arrays are 2-D types, with nxm detec- 
tors with up to several million elements, or pixels. 
Such 2-D arrays are clearly needed for cameras, for 
either low-light level terrestrial or astronomical 
use, and they offer an attractive advantage for use 
in spectroscopy, allowing use of a technique called 
spectral imaging. If light that passes through an 
input slit of a spectrometer is dispersed by, for 
example, a diffraction grating, and then is focused, 
or “imaged,” onto such a 2-D detector, light from 
each part of the input slit will produce its own 
individual spectrum or “rainbow” image on this 
detector. This offers the possibility of measur- 
ing, independently, the spectrum of light that has 
entered at each point along the length of the input 
slit. The method therefore has the attractive prop- 
erty of measuring, simultaneously, the spectrum 
at many separate points on the input slit. There 
are many real-world applications of this. One, in 


the field of fiber-remoted spectroscopy, is to allow 
multiplexing of light exiting from many optical 
fibers held in the focal plane of the spectrometer. 
Each of these can be positioned in line, just as the 
single fiber in Figure 15.9, but now, using a 2-D 
detector array in the output focal plane, a simulta- 
neous measurement of the spectrum of light from 
each fiber can be performed. This is then called an 
imaging spectrometer. 


15.3.2.1.1 Alternatives to detector 
arrays 


A common alternative to array technology, partic- 
ularly in the mid- and far-IR region, is to use some 
form of transform system, with a scanned optical 
filter, having a complex multiwavelength charac- 
teristic. Then, the temporal changes in the detected 
signal are electronically decoded to recover the 
spectrum. Such methods will be discussed in more 
detail in a later section. 


15.3.2.2 THE SPECTROPHOTOMETER, 
AN INSTRUMENT TO MEASURE 
TRANSMISSION OR ABSORBANCE 


Measurement of transmission loss usually involves 
the determination of light lost directly from a col- 
limated light beam. This is normally performed 
in an instrument called a spectrophotometer. The 
result is usually required as a function of optical 
wavelength. The instrument normally includes a 
broadband light source, precise collimation optics 
to form the measurement beam or beams, some 
form of tunable optical filter to select the wave- 
lengths measured, and finally a detector. This 
combination then allows transmission loss versus 
optical wavelength to be observed. 

In the past, spectrophotometers were nearly 
always dual-beam instruments. In these, light from 
the source was split into two beams, which were 
alternately directed, first through a measurement 
sample path, where materials to be measured were 
placed, and then, via a reference optical path that 
contained no sample. This configuration allowed 
comparison (by signal division) of the transmis- 
sion of each path, free of any intensity or spectral 
variations in the light source, or any spectral varia- 
tions in the detector or in the collimation and fil- 
tering optics. 

Many modern instruments are stable enough 
to first scan the wavelength range to take a 
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wavelength-dependent intensity measurement 
without a sample in the beam, store the result ona 
PC and then rescan with the sample now inserted. 
Simple signal division, using the PC, then gives the 
transmission of the sample. 


15.3.2.3 INDIRECT MEASUREMENT OF 
ABSORPTION USING FRUSTRATED 
INTERNAL REFLECTION METHODS 


As mentioned above, frustrated internal reflec- 
tion methods are useful for the measurement of 
highly absorbing samples, which may absorb (or 
scatter) far too much light to be able to measure 
them by traditional methods. For example, com- 
plex samples such as foodstuffs or biological mate- 
rials, which might scatter light very strongly, can 
still be analyzed to detect strongly absorbing spe- 
cies, by placing them in contact with a high index 
glass plate and measuring the internally reflected 
light. Such a plate, with suitable coupling optics, is 
a common inclusion in the sets of optional attach- 
ments that can be purchased for use with com- 
mercial spectrophotometers. This evanescent field 
can be used for various measurements, as demon- 
strated in Figure 15.13. 

Another characteristic of this evanescent field 
is that fluorescent materials or fluorophores close 
to the interface can absorb this evanescent field 
radiation and induce fluorescence. The evanescent 
field can be used to monitor effects very close to the 
interface because absorption clearly cannot take 
place beyond the penetration depth, so no fluoro- 
phores in the bulk are monitored. 
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15.3.2.4 MEASUREMENT OF INDIRECT 
EFFECTS OF ABSORPTION: 
PHOTOTHERMAL AND 
PHOTOACOUSTIC SPECTROSCOPY 


Photothermal spectroscopy is a means of detecting 
or monitoring the absorption of materials, usually 
liquids or gases, by observing the resulting heat. 
The idea is that an intensity-modulated intense 
light source is used to illuminate the sample and 
this raises its temperature when light energy is 
lost from the beam. An intense laser beam is most 
commonly used. It first heats the fluid and then 
becomes deflected by the transverse variation of 
refractive index that occurs when the resulting 
heated region induces convection currents. Such 
currents give rise to thermal gradients. In addition, 
thermal-lensing effects can occur, which cause 
variation of beam diameter after passing through 
the fluid, because there is a higher temperature rise 
along the central axis of the beam. 

Whichever method is used, the resulting deflec- 
tions or beam diameter changes can be observed 
on optical detectors, often by enhancing the sensi- 
tivity to changes in the beam position or diameter 
by using lenses and spatial filters. 

Photoacoustic spectroscopy is a special case of 
photothermal spectroscopy, where the light from 
an intense source, again usually a laser, is inten- 
sity modulated, for example, with a pulse, step or 
sinusoidal waveform, and the resulting changes in 
thermal expansion due to the heating are observed, 
usually with a traditional acoustic microphone. 
Gases have low specific heats and large expansion 
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Figure 15.13 Illustration of the evanescent field caused by “total internal reflection.” It can be partially 
absorbed (attenuated total reflection) or can excite fluorescence and surface plasmon resonance. 
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coefficients, so these are relatively easy to detect 
by this method. The light source can also be swept 
in optical frequency (or a broadband source can 
be passed through a swept optical filter) to allow 
spectral variations over an absorption line, or over 
a wider spectral region, to be observed. 

A major advantage of the method for the mea- 
surement of liquids is that it can be used with tur- 
bid samples, for example, liquids containing small 
bubbles, or a suspension of nonabsorbing scatter- 
ing particles, in a gas or liquid. In a conventional 
spectrometer, it is clearly not normally possible 
to distinguish between the light lost from the col- 
limated beam due to elastic scattering (turbidity) 
and the light lost due to absorption. 


15.3.2.4.1 Monitoring of 
chemiluminescence 


In order to detect chemiluminescence and mea- 
sure this weak luminol green light, it is best to use 
a photomultiplier, with some form of light-reflect- 
ing cavity (light-integrating chamber) to ensure 
most of the light hits the sensitive photocathode. 
Because photomultipliers have good green-light 
sensitivity and can even detect single photon 
events in the darkened state, very low concentra- 
tions of oxidizing agents can be measured, includ- 
ing hazardous oxidizing gases such as ozone, 
chlorine, and nitrogen dioxide. Numerous bio- 
chemicals can also cause a light-emitting reaction 
and hence be detected. Chemiluminescence is the 
basis of a number of commercial chemical sensors 
for important biochemicals. 

A particularly useful reaction for law enforce- 
ment is the one that luminol has with blood, 
enabling crime scenes to be sprayed with this 
compound and then be viewed in the dark, when 
telltale glows appear wherever traces of blood are 
present, but here low-level camera detection is the 
simplest method. 


15.3.2.5 INSTRUMENTATION FOR 
FLUORESCENCE SPECTROSCOPY 


Fluorescence spectroscopy involves illumina- 
tion of a sample with a monochromatic or filtered 
light source and observing the reradiated signal, 
which is almost invariably at a longer wavelength 
than the incident light. Light is either measured 
perpendicular to the incident optical axis, or for 
more sensitive detection, by collecting light emit- 
ted over wider angles with an integrating sphere. 


It is common to observe either the fluorescence 
spectrum or the fluorescent decay curve, following 
pulsed excitation from a pulsed source. The latter is 
usually a pulsed laser or filtered light from a xenon 
flashlamp. 

For fluorescence spectra, the most common 
method is to use a modified spectrophotometer 
where part of the omnidirectional fluorescent light 
is observed instead of the transmitted light. This 
can be done either with a dedicated instrument, or 
using a standard commercial spectrophotometer 
instrument fitted with a fluorescence attachment, 
which has appropriate optics to gather fluorescent 
light. There is no need to modulate the light source 
for this measurement, unless this is needed to 
allow lock-in (phase sensitive) detection methods 
to remove signals from unmodulated background 
light. When performing these measurements, con- 
siderable care has to be taken with the following 
filter-related aspects: 


e Removal of any longer wavelength “side-band” 
light from the light source, which could be 
elastically scattered in the sample or the instru- 
ment and mistaken for fluorescence. 

e Very careful filtering of the “fluorescent” light 
signal to remove any “cross talk” from elasti- 
cally scattered source light. 


The first aspect of source light filtering is less of a 
problem when using laser sources, although semi- 
conductor lasers can have some residual sponta- 
neous-emission component at longer wavelengths 
and gas or ion lasers can have residual light from 
the plasma discharge. Particular care is needed 
with incandescent, arc lamp, or xenon flash- 
lamp sources, with their broadband emission. In 
a commercial spectrophotometer instrument or 
a dedicated instrument (spectrofluorimeter), the 
built-in grating spectrometer acts as a very good 
filter, provided “stray light” levels are low, and also 
has the advantage that the excitation wavelength 
can be tuned if desired. Additional rejection of 
long wavelength light is usually done with “short- 
pass” dichroic multilayer edge filters. The problem 
of removing elastically scattered source light from 
the “fluorescent” light signal can be solved in sev- 
eral ways. Narrowband dielectric laser mirrors 
make excellent rejection filters in transmission 
mode, as these can be designed with a reflectivity of 
99.9% or higher. Dichroic long-pass edge filters are 
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also now available with excellent performance. In 
addition, there is a wide range of long-pass optical 
glass filters, which have semiconductor-band-edge 
type transmission behavior, commonly having 
short-wavelength (i.e., shorter than the band edge) 
absorbance as high as 10° in a 2mm thick filter. 
Care must be taken with these, however, as many 
of these fluoresce at longer wavelengths, so the first 
filtering stage is preferably done with a dielectric 
filter. Note that these filtering problems are even 
more acute for Raman scattering, so will be dis- 
cussed further in the section below. 

When it is desired to examine fluorescent 
lifetimes, the severity of the filtering problem 
is reduced by several orders of magnitude as the 
pulsed incident source will usually be no longer 
emitting by the time the intensity-decay curve 
of the consecutive fluorescence is now observed. 
However, strong light pulses can upset many sen- 
sitive detection systems or associated electronics 
for a short time after the pulse, so even here some 
degree of filtering is still desirable to remove light 
at the incident wavelength, and some source filter- 
ing may be necessary, too, if there is a spontaneous 
light decay occurring in the laser source itself. 

When measuring fluorescent lifetime, a fast 
detector may be needed. Fluorescent decay is com- 
monly in the form of an exponentially decaying 
curve, but lifetimes can typically vary from days, 
in the case of phosphorescence, to less than nano- 
seconds (important examples of samples having 
decay times in nanoseconds are the organic dyes 
often used in dye laser systems and some semi- 
conductor samples having short excited state 
lifetimes). When measuring very fast fluorescent 
decays, it is common to use photon counting sys- 
tems using photomultiplier (PMT) detectors. These 
have the advantage of high internal gain of the ini- 
tial photoelectrons, so the input noise level of even 
a fast electronic preamplifier is easily exceeded. 
Also, as the desired detection time is reduced by 
using a fast PMT and amplifier, the effective peak 
current represented by a given size “bunch” of 
electrons, which will arrive at the anode from a 
single-photon event, actually increases as the time 
interval, over which it is measured, reduces (cur- 
rent=charge/time). Thus, using fast photon count- 
ing technology, where each photon count is fed 
into sets of digital registers according to its time 
of arrival, very fast fluorescent decay curves can 
be measured. It is now becoming more common 


to design photon counting systems with avalanche 
photodiode detectors, which are operated in a so- 
called “Geiger” mode, where the incoming photon 
causes a full (but reversible, after the bias voltage is 
reapplied) conductive “breakdown” of the reverse- 
biased detector diode. 

If there is more than one fluorophore or more 
than one fluorescent decay process occurring, the 
decay make take the form ofa bi- or multiexponen- 
tial decay curve, equivalent to linear addition of 
two or more exponentially decaying functions. In 
simple cases, these can be separated with software, 
but in some cases appropriate choice of excitation 
wavelength may help to isolate individual curves in 
mixtures of fluorophores. 

Another common way of measuring fluores- 
cence lifetime is to intensity modulate the source 
with a periodic waveform, usually a sinusoidal or 
square-wave signal, and observe the phase delay in 
the fluorescent signal intensity waveform, relative 
to that of the incident signal. This is commonly 
done using an electronic system, which can multi- 
ply the detected fluorescence signal by an electronic 
signal analog to the incident-intensity waveform, 
and then averages the product over a set time inter- 
val. If desired, more than one multiplication can 
be performed, using reference signals of different 
phases. Such an electronic detector system has sev- 
eral names, which include vector voltmeter, lock- 
in amplifier, or phase-sensitive detector. It not only 
allows for excellent signal averaging to be done to 
improve signal/noise, but also enables the phase 
difference between the two signals to be measured. 
The advantage is that cheaper, simpler detectors, 
such as silicon photodiodes, can now be used, as 
the illumination duty cycle is improved (to 50%, 
in the case of square-wave modulation), which 
helps to improve the signal-to-noise ratio, but the 
disadvantage is that the shape of the decay curve 
cannot be seen. Another significant disadvantage 
is that the system now requires much better optical 
filtering, as any residual optical cross talk, where 
elastically scattered light from the source might 
be detected, will alter the effective phase of the 
detected “fluorescent” signal. 

An important feature is that fluorescence detec- 
tion can be performed with highly scattering 
samples, such as roughly cut or even powdered 
materials, and can be used to analyze the surface of 
translucent materials, as a clear transparent sample 
is not required. Also, as the transmitted light level 
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is not measured, very little sample preparation 
is needed. Another minor advantage is that the 
slightly shorter excitation wavelength means it can 
be focused ona slightly smaller diffraction-limited 
spot, enabling its use in fluorescence microscopes 
which excite the specimen via reasonably conven- 
tional optics. These advantages apply even more 
to Raman spectroscopy, which will be dealt with 
below, so they will be repeated again there for 
greater emphasis. 

A particular problem with fluorescence detec- 
tion is that many common materials will fluo- 
resce, particularly if illuminated with UV light. 
These include, starting with particularly trouble- 
some examples, organic dyes, compounds of rare- 
earth metals, chlorophyll, ruby, long-pass optical 
absorption filters, mineral oils, human sweat, 
many adhesives and even some optical glasses. 
Another problem can be that a high intensity exci- 
tation source may cause photodegradation. 


15.3.2.6 INSTRUMENTATION FOR RAMAN 
SPECTROSCOPY 


Raman spectroscopy can use conventional optical 
materials with low-cost semiconductor lasers and 
high-performance cooled-CCD detector arrays. A 
major advantage is that it can be used with highly 
scattering samples, such as roughly cut or even 
powdered and amorphous materials, and can even 
analyze the surface of apparently opaque materi- 
als. Obviously, a clear transparent sample is not 
required, as the transmitted light level is not mea- 
sured, so very little sample preparation is needed. 
Another major advantage is the short excitation 
wavelength that means it can be focused on a 
smaller diffraction-limited spot, enabling its use 
in Raman microscopes, which excite the specimen 
via reasonably conventional optics and allow a spa- 
tial resolution that would be impossible with direct 
IR excitation. Confocal microscopy is also possible 
using Raman methods. 

Raman spectroscopy requires a focused laser 
source to excite the very weak Raman scattering 
and a highly sensitive spectrometer to detect the 
very weak scattered light. The sample is illuminated 
with laser light, and the Raman scattered light is 
collected and often presented in terms of its power 
spectrum in wave numbers. Even in clear samples, 
such as optical glasses, which are only very weak 
Rayleigh scatterers, the Raman light intensity is 
usually several orders of magnitude even below 


that from this elastic scattering. A major part of the 
problem was that highly scattering samples, such 
as cut materials or powders, can have elastic scat- 
tering levels 5 or more orders of magnitude higher 
than clear samples. Thus, it is necessary to separate 
out the very much stronger elastic scattering that 
can occur at the same wavelength as that of the 
incident laser light, and which might be between 5 
and 9 orders of magnitude higher. The wavelength 
filtering to achieve separation usually requires at 
least two or maybe three stages of optical filtering 
to recover the desired Raman light. In the early days 
of Raman instrumentation, it was common practice 
to use rather large double or triple monochromators 
(ie., two or three cascaded grating spectrometers). 
Second generation Raman systems reduced the 
problem by directing the light though a holographic 
Raman-notch filter, which is a compact optical filter 
designed just to reject a very narrowband of light, 
centered at the incident laser wavelength. It was 
easier to construct this compact component using 
low-fluorescence materials, thereby easing the prob- 
lem of further separation in the dispersive spec- 
trometers. With such a filter present in the detection 
system, a single spectrometer could be used, saving 
both space and cost. However, at the same time, 
another modern technology emerged in the form 
of greatly improved self-scanned silicon detector 
arrays, having very low noise and high quantum 
efficiency and being capable of providing even bet- 
ter low-light performance than photomultipliers on 
each tiny pixel in the array. This has allowed a com- 
pact state-of-the-art detector array to be placed in 
the output focal plane of a single monochromator, 
eliminating the need for a narrow output slit that 
rejected most of the light and now allowing all the 
wavelength components of the Raman spectrum 
wavelengths to be measured at the same time. 

A more recent alternative to the Raman notch 
filter is the use of dichroic band edge filters. These 
interference filters, made in the usual way with 
a stack of dielectric layers on glass, have a long- 
wavelength pass characteristic that removes not 
only the undesirable elastically scattered light, but 
also the anti-Stokes Raman light. These new filters 
are highly effective, of relatively low cost, but far 
more stable than the previously used polymeric 
holographic ones used for notch filters. A minor 
disadvantage is the removal of the anti-Stokes sig- 
nals, but as these are not used in over 90% of appli- 
cations, this is not a major disadvantage. 
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The most troublesome residual problem with 
Raman systems is, as it always has been, that of 
undesirable background signals due to fluores- 
cence. Fortunately, using NIR laser illumination, 
typically at 800 nm wavelength or above, goes a 
long way toward reducing the background fluo- 
rescence of most common materials and contami- 
nants, so helps to make up for the weaker Raman 
signal when using longer wavelength excitation. 
There are also various signal subtraction meth- 
ods that can be used. These can, for example, 
take advantage of the polarization dependence of 
Raman or of the temporal decay behavior of flu- 
orescence, but these are beyond the scope of this 
introductory text. 

All the above developments have given many 
orders of magnitude improvement in the spectrom- 
eter and detection system, to a situation where now 
Raman scattering is often a preferred technique to 
absorption spectroscopy. It is beyond the scope of 
this text to go into detail on other techniques for 
enhancing Raman signals, but it is useful to men- 
tion two briefly. The first is resonance Raman scat- 
tering, where the excitation wavelength is close to 
an absorption line of the material to be monitored. 
The second is surface-enhanced Raman spectros- 
copy (SERS), where the use of a metal surface, or 
a surface covered with metal particles, can greatly 
enhance the signal from Raman scattering. A few 
materials, such as silver or silver particles, are 
highly effective for this. Both of the above meth- 
ods can enhance the Raman signal by between 4 
and 8 orders of magnitude, depending on condi- 
tions. Being a surface effect, however, the SERS 
method is clearly very sensitive to surface prepa- 
ration conditions and to subsequent treatment or 
contamination. 


15.3.2.7 INSTRUMENTATION FOR PHOTON 
CORRELATION SPECTROSCOPY 


The use of PCS for particle detection was discussed 
earlier. It is particularly useful for determination of 
particle size, over a range of a few nm to a few pm, 
simply by measuring scattered signals. Very small 
particles will undergo fast random movements as 
molecular collisions move them about (Brownian 
motion). In a conventional instrument, a polarized 
monochromatic TEM,, laser beam is usually used 
to illuminate the sample by focusing on a narrow 
beam waist, and a fixed detector is used to observe 
the scattered light from the beam-waist area. The 


laser source must be stable better than a small frac- 
tion of 1% and usually has a power of a few mW. 
Originally gas lasers were used, but now compact 
semiconductor lasers are replacing them. 

A single spatial mode (single “speckle”) in the 
far field will exhibit more rapid intensity changes 
with small scattering particles present compared 
to larger particles at the same conditions. In order 
to detect the changes of phase, the scattered light is 
traditionally imaged through a very small hole act- 
ing as a spatial filter, in order to provide the great- 
est intensity modulation index (greatest fractional 
change in optical intensity as the optical phase 
changes). Clearly, a large detector without any pin- 
hole aperture is unsuitable, as it would average the 
light from many interference points, and so would 
result in a smaller modulation index. 

For successful operation, entire optics must 
be very clean, and additional opaque baffles are 
often used to reduce stray light. Samples must be 
prepared very carefully to avoid highly scattering 
large dust particles or bubbles, and clearly particle 
aggregation (clustering) must be avoided. 

Following optical detection of the intensity 
changes, electrical spectral analysis (frequency 
analysis) of the signal scattered, for example, at 90°, 
could potentially yield valuable information on 
particle size, with smaller particles giving higher 
intensity modulation frequencies. Unfortunately, 
very small particles also scatter very weakly and, to 
act as an effective spatial filter, the receiving aper- 
ture has to be very small. As a result, the received 
photon flux is usually very low, sometimes only a 
few photons per second, making frequency analysis 
of a very noisy (random photon event) signal much 
more difficult. A preferred alternative method 
more suitable for use at low photon flux levels is 
the use of a method called PCS. Here, instead of 
analyzing the frequency of detected signals, the 
times of arrival of many individual photon pulses 
are correlated. The decay time of what, for mono- 
disperse (single-size) particles, is usually an expo- 
nentially decaying correlation function, can be 
derived using digital correlator systems. As stated 
earlier, this can, with knowledge of the tempera- 
ture and viscosity of the fluid in which the parti- 
cles are suspended, be related to the hydrodynamic 
radius of the particles using the Stokes—Einstein 
equation. The method works best with single-size 
(monodisperse) particles, but more complex corre- 
lation functions from suspensions of two or even 
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three particle types/sizes can be inverted using the 
Laplace transformation. 

The photon correlation method works well at 
these very low light flux levels, but requires the 
use of single photon counting detectors, such as 
photomultipliers or silicon avalanche photodiodes 
(APDs). Significant developments have been made 
to actively quench APD photon counters that are 
operated in avalanche mode, at very high reverse 
bias, to allow fast recovery from their photon- 
induced breakdown and be ready to detect a subse- 
quent photon. This allows instrumentation to take 
advantage of their superior quantum efficiency in 
the NIR [18]. 

Other developments have been to design fast 
correlators to process the photon count signals 
and recover particle size information. PCS can 
typically achieve an accuracy of order 0.5% in par- 
ticle size on monodisperse samples. With more 
sophisticated signal processing, it is possible, pro- 
vided conditions are suitable, to derive estimates 
of particle size distribution, polydispersity, molec- 
ular weight estimates (using Svedberg’s equation), 
rotational diffusion behavior and particle shape, 
and many other parameters. The greatest practical 
problem is usually when large particles are pres- 
ent, either as an undesirable contaminant or as an 
inevitable feature of the sample to be monitored, as 
scattering from just a few large particles can often 
be intense enough to totally dominate the weak 
signals from many much smaller ones. 


15.3.2.8 MEASUREMENT WITH OPTICAL 
FIBER OPTIC LEADS 


Generally, if used efficiently, optical fiber leads 
offer tremendous advantages. First, expensive 
instrumentation can stay in a safe laboratory envi- 
ronment, free from risk of damage from chemicals, 
weather, or careless handling. Second, the remote 
measurement probe can be very small, robust, and 
immune to chemical attack. Third, there is no need 
to transport chemicals or other samples to the 
instrument, so real-time online measurements are 
possible with no risk to personnel. 


15.3.2.8.1 In-fiber light delivery and 
collection for transmission 
measurements 

Transmission (and hence absorption or turbid- 

ity) measurements can be most easily achieved 

over optical fiber paths by using a commercial 


spectrophotometer with specially designed exten- 
sion leads. Typically, they have a unit that fits into 
the cell compartment of a standard instrument, 
with a first (focusing) lens that takes the collimated 
light that would normally pass through the sample 
chamber and focuses it instead into a large-core- 
diameter (usually >200,1m) optical fiber down- 
lead. A second lens that recollimates light, that has 
returned back from the sample area (via the return 
fiber lead), is used to reform a low-divergence beam 
suitable for passage back into the detection section 
of the instrument. 

There is also a remote measurement cell in the 
sample probe, connected to the remote end of both 
these fiber leads. In this remote sample area, a first 
lens collimates light (coming from the spectrom- 
eter, via the down-lead) into a local interrogation 
beam. This beam passes through the remote mea- 
surement cell, after which a second lens collects 
the light and refocuses it into the fiber return lead 
going to the spectrometer instrument. Such opti- 
cal transformations lead to inevitable losses of 
optical power (due to reflections, aberrations and 
misalignments) of typically 10-20dB (equivalent 
to losing 1-2 units of absorbance in the measure- 
ment range). However, most modern spectropho- 
tometer instruments have a typical dynamic range 
of >50dB, so this optical loss is a price that many 
users are prepared to pay in order to achieve a use- 
ful remote measurement capability. 

It should be noted that the optical power losses 
usually occur mainly due to misalignments and 
the imperfections of the focusing and recollima- 
tion optics, plus Fresnel reflection losses at inter- 
faces, rather than arising from fiber transmission 
losses. If suitably collimated beams were to be 
available in the instrument, if large core diameter 
fibers could be used to connect to and from the 
probe, and if the entire optics, including fiber ends, 
could be antireflection coated, there should really 
be very little loss penalty. There are many other 
probe head designs that are possible. The simplest 
design, for use with measurement samples showing 
very strong absorption, is simply to have a probe 
that holds the ends of the down-lead and return 
fiber in axial alignment, facing each other across a 
small measurement gap, where the sample is then 
allowed to enter. Losses are low for fiber end spac- 
ing of the same order as the fiber diameter or less, 
but rapidly increase with larger gaps. The probe is 
far easier to miniaturize and to handle if the fiber 
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down-lead and return lead are sheathed, in parallel 
alignment, in one cable. Use of such a cable can be 
accommodated using a right-angled prism or other 
retroreflecting device to deflect the beam in the 
probe tip through the desired 180° that allows it to 
first leave the outgoing fiber, pass through a sample 
and then enter the return fiber. Use of a directional 
fiber coupler at the instrument end allows use of 
a single fiber, but then any residual retroreflection 
from the fiber end will be present as a cross talk 
signal, adding light signal components that have 
not passed through the medium. 

Clearly, there are many variants of such opti- 
cal probes, some involving more complex optics 
(e.g., multipass probes), some constructed from 
more exotic materials to withstand corrosive 
chemicals. A very simple option that has often 
been used with such single fiber probes for moni- 
toring the transmission of chemical indicators, 
is to dissolve the indicator in a polymer that is 
permeable to the chemical to be detected and 
also incorporate strongly scattering particles in 
the polymer. When a small piece of such a poly- 
mer is formed on the fiber end, the particles give 
rise to strong backscattered light, and the return 
fiber guides a portion of this to the detection sys- 
tem. This backscatter light had of course to pass 
through the indicator polymer in its path to and 
from each scattering particle, so the returning 
light is subject to spectral filtering by the indica- 
tor. Although this is a very lossy arrangement, it 
is extremely cheap and simple and has formed the 
basis of many chemical sensors, for example ones 
using pH indicators. 


15.3.2.8.2 In-fiber light delivery 
and collection for Raman 
scattering and fluorescence 


Raman scattering and fluorescence can also be 
measured via fiber leads, but the use of fibers 
causes far more loss of light due to the wide-angle 
reradiation patterns characteristic of both meth- 
ods. However, the potential value of these meth- 
ods, particularly of Raman scattering, for chemical 
sensing has encouraged workers to persevere to get 
useful performance, despite the low return light 
levels achieved with fiber-coupled systems. Both 
these mechanisms involve excitation of a sample 
with light, usually at a wavelength shorter than the 
scattered light to be observed; the reemitted light is 
then collected and narrowband filtered. 


The loss due to launching of the excitation 
laser light into a fiber is usually negligible with 
Raman, as narrow-line laser sources are used, but 
ultimately the launch power limit may be set by 
nonlinear processes or, in the case of large-core 
multimode fibers, by optical damage thresholds. 
Similar excitation can be used for low-level fluo- 
rescence monitoring, provided no photobleaching 
or other photodegradation of the monitored sub- 
stance can occur at high illumination intensity. 
The main potential loss is therefore that of light 
collection. The additional loss due to collection of 
Raman light via optical fibers is at least a factor of 
90 greater than when using conventional optics, 
thus making the already low light levels about 2 
orders of magnitude worse. Despite this, a number 
of fiber-remoted Raman chemical sensor probes 
are appearing as commercial items. 


15.3.2.8.3 Evanescent field methods 
of in-fiber light delivery and 
collection: Frustrated total 
internal reflection 


Frustrated internal reflection measurements are 
clearly attractive for use with multimode optical 
fibers, as the guidance in the fiber depends on such 
internal reflections. As stated already, total internal 
reflectance is often not total. One marked depar- 
ture from traditional school teaching occurs dur- 
ing reflection at curved interfaces, where light can, 
under certain circumstances, be lost by radiation 
into the less dense medium, even when the angle 
of incidence is above the well-known critical angle. 
This occurs in multimode optical fibers if light is 
launched to excite skew rays, provided the light is 
launched at an angle to the fiber axis that is too 
large to expect guidance by total internal reflec- 
tion, assuming rays were passing through the fiber 
axis. Thus, even if the actual angle of incidence on 
the fiber core-cladding interface for these skew 
rays is greater than the calculated critical angle, 
light will be lost into the cladding to form what are 
called leaky rays. 

However, even with light incident on flat sur- 
faces, so assuming no leaky rays are possible, there 
is always a small penetration of light into the less 
dense medium. Another aspect that is not nor- 
mally taught in elementary optical texts is that the 
reflected light beam, although being reflected at 
the same angle to the normal as the incident light, 
undergoes a lateral shift in the direction of the 
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interface plane between the two media. It is said to 
suffer a so-called Goos—Hanchen shift (this is like 
a displacement of the “point” of reflection at the 
interface). A fuller examination, using either clas- 
sical geometric optics or quantum optics, predicts 
that part of the radiation, the evanescent field, pen- 
etrates the interface and then, for a certain distance, 
is guided outside the optically denser medium, lead- 
ing, when it returns, to a lateral shift in the apparent 
point of reflection from the interface. 

The concept of the evanescent field, which 
decreases exponentially away from the interface 
in the medium having the lower refractive index, 
was introduced earlier. This evanescent field can be 
used for various fiber-based experiments, as shown 
in Figure 15.13. 

As stated earlier, material or molecules within 
a penetration depth, d, of this field can absorb the 
radiation and result in an attenuated total reflec- 
tance. The absorption can be enhanced further 
using thin metal layers on the fiber to cause evanes- 
cent field enhancement due to plasmon resonance. 
This effect will be discussed in more detail later. 
Another application of this evanescent field is that 
fluorescent materials or fluorophores close to the 
interface can absorb this evanescent field radiation 
and induce fluorescence. The evanescent field can 
be used to monitor effects very close to the inter- 
face because absorption clearly cannot take place 
beyond the penetration depth, so no fluorophores 
in the bulk are monitored. 


15.3.2.8.4 In-fiber light delivery and 
collection for PCS 


In PCS systems, monomode fibers are very well 
suited for both delivery and collection of light. 
A single-mode optical fiber not only makes an 
excellent delivery medium for a beam launched 
efficiently from a gas or semiconductor laser, but 
it can also form a near-ideal single-spatial-mode 
optical filter to replace the conventional pinhole 
used for traditional systems. The PCS technique 
is, therefore, easily adaptable to perform remote 
measurement with optical fibers. There is very little 
penalty in using optical fibers, as the lasers can be 
launched with high efficiency, and because a fiber- 
based single-spatial-mode receiving filter will not 
lose any more light energy than the alternative of a 
tiny hole in a metal plate. The fiber, in fact, makes a 
near-ideal spatial mode filter. 


15.3.2.9 SPECIALIZED OPTOELECTRONICS 

AND SIGNAL PROCESSING 

METHODS FOR SPECTROSCOPY 
This section will look at specialized spectroscopic 
methods such as scanned filters, modern fixed 
detector arrays, and use of Hadamard and Fourier 
transform signal processing methods. In simple 
dispersive spectrometer instruments, the desired 
selection of optical wavelength or frequency is 
achieved by monochromators using a prism or 
diffraction grating, with the necessary collima- 
tion and refocusing optics. The spectrum is then 
recorded sequentially in time as the frequency or 
wavelength of the light transmitted by the filter is 
scanned. 

Another possibility, discussed in this section, 
is to use various parallel multiplexing techniques 
where all wavelengths are monitored at the same 
time. There are two generic lines of development 
that have gained success in recent years. 

The first of these, which has already been men- 
tioned above, is simply to use multiple detectors, 
either a discrete-multielement photodiode array 
with separate amplifiers, or a self-scanned CCD 
detector array. Both of these enable the parallel 
detection of all wavelengths, and thus a more effi- 
cient use of the available light. These components 
were discussed above. 

The second generic approach is to pass the light 
through a more complex multiwavelength optical 
filter, which is capable of passing many wavelengths 
at once, but where the spectral transmission varies 
as it is scanned with time. Light then impinges on 
a single optical detector and finally the transmit- 
ted spectrum is decoded by applying mathematical 
algorithms to the observed temporal variations in 
the detected signal as the complex filter is scanned. 
Two common variations of this so-called trans- 
form approach are used, first, the Hadamard, and 
second, the Fourier method. Both methods have 
the advantage of parallel multiplexing, thereby 
achieving a so-called Fellget advantage [19]. Both 
methods also have the additional advantage of 
high optical “throughput” [20], since instead of a 
single narrow slit a large area (normally a circular 
hole) is used, allowing far more light to enter. 

In both of these transform methods, math- 
ematical analysis is used to decode the tempo- 
ral modulation pattern of optical signals arising 
from superposition of radiation at many different 
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wavelengths. In the Hadamard spectrometer, an 
encoded mask with a transmissive (or reflective) 
pattern of markings is positioned in the focal 
exit plane of a normal dispersive spectrometer, 
and then a mathematical transformation called 
the Hadamard matrix is applied to the detected 
signal. The coded mask usually has a fine-detail 
binary orthogonal code on it, forming a pattern 
of elements like 010010100110111001010 such that 
the detected transmitted (or reflected) signal var- 
ies with the position of this mask [21]. Using the 
Hadamard matrix to perform the desired math- 
ematical transform, the transmitted spectrum can 
be reconstructed from the detected modulation 
of light. 

Fourier transform spectrometers require a filter 
having a sinusoidal variation of optical transmis- 
sion, which can have its spectral period (wave- 
length or frequency between successive peaks in 
transmission) modulated with time in a defined 
manner. All two-path optical interferometers, as 
their path difference is varied with time, conve- 
niently demonstrate the desired sinusoidal trans- 
mission changes which are a natural consequence 
of the interference process. The most convenient 
form of this to use in instruments is the Michelson 
interferometer configuration [22,23]. 

The simplest way to estimate how a Fourier 
transform spectrometer operates is to first con- 
sider what will happen with monochromatic light 
(e.g., single-frequency laser). Here, if the path dif- 
ference of the interferometer is increased linearly 
with time, the intensity transmitted by the inter- 
ferometer will vary in a purely sinusoidal manner 
with time, i.e., the output from the optical detec- 
tor will be a single electronic frequency. Any more 
complex optical spectra can be considered to be 
composed of a superposition of many such pure 
optical frequencies, and each of these single-fre- 
quency components will generate a pure electronic 
frequency or tone, and so will give its own unique 
electronic signature. The temporal signal from the 
detector will be composed ofa linear superposition 
of all such signals. Decoding of such an electronic 
signal to recover all such single-frequency sinu- 
soidal signal components is a standard problem 
in electronic spectrum analyzers, and the well- 
known solution is that of Fourier analysis; hence, 
the use of the words Fourier transform spectrom- 
eter to describe its optical equivalent. 


In all of these transform methods, the instan- 
taneous condition of the complex optical filter is 
known from the position of the interferometer 
(Fourier method) or the coded mask (Hadamard 
method), thus allowing decoding of detected sig- 
nals to produce the desired spectral output. 


15.3.2.10 COMPARISON OF PROCESSING 
METHODS 


Before finishing this section, it is instructive to dis- 
cuss the relative strengths and weaknesses of each 
approach. In fundamental signal to noise ratio 
terms, it is preferable to use dispersive optics with 
a fully parallel array detector as, for example, used 
to great advantage in modern CCD array spectrom- 
eters. Unfortunately, such high-performance detec- 
tor arrays are only available in the visible and NIR 
region, as detector technology for other regions is 
not yet as well commercialized [24]. Because of this, 
there is far greater use of Fourier transform methods 
in the mid- and far-IR regions, where it is only nec- 
essary to have one (often cooled) high-performance 
single detector. The Hadamard and Fourier meth- 
ods gain advantage over using a single detector with 
a scanned narrowband filter, as more wavelengths 
are transmitted through the filter at any one time, 
and a greater optical throughput can be used in the 
optical system. The Fourier transform method has 
a further advantage, however, when high spectral 
resolution is desired, as it is difficult, particularly in 
the IR region, to make detector arrays with a small 
enough spacing to resolve very closely spaced wave- 
length, which would otherwise need very large dis- 
persive spectrometers to achieve high resolution. 


15.4 CASE STUDIES OF RESEARCH 
INTO OTHER SPECTROSCOPIC 
METHODS 


In recent years, an increasing number of applica- 
tions of optical sensors has been published. Many 
optical techniques using either fluorophore-labeled 
samples or applying direct optical detection have 
been investigated and well covered in the literature 
[1,25,26]. The direct optical methods can be classi- 
fied according to whether they use refractometric 
or reflectometric principles. As discussed above, 
the first relies on the behavior of the evanescent 
field. Several types have been commercialized. The 
best known type is the BiaCore [27] using surface 
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plasmon resonance. Several examples of refracto- 
metric methods are listed briefly below: 


e An early method for refractive index monitor- 
ing, by Lukosz [28], was to embed a grating on 
the surface ofa slab (rectangular cross section) 
waveguide, with the sensor taking advantage of 
the modification of the Bragg grating condi- 
tion when the index of the external medium 
changed. The original concept has since been 
modified by many scientists. Read-out from the 
sensor can be conveniently performed using a 
CCD sensor array [29]. 

e Another approach used a prism coupler, 
which is acommon means of coupling light 
into slab waveguides. This technique is also 
sometimes called a “resonant mirror” method. 
A layer of lower refractive index is arranged 
as a buffer layer between the prism and the 
waveguide. An incident beam (polarized at 
45° to the normal in the plane of incidence) is 
reflected at the base of the prism in a manner 
dependent on the wavelength, the angle of 
incidence, and the optical properties of both 
the prism and the waveguide. The incident 
beam can excite TE (transversal electric) and/ 
or TM (transversal magnetic) modes of the 
waveguide and the modes of the waveguide 
can recouple back into the prism, resulting in 
a phase shift relative to the directly reflected 
beam. Because both propagating modes 
(TE and TM) travel with different veloci- 
ties within the waveguide and are differently 
influenced by the refractivity of the medium 
in the evanescent field region [30], the plane 
of polarization of the output light is in general 
elliptical, with a polarization state depend- 
ing on the relative phase delay. The process is 
similar to the polarization changes that occur 
in a birefringent crystal. 

e Direct interferometers, i.e., ones not making 
use of polarization changes, are also interest- 
ing and are useful arrangements for interroga- 
tion of refractive index. In a commonly used 
configuration, radiation is guided via two arms 
of a Mach-Zehnder interferometer [31], and 
one of the arms is covered by a sensing layer 
(polymer, biomolecular recognition layer). 
Guided radiation propagates within these two 
arms with different phase velocities, result- 
ing in a phase shift that, after interferometric 


superposition at the coupling junction, can 
be measured as an intensity change, which 

is now dependent on the refractive index 

of the medium in contact with the recogni- 
tion layer. Another possible configuration is 
the well-known Young interferometer [32] 
arrangement. Instead of recombining the two 
waveguides in the planar layer, the two beams 
are now focused to one plane only, using a 
cylindrical lens, and directed onto a CCD array 
where they form an interference pattern. 

@ One of the now most commonly used methods, 
surface plasmon resonance, involves enhancing 
the evanescent field intensity using thin metal 
films coated on the surface of a dielectric. The 
surface plasmons (electrons) in the metal film 
coating can be excited by the guided wave in 
the waveguide. The excitation depends on the 
refractive index of the medium on top of the 
metal film and varies dramatically near the 
resonance condition, as the intensity of the 
radiation propagating in the waveguide is sub- 
stantially reduced by the stronger resonance 
coupling [33]. It can be used as a detection 
principle, either directly, using light propagat- 
ing through the waveguide, or indirectly, using 
a prism coupled to the waveguide structure. 
Originally, the concept of surface plasmon res- 
onance was introduced to the scientific com- 
munity by Kretschmann [34] and Raether [35]. 


In addition to the above examples of interrogation 
principles, many other approaches have been pub- 
lished [36-38] and recently reviewed [1]. 

Apart from refractometry, reflectometry is 
frequently used in sensing. It often relies on mul- 
tiple reflections and usually involves white light 
interferometry (similar to the Fabry-Perot) to 
give a wavelength-dependent or filtering action. 
For many decades, ellipsometry has been used to 
examine the properties of thin film layers. As men- 
tioned above, reflection and refraction takes place 
at each interface between media of different refrac- 
tive index. The partially reflected beams from the 
interfaces at each side of a layer will superimpose 
and exhibit interference. The intensity of reflection 
depends on the wavelength, the angle of incidence 
of radiation, the refractive index of the layer, and 
the physical thickness of this layer. In ellipsometry 
[39,40], polarized light is used and thus the refrac- 
tive index and the physical thickness of the layer 
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can be determined separately. In normal reflec- 
tometry, no polarization state is selected, resulting 
in a very simple, robust and easy-to-use method 
for monitoring effects in these layers, conveniently 
having negligible temperature dependence [37]. 
The reason is an opposite behavior of refractive 
index and of the thickness of the layer in depen- 
dence on temperature which compensate. Gases 
or liquids can be measured applying interference 
reflectometry. It has the combined advantages of 
small cell volume and the possibility of remotely 
monitoring interference effects in the cell via fiber 
optic leads. 


15.5 CONCLUSIONS 


We have now completed our introduction to the 
subject of spectroscopy, specifically to give a sim- 
ple overview to nonspecialist readers. Of course, 
this area is so complex, it can fill large specialist 
textbooks, so it is hoped that the reader will for- 
give the inevitable omissions. We have given a 
short introduction to the basic theory, a brief prac- 
tical overview of components and instruments, 
and finally we have introduced a few examples of 
recent research areas, mainly using fiber optics. 
Clearly, the interested reader can gain more 
insight into this fascinating subject from more- 
voluminous specialist textbooks, many of which 
have been mentioned in the references and further 
reading. 
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16.1 INTRODUCTION 


Photovoltaics (PV) is about to celebrate 70 years 
of its existence in the modern era. Photochemical 
reactions that can produce energy from sunlight 
have been known for over 160 years [1] and pho- 
tosensitive semiconductor devices were discov- 
ered in the second half of the nineteenth century 
[2,3]. However, it was not until the 1950s that 
the first practical solar cell was developed at Bell 
Laboratories in New Jersey [4]. 

Solar cells soon found applications in supply- 
ing electrical power to the first satellites. The first 
practical terrestrial systems followed in the 1970s. 
Their economic viability was restricted to what is 
now referred to as remote applications in isolated 
locations, including powering of radio repeater 
stations, control and measurement devices, and 
cathodic protection equipment for oil and gas 
lines. As costs fell, PV systems started to be eco- 
nomically feasible to provide power to the large 
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number of people who did not have (or, in the 
foreseeable future, were unlikely to have) access to 
mains electricity. 

Since the oil crises in the 1970s, solar electricity 
has been increasingly advocated as an alternative 
to conventional electricity supplies, which, in 
most countries, depended on fossil fuels. The first 
larger scale (utility) PV power plants started to be 
built in the United States and Europe during the 
1980s. 

In the last decade of the twentieth century, solar 
cell integration into roofs and facades of domes- 
tic and commercial buildings became widespread, 
enabling a new and attractive distributed form 
of power generation (see Table 16.1). In the early 
stages, economic viability often required govern- 
ment subsidies, but continued price reduction has 
led to a boom in PV production and increased 
attractiveness of this form of energy generation, 
with a market growth of more than 40% per year 
between 2000 and 2014. 
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Table 16.1 The growth of PV applications 


1950s First modern solar cells appeared 

1960s Solar cells become the main source 
of electric power for satellites 

1970s First use for remote industrial 
applications on the ground 

1980s Solar cells used in rural electrification 
and water pumping; first grid- 
connected systems 

1990s Major expansion in the use of 
building-integrated systems 

2000s PV installed capacity grows in 
average at over 40%/year 

2014 PV installed capacity reaches over 


1% of total global electricity 
demand 


Solar cells capable of generating more than 
38 GW were installed in 2014, providing electricity 
for applications of all scales, ranging from milliwatts 
to megawatts (Figure 16.1). This chapter aims to 
give an overview of this rapidly growing industry, 
with an outline of the device and system aspects of 
the field. The basic principles of solar cell operation 
are described in Section 16.2 along with an exami- 
nation of the fundamental constraints on solar cell 
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efficiency. Section 16.3 reviews current PV technolo- 
gies covering crystalline-silicon, thin-film, organic, 
and hybrid solar cell types. The integration of these 
into PV systems is briefly discussed in Section 16.4, 
with a short overview of the design and principal 
features of stand-alone and grid-connected systems. 
Please note, however, that there is a special section 
on PV applications in Chapter 24 “Applications of 
electricity generation by solar panels” in Volume III. 


16.2 PRINCIPLES OF SOLAR CELL 
OPERATION 


From an engineering point of view, the solar cell 
can be considered as a semiconductor diode, con- 
nected in parallel with a current source, which is 
the photogenerated current I, (Figure 16.2). This 
argument yields the Shockley ideal solar cell 


equation 
qv 
I=I, -1(< -1] 


where I, is the dark saturation current of the diode, 
q is the electron charge, V is the voltage at the 


(16.1) 


Market segmentation in Europe in 2013 
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Figure 16.1 World annual production of solar cells (yearly and cumulative) and the market segmenta- 
tion in Europe in 2013. (Data Adapted from Masson, G. et al., Global market outlook for photovoltaics 


2014-2018, EPIA, p. 17, 2014.) 
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Io 


Figure 16.2 Simplified equivalent circuit of an 
ideal solar cell (internal resistances not shown). 


terminals of the solar cell, k is the Boltzmann con- 
stant, and T is the absolute temperature. The solar 
cell characteristic is shown in Figure 16.3, depict- 
ing also the relevant parameters J,., V,., and P,,,,- 
The short-circuit current I,. produced by this ideal 
solar cell is equal to the light generated current J. 


The open-circuit voltage V,. is given by 
Voe= Pip( 1") 
Ip 


q 
It is customary to define a “fill factor” that is 
used to calculate the power produced by the cell at 
the maximum power point P,,,.,: 


(16.2) 


Prax 
FF=—~ 
TscVoc 


(16.3) 
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Maximum power 
rectangle 
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Figure 16.3 I-V characteristic of a solar cell. The power generated at the maximum power point 


equal to the area of the maximum power rectangle. 


To describe practical devices, a diode “ideal- 
ity” factor n is sometimes introduced as a coef- 
ficient of the thermal voltage kT in the exponent 
of Equation 16.1. A second diode with n=2 can 
also be added, as well as series and parallel resis- 
tances to improve the agreement of equations with 
measured data. More complex circuits can also be 
used, mainly during the R&D phase to help opti- 
mize device structure aimed at achieving higher 
efficiency and stability [6]. 

Constraints to solar cell efficiency imposed by 
fundamental physical laws have been reviewed 
on a number of occasions [7-12]. The best known 
of these, the “Shockley-Queisser detailed bal- 
ance limit” is briefly reviewed in this section. 
This limit reflects the fact that the solar cell is a 
quantum energy converter, which is not subject 
to some of the usual constraints normally appli- 
cable to the conversion of heat to mechanical 
work. Notwithstanding this, we shall see that the 
thermodynamic Carnot factor does appear in a 
subtle way. 

Before the efficiency of a solar cell can be dis- 
cussed, a common standard of solar radiation has 
to be agreed upon. Workers in the area of terres- 
trial solar cells and systems use the most com- 
monly accepted standard, which represents a good 
compromise between convenience and observa- 
tion: solar spectrum at air mass (AM) 1.5, normal- 
ized to a total energy flux density of 1kWm” [13]. 
The value of 1kWm for the total irradiance is 
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Figure 16.4 Different types of solar radiation spectra that are commonly used as a basis for solar cell 


theory and characterization. 


particularly convenient for the system design (see 
Section 16.4) and, together with the cell tempera- 
ture of 25°C, corresponds to the usual conditions 
for the calibration of terrestrial solar cells and 
modules. The rated solar cell output at the maxi- 
mum power point under these standard test condi- 
tions is usually known as “watt peak” (W,). 

Space solar cells and systems operate under 
broadband extraterrestrial (AMO) radiation, and 
this spectrum is therefore used for their calibra- 
tion [14]. The total irradiance, equal to the average 
solar irradiance immediately outside the earth’s 
atmosphere, is called the solar constant S. The 
commonly accepted value of S is now 1.367 kWm”. 

Theoretical calculations are often based on a 
third definition of solar radiation: the radiation of 
a black body that agrees with the observed AMO 
irradiance, giving the appropriate temperature of 
solar radiation T= 5767 K [15]. To complicate mat- 
ters further, some calculations use a less accurate 
but more convenient value of T= 6000 K. A geo- 
metrical factor f,, is introduced to allow for the size 
of the solar disk as perceived from the earth: 


Rs ; Ws 
fo =] — = — 
Rsp T 
where R, is the radius of the sun, R,, is the mean 


distance between the sun and the earth, and 
@S= 6.8510 sr is the solid angle subtended 


(16.4) 


by the sun. The spectra that correspond to these 
different types of solar radiation are shown in 
Figure 16.4. 

In the calculation of their detailed balance limit, 
Shockley and Queisser [11] used the black body 
photon flux (5d) on earth received by a planar sur- 
face of unit area in a frequency interval dv, equal to 


2n,. v'dv 
86= fo a (16.5) 
e kts —] 


where hi is the Planck constant and c is the speed 
of light. An ideal solar cell made from a semicon- 
ductor with bandgap E, absorbs all photons with 
frequency v > v,=E,/h and each such photon gives 
rise to an electron in the external circuit. The 
short-circuit current density is then 


an. f vs 
Ixc=9]80=" afo | AF 


2 
Cc 
e kTs 


(16.6) 


Vg 


A similar approach can be used to obtain the dark 
saturation current density J), but now the black 
body radiation (which completely surrounds the 
cell) is at the ambient temperature T: 


4n f vdv 
Jo = hv (16.7) 
Vg ek 
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Figure 16.5 Shockley—Queisser detailed balance limit on the solar cell efficiency. The lower efficiency 
corresponds to ordinary sunlight with geometrical factor f, given by Equation 16.4. The higher value 
corresponds to the case where the radiation source completely surrounds the solar cell obtained by 
setting f,=1. Also shown are the bandgaps of the most common PV materials (c-Si and a-Si denote 


crystalline and amorphous silicon, respectively). 


Note that there is an extra factor of 2 included 
to account for the fact that the black body radia- 
tion at ambient temperature is emitted from twice 
the cell area (i.e., from the front and back of the 
cell). The open-circuit voltage can now be deter- 
mined from Equations 16.6 and 16.7 with the aid of 
Equation 16.2. The maximum solar cell efficiency 
thus obtained depends only on the bandgap of the 
semiconductor from which the solar cell is made 
(Figure 16.5). 

Higher power generation for a given semicon- 
ductor area can be obtained if solar radiation is 
“concentrated” by means of lenses or mirrors, and 
higher cell conversion efficiency can be obtained 
if several semiconductors with different bandgaps 
are used to convert different parts of the spectrum. 
The former situation is shown in Figure 16.5 by 
assuming that the cell is completely surrounded by 
the source of the black body radiation (i.e., by set- 
ting f,= 1). The combination of different semicon- 
ductors to make “tandem cells” is discussed later 
in this chapter. 

Other methods to calculate the solar efficiency 
for a single-gap cell include arguments based on 
thermodynamic principles, which underline the 
observation that, near the open circuit, the solar 
cell behaves as an ideal thermodynamic engine 
with Carnot efficiency [16]. Using a simplified 


two-level model, Baruch et al. [10] showed that 
the maximum open-circuit voltage produced by a 
solar cell made from a semiconductor with band- 
gap E, can be approximated by the expression: 


E 
Voc = ‘(1 +P an( (16.8) 
q Ts} q 2 


The first term in Equation 16.8 is the bandgap 
energy in eV multiplied by the Carnot efficiency, 
and the second term gives the reduction in V,, on 
account of the spread of the solar black body radia- 
tion on earth. A factor of two has been added in the 
denominator of the second term, in keeping with 
a similar term in Equation 16.7. A closer agree- 
ment between the methods of Baruch et al. [10] 
and Shockley and Queisser [11] can be obtained by 
adding a term kT/q In(T;/T) to the right-hand side 
of Equation 16.8 that takes into account the fact 
that conversion occurs in semiconductor bands 
rather than in a two-level system [17]. Figures 16.6 
and 16.7 compare the theoretical values of the 
short-circuit current and open-circuit voltage with 
data for the best solar cells to date from different 
materials. 

The ideal solar cell efficiencies discussed ear- 
lier refer to single-junction semiconductor devices 
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Figure 16.6 Maximum theoretical short-circuit current density under the standard AM1.5 illumination 
(full line) and the best measured values for solar cells from different materials. (CIGS stands for copper 


indium-gallium diselenide. 


whose principal efficiency limitations are due to the 
inability of the semiconductor to absorb photons 
of energy lower than the bandgap, and to the fact 
that a part of the energy of above-bandgap photons 
is inevitably lost as heat. To overcome these limita- 
tions, one can form a tandem cell by stacking two 
or more cells on top of each other, each converting 
its own part of the spectrum. Devices of this form, 


which are now available commercially, include 
high-efficiency solar cells for satellites from 
Spectrolab or SolAero Technologies in the United 
States and thin-film amorphous silicon or silicon/ 
germanium double- or triple junction tandem 
cells. The tandem cell technology may, in princi- 
ple, increase efficiency to 42% (55% under concen- 
trated sunlight) for a double-tandem structure, or 
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Figure 16.7 Best measured values of open-circuit voltage for crystalline materials and thin films (filled 
and open squares, respectively) compared with theoretical predictions. The theories of Shockley and 
Queisser and Baruch et al. are discussed in the text. The theory of Green [44] gives a semiempirical 


estimate of the maximum achievable voltage. 
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to 86.8% in the limit of an infinite number of cells 
[8,18]. In 2015, the certified efficiency of an experi- 
mental four-junction cell reached 46% under con- 
centrated sunlight. 

Further improvements are possible by a variety 
of means that has recently been referred to as “third 
generation” technology [19]. These ideas involve, for 
example, the creation of more than one electron- 
hole pair from the higher energy incident photons— 
in other words, achieving quantum efficiencies in 
excess of unity. This use of “impact ionization” to 
improve the cell efficiency was first proposed by 
Landsberg et al. [20]. 

Kolodinski et al. [21] demonstrated the feasibil- 
ity of this approach but a working solar cell has yet 
to be developed. Other theoretical third-generation 
concepts invoke hot electrons [22,23], or the use of 
impurities [24,25], or intermediate bands [26,27] to 
attempt to utilize the below-bandgap light. Quantum 
wells have also been suggested as a means of improv- 
ing efficiency [28,29]. (Nelson [30] gives a review of 
this field.) A thermodynamic efficiency limit for this 
general form of solar energy converter is known as 
the Landsberg efficiency [31,32]. 


16.3 CURRENT SOLAR CELL 
TECHNOLOGIES 


At the time of publication, the PV market is domi- 
nated by crystalline silicon, in the single crystal 
or multicrystalline form (Figure 16.8). Thin-film 


Multicrystaline silicon 
55% 


9% 


Monocrystalline silicon 
36% 


Thin flims 


technologies still accounted for only about 10% 
of the global market in 2013, despite significant 
growth in volume. There is also a fast growing mar- 
ket, albeit from a very small base, for terrestrial 
applications of solar cells using concentrator sys- 
tems that can make use of expensive high-efficiency 
solar cells, of a type previously only used exten- 
sively in the space technology domain. 

Space solar cells are not included in Figure 16.8 as 
their production is much smaller in generated energy, 
even though, because of high cost, they are not insig- 
nificant in terms of financial turnover. Although 
similar in structure, the main requirements on solar 
cells used to power satellites in space differ substan- 
tially from their terrestrial counterparts. The primary 
drivers for space cells are the power output for a given 
size and weight (but, most importantly, the power-to- 
weight ratio) and, for many applications, the require- 
ment for good radiation resistance. The latter factor is 
the main reason for the predominant n-on-p configu- 
ration of current silicon solar cells, since the p-type 
base, where most of the power is generated, is more 
radiation resistant than the n-type material. 

The principal effect of particle radiation on the 
solar cell arises from the change in minority-carrier 
lifetime and the resultant reduction in the diffu- 
sion length. Over recent years, whenever there is a 
requirement for high efficiency and radiation toler- 
ance, the balance has swayed in favor of GaAs cells, 
sometimes in the form of a double-junction struc- 
ture based on GaInP/GaAs. Indium phosphide 
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Figure 16.8 Market share of different PV technologies (Kost, C., Mayer, J. N., Thomsen, J. et al., Levelized 
Cost of Electricity Renewable Energy Technologies, Fraunhofer ISE, Freiburg, Germany, 2013.) 


608 Optical to electrical energy conversion 


cells have been suggested for use in harsh radiation 
environments as their radiation resistance is par- 
ticularly high, due to partial annealing of radiation 
damage under illumination [33]. 


16.3.1 Crystalline silicon 


Atthetime of publication, atypical monocrystalline 
silicon solar cell module in production has an efhi- 
ciency between 15% and 18%, usually slightly less 
for cells made from multicrystalline material. Cells 
are typically made by phosphorus diffusion of the 
top layer (emitter) into a p-type wafer, with screen- 
printed metal contacts and a thin layer of antire- 
flection coating (arc; Figure 16.9) [34]. Photons, 
absorbed mainly in the base of the cell, create elec- 
tron-hole pairs. These are separated by the electric 
field of the junction (Figure 16.10a). The principal 
efficiency losses incurred in such a cell are shown 
in Figure 16.11. 

Improvements to the basic structure can be 
made by texturing the top surface to reduce opti- 
cal reflection. In combination with an optically 
reflecting back surface, this can be used to produce 
a significant degree of light trapping to offset the 
poor optical absorption of silicon. Surfaces can be 
passivated to reduce surface recombination, and 
p* diffusion can be used to create a back-surface 
field (a barrier to minority-carrier transport) and 


reduce recombination near the back surface. Laser- 
grooved buried-contact technology, invented by 
Martin Green at the University of New South Wales 
in Australia and utilized by the BP Solar “Saturn” 
cells, reduces shading by top surface metallization 
and pushes up the efficiency of production cells 
close to 18%. In the laboratory, the top silicon solar 
cell efficiency has now exceeded 25% —about 80% 
of the theoretical maximum for single-junction 
devices. Two high-efficiency structures that have 
achieved record conversion efficiencies are shown 
in Figure 16.12. The Stanford cell, with both sets 
of contacts at the rear to avoid shading, is intended 
for operation under concentrated sunlight. The 
highest efficiency silicon-based solar cell (Sanyo/ 
Panasonic HIT cell, with efficiency 25.6%) uses a 
back contacted heterojunction of crystalline and 
amorphous silicon. 

Silicon is an indirect-gap material, and the low 
absorption coefficient necessitates relatively thick 
devices to make good use of the available sunlight 
(Figure 16.13). This is most easily achieved by a 
wafer-based technology. The major part of crystal- 
line silicon cell production involves cutting wafers 
from monocrystalline or multicrystalline ingots. 
To reduce costs, there has been a push toward thin- 
ner and thinner silicon wafers, from over 300 pm in 
the late 1990s to around 200 pm a decade later. The 
trend is expected to continue as reliable production 
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Figure 16.9 Structure of a typical crystalline silicon solar cell in production today. 


16.3 Current solar cell technologies 609 


(b) 


Pp Intrinsic n 


aN 
hv Vy 
XK 


MoSe, 


+ 


Cu(In,Ga)Se, 


CdS 


ZnO 
(Front 
contact) 


Mo (Back contact) 


Figure 16.10 (a) A schematic diagram of the band 
structure and operation of a crystalline silicon 
solar cell. Most minority carriers are generated in 
the p-type base, and must diffuse to the junction 
(arrow 1) before charge separation (arrow 2). (b) 
In a typical amorphous silicon p—i-n structure, 
electron-hole pairs are generated in the space 
charge region where the built-in field aids carrier 
collection. (c) The band structure of a Cu(InGa) 
Se, solar cell (After Rau, U. and Schock, H. W., 
Cu(In, Ga)Se, Solar Cells Clean Electricity from 
Photovoltaics, World Scientific, Singapore, 2001.) 


and handling technologies for even thinner wafers 
become available. 

To avoid the kerf loss (amount of material lost 
during the cutting process), a number of companies 
(including ASE, Evergreen Solar, and Ebarra) are 
turning to “non-ingot” wafer technologies such as 
dendritic or edge-defined film growth [37]. Other 
fabrication methods that use less silicon include 
the growth of thin crystalline silicon solar cells 
deposited on a ceramic substrate, as demonstrated 
by Astropower. Since the early days of PV, however, 
there has been a substantial effort directed toward 
finding a cheap replacement material for crystal- 
line silicon—for a semiconductor with good opti- 
cal properties that can be deposited as a thin film 
to reduce the material requirements. 
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Figure 16.11 Principal efficiency losses ina 
commercial 100x100 mm crystalline silicon 
solar cell, compared with those for the pas- 
sivated emitter rear, locally diffused solar cell 
(PERL) (in brackets, see Figure 16.12). Notes: 

1. See Equation 16.8. Losses due to dilution of 
solar radiation (the second term) are absent 
under maximum concentration of sunlight. 2. 
Surface and volume recombination in the base, 
emitter, and junction regions. 3. Principally 
free-carrier absorption. 4. Collection efficiency 
is limited mainly by recombination in the base 
and emitter (After Boes, E. C. and Luque, 

A., Photovoltaic Concentrator Technology 
Renewable Energy: Sources for Fuels and 
Electricity, Earthscan, London, 1993.) 
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Figure 16.12 The UNSW PERL cell [45] and the Stanford point-contact cell [46]. 
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Figure 16.13 Absorption coefficients of selected 
PV materials. Data for organic solar cell poly[N- 
9’-heptadecanyl-2, 7-carbazole-alt-5, 5-(4’, 
7’-di-2-thienyl-2’, 1’, -benzothiadiazole)] 1:4 wt/wt 
blend with [6,6]-Phenyl-C71-butyric acid methy| 
ester, PCDTBT:PC71BM, Adapted from [35]. Data 
for methylammonium lead triiodide, CH;NH;Pbl;, 
(perovskite solar cell) (Adapted from Léper, P. 

et al., J. Phys. Chem. Lett., 6, 66-71, 2015.) 


16.3.2 Thin-film solar cells 


The origin of thin-film solar cells dates back to 
the same year as crystalline silicon solar cells 
[38]. Commercial production of the first thin-film 
devices had to await the development of hydroge- 
nated amorphous silicon (a-Si) in the second half 
of the 1970s. Unfortunately, the discovery of an 
amorphous Si solar cell degradation mechanism 
associated with light-induced metastable changes 
in the properties of hydrogenated amorphous 
silicon (the Staebler-Wronski effect [49]), repre- 
sented a major setback to the new technology, and 
most a-Si solar cells produced today require some 
measures to reduce its impact. 

The present single junction a-Si cells (used 
principally to power commercial products such as 


watches and calculators) are invariably based on 
p-i-n or n-i-p technology. The incorporation of 
the intrinsic layer serves to enhance carrier collec- 
tion since most electron-hole pairs are generated in 
a region where electric field assists their separation 
(Figure 16.10b). The manufacture involves first the 
deposition of a layer of tin oxide on glass to act as 
a transparent front contact. The p-i-n cell is then 
formed by depositing a p-layer of boron-doped 
silicon (or silicon/silicon carbide alloy), intrinsic 
silicon, the phosphorus-doped n-layer, and finally 
a metal back contact. Tandem a-Si cells, which are 
increasingly more common, sometimes also incor- 
porate a-Si/a-Ge layers to optimize the bandgap. 
Other structures involve a combination of micro- 
crystalline/amorphous silicon solar cells, as mar- 
keted by Sharp and Kaneka Solar Energy. 

Other thin-film solar cell technologies based 
on compound semiconductors—cadmium tellu- 
ride (CdTe) and copper indium gallium disellenide 
(CIGS) or its derivatives—have been overtaking a-Si 
in terms of commercial importance and, in 2013, 
CIGS production was more than twice that of a-Si. 

CdTe cells are usually manufactured in the 
form of CdTe/CdS heterojunctions. They present a 
direct transition bandgap of 1.51 eV, which allows 
the use of very thin films, and have achieved certi- 
fied module efficiency of >18% in 2015, with best 
research cells reaching 21.5%. At the time of writ- 
ing, the major company developing CdTe products 
is First Solar who has installed millions of CdTe 
modules in one of the largest PV power plants in 
the World, Topaz, in the United States. 

The other main compound semiconductor 
that is being used in commercial production of 
solar cells is CIGS. It forms a chalcopyrite crystal 
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Figure 16.14 A schematic diagram of an inte- 
grally interconnected module [47]. 


structure and the bandgap can be optimized by 
controlling the amount of gallium in the struc- 
ture, CuIn,Gay_,Se,. Optimized structures have 
achieved research cell efficiencies of 21.7%. The 
band structure of a typical CIGS solar cell, based 
on a CdS/Cu(InGa)Se, heterostructure, is shown in 
Figure 16.10c. There is also much interest in other 
types of compound semiconductors that do not use 
scarce elements (i.e., In), such as compounds that 
form kesterite structure (e.g. Cu,ZnSnS,), that 
have shown laboratory cell efficiencies of 12%. 

An advantage of thin-film solar cells over the 
wafer-based technologies lies not only in the lower 
material requirements but also in the possibility 
of integrated module manufacture. Alternating 
layers can be deposited and laser scribed with an 
offset in such a way that a series connection of cells 
is produced without mechanical handling of indi- 
vidual cells (Figure 16.14). The major drawback of 
thin-film technologies in the past has been lower 
efficiencies combined often with stability problems 
(e.g., due to moisture). The recent increase in pro- 
duction and the growing efficiency seem to indicate 
that thin films are ready for large-scale installa- 
tions, as demonstrated by its use in the Topaz solar 
farm. It remains to be seen if this will be sufficient 
to increase market share that has stagnated in the 
last years, despite large overall market growth. 


16.3.3. Organic and hybrid solar cells 


The search for new cost-effective materials includes 
the development of devices where the PV energy 
conversion is carried out by molecules instead of 
traditional semiconductor structures. In the early 
1990s, Michael Gratzel and colleagues at the Ecole 


Polytechnique Fédérale de Lausanne demon- 
strated dye-sensitized solar cell (DSSC) architec- 
ture where a Ru-based molecular dye both absorbs 
the light and participates in charge separation. 
In effect, the monomolecular dye layer “pumps” 
electrons from one electrode (liquid electrolyte) 
to the other (solid titanium dioxide). The use of 
mesostructured nanocrystalline titanium oxide 
particles coated by the dye ensures high optical 
absorption: virtually all light within the spectral 
range of the dye is absorbed by a coating not more 
than few angstroms thin. Laboratory cell efficien- 
cies of over 10% were quickly achieved; however, 
the presence of a liquid electrolyte combined with 
serious stability issues severely affected indus- 
trialization of this technology. Several attempts 
were made to substitute the liquid electrolyte with 
a solid-state material but often lead to lower effi- 
ciencies. Recently, many companies and research 
groups have been changing focus from DSSCs to 
the recently discovered perovskite solar cells. 

The light absorbing element in hybrid organic- 
inorganic Perovskite solar cells are crystals with an 
ABC, structure, where A is an organic cation, B a 
metal cation, and Ca halide anion. They were intro- 
duced as sensitizers in solid-state DSSC configura- 
tions but soon it became clear that high efficiencies 
can be achieved in a thin-film configuration. It has 
shown the fastest growth in device efficiency of any 
PV technology to date and in 2014, certified lab 
efficiencies of >20% were demonstrated. It is still 
too early to determine if such technology will ever 
become commercially viable. Advantages include 
the potential for low temperature processing and 
even for solution-based manufacturing, which 
could mean low production costs and fast energy 
payback time. However, early results indicate sig- 
nificant stability issues (e.g., due to moisture). 
Additionally, the most efficient perovskites use Pb 
in the structure, which could lead to regulatory 
challenges when dealing with end-of-life and dis- 
posal of the products. Alternative lead-free com- 
binations are being investigated and, with further 
research, it is likely that improvements in stability 
should be achieved. 

Another area of interest is that of truly molecu- 
lar solar cells. Both polymer and small organic mol- 
ecules have been used as active ultrathin layers in 
solar cells. These materials are direct bandgap semi- 
conductors with extremely high absorption coef- 
ficients; therefore, only a few hundred nanometer 
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Figure 16.15 A schematic generic diagram of the typical organic solar cell or perovskite planar het- 


erojunction solar cell today. 


thick layers are needed to absorb all light. A typi- 
cal organic photovoltaic (OPV) is formed by two 
materials, one electron donor and one electron 
acceptor, somewhat mimicking a p-n junction. 
However, in OPV, the pair of charges (exciton) 
generated upon illumination is strongly bound 
and will not generate free charges unless charge 
transfer takes place at the heterojunction. That, 
combined with a very short exciton lifetime and 
low charge-carrier mobility, puts restrictions on 
device architecture [50]. At the time of writing, 
the preferred device configuration uses a blend 
of donor and acceptor materials to form an ultra- 
thin film (~100-200 nm) with an interpenetrating 
network of phase domains (in order to maximize 
donor-acceptor interface area and, therefore, max- 
imize charge generation) (Figure 16.15). This active 
layer is sandwiched between two electrodes, one 
of which is typically a transparent oxide, such as 
indium tin oxide. Between the active layer and the 
electrode, typically an interlayer is used to improve 
charge collection and avoid interdiffusion of spe- 
cies. Modules are monolithically integrated, simi- 
larly to thin-film PV modules. Note however, in 
contrast to silicon PV, OPV is not one single tech- 
nology and device structure. Thousands of active 
molecules, interlayer materials, and combinations 
of these have been under investigation and each 
reported breakthrough in efficiency is typically 
achieved with a new material. 

Significant industrial development has taken 
place since 2010 and, in 2014, Toshiba announced 
the first certified OPV module efficiency of 8.7%, 
while the cell efficiencies are reported between 11% 
and 12%. Despite the efficiency being significantly 
lower than that of c-Si, OPV provides a series of 
potential advantages that could lead to market 


entry in niche applications. The first advantage is 
related to low temperature and potential for fast 
roll-to-roll manufacturing via printing or coat- 
ing, which enables direct coating onto flexible sub- 
strates such as plastic, thin metal sheets, and even 
paper. This should allow very low production costs 
and very short energy payback time. Other charac- 
teristics are lightweight and easy customizability, 
which should allow easy integration into consumer 
products. For example, in 2015, Heliatek and 
RECKLI announced an OPV solar concrete wall 
installation in their headquarters in Germany. One 
major drawback of OPV technology has been the 
limited lifetime that has hindered the use of flex- 
ible products. Recent advances in intrinsic stability 
and improved barrier layers have led to significant 
improvements in demonstrated stability in field 
trials. There is much to look forward to if knowl- 
edge from the booming organic light-emitting 
diodes industry can be used to bring OPV closer 
to the market. 


16.4 ISSUES IN SYSTEM DESIGN 


For practical use, solar cells are laminated and 
encapsulated to form PV modules. These are 
then combined into arrays, and interconnected 
with other electrical and electronic compo- 
nents—for example, batteries, charge regulators, 
and inverters—to form a PV system. A number 
of issues need to be resolved before an optimum 
system design is achieved. These issues include the 
choice between a flat plate or concentrating sys- 
tem, and the required array configuration (fixed 
tilt or tracking). Answers to these questions will 
vary depending on the solar radiation at the site 
of the installation. Specific issues also relate to 
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whether the system is to be connected to the utility 
supply (the “grid”) or is intended for stand-alone 
operation. 

The available solar radiation, of course, varies 
from day to day, season to season, and depends 
on the geographical location. One can, however, 
obtain an estimate for the mean daily irradiation 
(solar energy received by a unit area in one day), 
averaged over the surface of the globe. To this end, 
we note that the total energy flux incident on the 
earth is equal to the solar constant S multiplied by 
the area of the disk presented to the sun’s radiation 
by the earth. The average flux incident on a unit 
surface area is then obtained by dividing this num- 
ber by the total surface area of the earth. Making 
allowance for 30% of the incident radiation being 
scattered and reflected into space, the average daily 
solar radiation G, on the ground is equal to 


TR: 
2. 


Gy =24x0.7X 
4TR: 


xS$=24x0.7x 5 


=5.74 kWhm” (16.9) 


where R, is the radius of the earth, which was 
introduced in Section 16.2. The value should be 
compared with the observed values. Figure 16.16 
shows the daily solar radiation on a horizontal 
plane for four locations ranging from the humid 
equatorial regions to northern Europe. The solar 
radiation is highest in the continental desert areas 
around latitude 25-30°N and 25-30°S, and falls 
off toward the equator because of the clouds, and 


Daily solar radiation (kWh m7) 


0 + r 
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toward the poles because of low solar elevation. 
Equatorial regions experience little seasonal varia- 
tion, in contrast with higher latitudes where the 
summer/winter ratios are large. 

Most PV arrays are installed at fixed tilt and, 
wherever possible, oriented toward the equator. 
The optimum tilt angle is usually determined by 
the nature of the application. Arrays that are to 
provide maximum generation over the year (e.g., 
many grid-connected systems) should be inclined 
at an angle equal to the latitude of the site. Stand- 
alone systems that are to operate during the winter 
months have arrays inclined at a steeper angle of 
latitude +15°. If power is required mainly in sum- 
mer (e.g., for water pumping and irrigation), the 
guide inclination is latitude —15°. 

The amount of solar energy captured can be 
increased if the modules track the sun. Full two- 
axis tracking, for example, will increase the annual 
energy available by over 30% over a nontracking 
array fixed at the angle of latitude (Figure 16.17)— 
at the expense, however, of increased complexity. 
Single-axis tracking is simpler but yields a smaller 
gain. Tracking is particularly important in systems 
that use concentrated sunlight. These systems can 
partially offset the high cost of solar cells by the use 
of inexpensive optical elements (mirrors or lenses). 
However, the cells then usually need to be cooled 
and it should also be borne in mind that only direct 
(beam) solar radiation can be concentrated to a sig- 
nificant degree, thus reducing the available energy 
input. This effectively restricts the application of 
concentrator systems to regions with clear skies. 
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Figure 16.16 Mean daily solar radiation and its monthly variation in different regions of the world. The 
dashed line indicates the average value (Equation 16.9). 
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Figure 16.17 The energy available to a PV array with two-axis tracking, in comparison with a flat 
plate system inclined at an angle equal to the latitude of the site. Global radiation (circle) and direct 
beam (triangle) are shown. The full and dotted lines correspond to the global radiation available 

to the fixed array and a 34% increase over this value, respectively (After Boes, E. C. and Luque, 

A., Photovoltaic Concentrator Technology Renewable Energy: Sources for Fuels and Electricity, 


Earthscan, London, 1993.) 


There is a considerable difference between the 
designs of stand-alone and grid-connected sys- 
tems. Much of the difference stems from the fact 
that the design of stand-alone systems endeavors 
to make the most of the available solar radiation. 
This consideration is less important when util- 
ity supply is available, but the grid connection 
imposes its own particular constraints that must 
be allowed for in the system design. 


16.4.1 Stand-alone systems 


The various applications of PV systems in iso- 
lated locations have been mentioned briefly in 
Section 16.1 and more examples are given in Chapter 
24 “Applications of electricity generation by solar 
panels” in Volume III. The diversity of uses leads to 
different requirements on the system and different 
system specifications. An important parameter that 
reflects the nature of the application is the required 
security of supply. Telecommunication and systems 
used for marine signals, for example, need to oper- 
ate at a very high level of reliability. In other applica- 
tions, the user may be able to tolerate lower reliability 
in return for a lower system cost [40]. These consid- 
erations have an important bearing on how large PV 


array and how large energy storage (battery) need to 
be installed, in other words, on system sizing. 
Among the variety of sizing techniques, sizing 
based on energy balance provides a simple and pop- 
ular technique that is often used in practice. It gives 
a simple estimate of the PV array necessary to sup- 
ply a required load, based on an average daily solar 
irradiation at the site of the installation, available 
now for many locations in the world. Choosing the 
month with the lowest irradiation (e.g., December 
in northerly latitudes) and the value appropriate to 
the inclined panel, the energy balance equation can 
then be found with the use of data in Figure 16.16: 


ee Dail ti 
Array size (in W,) _ Daily energy consumption 


Daily solar raditaion 
(16.10) 


Equation 16.10 specifies how many PV modules 
need to be installed to supply the load under aver- 
age conditions of solar irradiation. The battery size 
is then estimated “from experience”—the rule of 
thumb recommends, for example, installing three 
days of storage in tropical locations, five days in 
southern Europe and ten days or more in the United 
Kingdom. As an illustration, Table 16.2 shows the 
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Table 16.2 Area and nominal power of a PV array, 
facing south and inclined at latitude angle, needed 
to produce 1 MWh of electricity per year at 
different locations, for two-module efficiencies h 


2 
Nominal array Module area (m?) 


Location power (kW,) n= 15% 1n=10% 
Sahara 0.41 2.7 4.1 
desert 
Almeria, 0.53 3.5 5.3 
Spain 
London 1.04 6.9 10.4 
Vietnam 0.65 4.4 6.5 


size of the PV array needed to generate 1 MWh per 
year in different locations around the world. 

Although easy to use, the sizing method based 
on energy balance does not give an indication of 
how the PV system will operate under fluctuating 
solar radiation. In particular, it does not predict 
the reliability of energy supply. The relationship 
between the reliability of supply and sizing is 
illustrated well by the random walk method [41]. 
The method consists of treating the possible states 
of charge of the battery as discrete numbers, 
which are then identified as sites for a random 
walk. Each day, the system makes a step in the 
random walk depending on solar radiation: one 
step up if it is “sunny” and one step down if it is 
“cloudy.” The magnitudes of these steps and prob- 
abilities of weather being “sunny” or “cloudy” are 
determined from the solar radiation data and the 
daily load. When the random walker resides in 
the top state, the battery is fully charged; when 
it is in the bottom state, the battery is completely 
discharged and the load is disconnected. The cal- 
culations are carried out by assuming that, after a 
certain time, the random walker reaches a steady 
state. The loss of load probability (LLP) is then 
equated to the probability of the random walker 
residing in the lowest state. Bucciarelli [42] sub- 
sequently extended this method to allow for cor- 
relation between solar radiation on different days. 
For a given LLP, the result can be expressed in the 
form of a sizing curve: a functional dependence 
of the array size on the battery size from which 
the least-cost system can easily be determined 
(Figure 16.18). These and other more complex siz- 
ing techniques have been summarized in more 
detail by Gordon [43]. 
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Figure 16.18 The sizing curve, representing 

the locus of PV system configurations with the 
same reliability of energy supply. The least-cost 
system can be found by a simple geometrical 
construction. 


16.4.2 Grid-connected systems 


Grid-connected systems have grown considerably 
in number since the early 1990s spurred by gov- 
ernment “feed-in tariffs” in a number of coun- 
tries, led initially by Switzerland and followed by 
more substantial programs in Germany, Japan, 
California, and Spain. One feature that affects 
the system design is the need for compliance with 
the relevant technical guidelines to ensure that the 
grid connection is safe; the exported power must 
also be of sufficient quality and without adverse 
effects on other users of the network. Although a 
common set of international standards is still some 
way off, it is probably fair to say that the funda- 
mental issues have been identified—partly through 
the work of the Task 5 of the International Energy 
Agency. In a number of countries, the required 
statutory guidelines have now been produced: in 
the United Kingdom, for example, the relevant 
Engineering Recommendation G77 was published 
in 2000 and some updated recommendations in 
2012 (Engineering Recommendation G83/2). An 
example of the requirements imposed on the grid 
interface of a PV system by the utility is shown in 
Table 16.3. 

Many of the grid connection issues are not 
unique to PV. They arise from the difficulties in 
trying to accommodate “embedded” or distributed 
generators in an electricity supply system designed 
around large central power stations. It is likely 
that many of these features of grid connection 
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Table 16.3 A summary of principal requirements 
on the grid interface of PV generators, covered 
in the UK engineering recommendation G77 


Function Reference 
Protection 
General IEC 255 


Under/over voltage UK guidelines? 


Under/over frequency UK guidelines? 


Loss of mains Specific for PV 


Supply quality 


Harmonics BS and EN> 
Voltage flicker BS and EN> 
Electromagnetic BS and EN 


compatibility 
DC injection UK guidelines? 
Safety 
Earthing BS<¢ 


2 Covered by existing UK guidelines for parallel con- 
nection of embedded generators. 

> Covered by existing British standard and European 
norm. 

* Covered by existing British standard. 


will undergo a review as electricity distribution 
networks evolve to absorb a higher proportion of 
embedded generation: wind farms, cogeneration 
(or CHP) units, or other local energy sources. The 
electricity supply system in 20 or 30 years’ time 
might be quite different from now, and new and 
innovative integration schemes will be needed 
to ensure optimum integration. PV generators 
are likely to benefit from these changes, particu- 
larly from the recent advances in the technology 
of small domestic size cogeneration units (micro- 
CHP) that have a good seasonal synergy with the 
energy supply from solar sources, and can share 
the cost of the grid interface. 


An example of how elegant architecture can 
be combined with forward looking engineering 
is offered by the Mont Cenis Energy Academy at 
Herne-Sodingen (Figure 16.19). This solar-cell clad 
glass envelope at the site of a former coal mine pro- 
vides a controlled Mediterranean microclimate 
that is powered partly by 1MW, PV array and 
partly by two cogeneration units fueled by methane 
released from the disused mine. To ensure a good 
integration into the local electricity supply, the 
generators are complemented by a 1.2MWh bat- 
tery bank. In addition to the Academy, the scheme 
also exports electricity and heat to 250 units in 
a nearby housing estate and a local hospital. The 
Mont Cenis Academy is a fine flag carrier for PV 
and new energy engineering—without a doubt, 
similar schemes will become more prolific as PV 
and energy efficient solutions become the accepted 
norm over the next few decades. 


16.5 CONCLUSIONS 


PV technology has come a long way since the 
first solar-powered satellites in the late 1950s. 
Based today mainly on crystalline silicon, new 
approaches that utilize thin-film technologies and 
molecular materials are beginning to make their 
mark in commercial production. If the current rate 
of growth continues, a combination of distributed 
solar power systems with large PV farms will sup- 
ply a significant part of our energy needs during 
the early part of this new millennium. One can 
foresee a bright future for this new, clean energy 
source. 
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Figure 16.19 The Mont Cenis Academy at Herne-Sodingen. 
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17.1 INTRODUCTION 


Nano- and microactuators are very small material 
structures and mechanisms that perform mechani- 
cal work in response to external stimuli. The size 
of these devices can range from a few molecules to 
several hundred microns. The mechanical action 
produced by the actuating structure is able to gen- 
erate tiny displacements or induce microforces 
on the surrounding medium (Knopf 2006, 2012; 
Tabib-Azar 1998). Examples of microscale actua- 
tors include cantilever beams, microbridges, flex- 
ible diaphragms, torsional mirrors, shape memory 
wires and strips, and expansive polymer gels. These 
miniature devices may exploit mechanical prin- 
ciples similar to much larger analogous systems or 
merely involve the subtle expansive and contractive 
characteristics of environmentally sensitive metals 
and polymers. In contrast, nanoactuators are often 
assembled from small groups of interconnected 
molecules that move in unison under an external 
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energy source. Molecular motor proteins found in 
living cells are an example of nanoactuators capable 
of producing piconewton (pN) forces (Kang et al. 
2009; Liand Tan 2001; Setou et al. 2000; Vale 2003). 

Different nano- and microactuation methods 
will take advantage of mechanical, electrostatic, 
piezoelectric, magnetic, thermal, fluidic, acoustic, 
chemical, biological, or optical principles. Although 
optical actuators may be one of the least studied, in 
the world of the very very small (Feynman 1960) 
optically driven transducers provide a number 
of interesting features and design opportunities. 
These actuators can be created to either directly or 
indirectly transform the light energy into structural 
movement (Knopf 2006, 2012; Tabib-Azar 1998). 
Direct optical methods use the photons to interact 
with the photosensitive properties of the actuator 
material in order to initiate mechanical displace- 
ment. One example of direct optical microactua- 
tion occurs when photoresponsive shape memory 
polymers (SMPs) (Jiang et al. 2006) are physically 
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deformed by exposing the material to ultraviolet 
(UV) radiation. On the other hand, indirect opti- 
cal methods exploit the ability of light to generate 
heat when the focused beam strikes the material 
surface and influences the thermal properties of 
nearby gases, fluids, and solids. One application 
of this concept is to use a focused light source to 
heat a liquid (e.g., Freon 113) that transforms into 
a gas and expands sufficiently to deform a very 
thin mechanically flexible diaphragm (Mizoguchi 
et al. 1992). The remote optical heating of liquids 
in Lab-on-a-Chip (LOC) devices also represents a 
novel approach to move fluid along a microchannel 
(Weinert and Braun 2009; Weinert et al. 2009) or 
mix adjacent streams of liquids (Shiu et al. 2010) 
prior to chemical analysis. 

At the nano- and microscale optically driven 
devices have important advantages over conven- 
tional microelectromechanical systems because 
streams of photons provide both the energy into 
the system and the control signal used to regu- 
late the actuator response. Optical actuators can 
be created to operate under different properties 
of light such as intensity, wavelength, phase, and 
polarization characteristics. Many of these light- 
driven mechanisms are also free from small elec- 
trical current leakage, resistive heat dissipation, 
and mechanical friction forces that may signifi- 
cantly reduce performance and efficiency. 

The fundamental operating principles and 
unique attributes of a wide variety of light- 
activated nano- and microactuators are explored 
in this chapter. The discussion begins with a brief 
look at how light can exert pressure on mechani- 
cal structures and even levitate artificial micro- 
objects. Section 17.3 introduces light-driven 
nanomotors comprised of single protein or DNA 
molecules that are able to switch between states 
causing the mechanism to shrink and expand like 
an inchworm. Several direct methods of optical 
actuation that take advantage of newly engineered 
photoresponsive materials are examined in Section 
17.4. Some of these materials are able to alter their 
shape, bend, or expand in volume when exposed to 
specific wavelengths of light. Other direct optical 
methods are also introduced using photon-gener- 
ated electrons to alter the electrostatic pressure ona 
microcantilever beam and that induce mechanical 
deformation. Indirect optical actuation methods, 
caused by the photothermal interactions arising 
from light striking a material surface, is described 


in Section 17.5. The principles of light material 
interactions are discussed and applications involv- 
ing diaphragm actuators and light-driven micro- 
flows are introduced. In addition, examples of 
temperature-induced phase transformation of sol- 
ids and photothermal vibrations of optical wave- 
guides for mechanical actuation are presented. In 
all cases, light is used to initiate movement and 
control the actuating mechanism that performs 
work. Finally, a summary of light-driven actuation 
and future opportunities for exploiting this tech- 
nology is provided in Section 17.6. 


17.2 FORCES GENERATED BY LIGHT 


In the late 1800s, the Scottish mathematical physi- 
cist James Maxwell (1873) predicted that it could 
be theoretically possible to generate measurable 
forces using a source of light. However, it was not 
until the dawn of the 20th century when Russian 
physicist Pyotr Lebedev (1901) experimentally 
confirmed Maxwell’s prediction. For many years, 
Lebedev’s observations were treated as minor sci- 
entific curiosities that would have little influence 
or impact on practical engineering applications. 
Although perceived to be impractical, the possi- 
bilities of using light to drive machines have con- 
tinued to inspire the imagination of numerous 
science fiction writers for more than a century. 

Recent advances in precision instrumentation 
and a deeper understanding of material science 
have opened doors not envisioned by Maxwell or 
Lebedev. The modern tools for scientific investiga- 
tion have enabled a broad spectrum of scientists 
and engineers from around the globe to explore the 
micron and submicron worlds where electromag- 
netic radiation plays a dominant factor in chang- 
ing the behavior of certain materials. In other 
words, in the world of the very very small light does 
matter. Through advanced material engineering 
and clever design, the properties of light (radia- 
tion pressure, intensity, wavelength, phase) can be 
transformed into small, yet meaningful, mechani- 
cal pressure, force, and displacement. 

In general, electromagnetic radiation from a 
focused light source will exert a small amount of 
pressure upon any exposed surface. The radiation 
pressure absorbed by that surface can be described 
as the power flux density divided by the speed of 
light. The driving force generated by light radia- 
tion pressure is based on the transfer of photons. 
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The photons have no mass but carry energy at the 
speed of light. The momentum, p,,,, of each photon 
in a light beam is the result of this energy (Ashkin 
2000; Steen 1998; Steen and Mazumder 2010) and 
can be given by 


ho 


a= (17.1) 


where h is the Planck’s constant (6.63 x 10“4Js), c is 
the speed of light in a vacuum (~2.99 x 10°m/s), and 
v is the optical frequency related to the wavelength 
(A) of the light source by 


v= — (17.2) 


ny 


The optical forces arising from this light-material 
interaction are the result of the exchange of 
momentum between the incident photons and 
irradiated object (Jonas and Zemanek 2008). 
A focused light beam with power of P will gen- 
erate (P/hv) photons per second. The force 
acting on the structure is equal to the change-in- 
momentum per unit time, (dproa/dt). If a light 
beam strikes a surface that absorbs 100% of the 
photons per second then corresponding force on 
the structure is 


AP Total P ho P 
Fy sorber = Fae i 17.3a 
atid ( dt (Zz I c ) c ( 


Alternatively, the principle of conservation of 
momentum (Ashkin 2000) states that if the same 
stream of photons strikes a highly polished mirror 
surface then all the photons are reflected straight 
back. As a consequence, the total change-in- 
momentum per second doubles the force acting on 
the structure, that is 


Foyiror = a a (17.3b) 
dt ho J. c c 


A single-focused laser beam will, however, only 
produce a very weak force. For example, a single 
ImW diode laser pointer pen will generate a 
force in the range of a pN. Therefore, to suspend 
a small solid object, such as a coin, in the air it 
would be necessary to use 10° similar pen lasers 
all pointed at the same spot on the object’s surface 


(Tang 2009). Increasing the power of the light 
source 10° times will result in an equivalent kW 
laser with sufficient energy to ablate or vaporize 
the coin surface (depending on beam spot size). 
Although such tiny forces are not able to suspend 
or move large objects, this perceived weakness can 
be turned into an engineering strength if the goal 
is to manipulate solid objects that weigh only few 
pictograms or move microliters of liquid. At this 
scale, optics can be used to shape and redirect 
light beams to strike the object surface at precise 
locations (Knopf 2012). 

The optical gradient forces necessary to hold and 
induce micro-object movement were first explored 
more than four decades ago. In the early 1970s, 
Ashkin (1970) developed a single-beam gradient 
force optical trap based on the principle that a 
laser beam brought into sharp focus will generate 
a restoring force that can pull particles into that 
focus. The generated force is the result of the elas- 
tic scattering of laser photons by the particle such 
that the object alters the direction of the photon 
momentum without absorbing any of the beam 
energy. The basic operating principle of an opti- 
cal trap is that light carries momentum in the 
direction of propagation that is proportional to 
its energy. Any change in the direction of the light 
rays arising from either reflection or refraction will 
produce a change in the momentum. If the object 
in the trap bends the light rays and changes the 
momentum of the light, then the object under- 
goes an equal and opposite change in momentum. 
The momentum transfer gives rise to a force act- 
ing on the object. In a typical setup, Figure 17.1, 
the incoming laser light has a Gaussian intensity 
profile that is brighter in the center of the beam 
than at the edges. When the light interacts with 
the object, the rays are bent according to the laws 
of reflection and refraction. The sum of the forces 
from all light rays in the beam can be divided into 
scattering (f,) and gradient (f,) components. The 
scattering force (f,) points in the direction of the 
incident light, whereas the gradient force (f,) arises 
from the gradient intensity profile pointing toward 
the center of the beam. The gradient force is the 
restoring force that pulls the object into the beam 
center. If the contribution to the scattering force of 
the refracted rays is larger than that of the reflected 
rays, then a restoring force is also created along the 
incident light direction and a stable optical trap is 
formed. 
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Figure 17.1 The light-particle interactions that occur in a focused Gaussian laser beam during 
transverse and axial optical trapping. (a) The light refracted through a transparent object with a high 
refractive index produces an optical force that is balanced when the object reaches the beam cen- 
ter. (b) The electromagnetic field of the light causes the object to act as an induced dipole, which is 
drawn into the beam focus where its energy is minimized. (Reprinted from Nanotoday, 1, Dholakia, 
K. and Reece, P., Optical micromanipulation takes hold, 18-27, Copyright 2006, with permission from 


Elsevier.) 


In his seminal work, Ashkin was able to experi- 
mentally observe the forced acceleration of freely 
suspended particles by radiation pressure from a 
continuous wave (CW) visible laser beam. Based on 
this observation, he was also able to lift and levi- 
tate a very small glass sphere (Ashkin 1970). A later 
experiment with his colleague Dziedzic (Ashkin 
and Dziedzic 1971, 1980) showed that it was pos- 
sible to trap a glass sphere in a vertical laser. The 
concept of optical trapping and tweezing (Dholakia 
and Reece 2006; Dienerowitz et al. 2008, Kroner 
et al 2007; Nieminen et al. 2006) has more recently 
been applied to a variety of scientific areas includ- 
ing colloidal dynamics, statistical mechanics, cell 


biology, and nanomanipulation. In the late 1990s, 
Higurashi et al. (1997) demonstrated how carefully 
shaped fluorinated polyimide micro-objects with 
a 6-7.5 tm cross-sectional radius can be rotated 
using Ashkin’s method of optical trapping and 
tweezing. In this series of experiments, applying 
radiation forces near the focal point were used to 
reposition and rotate the micro-object about the 
laser beam axis. 

Over the past decade, Shoji Maruo and his col- 
leagues at the Yokohama National University 
(Maruo and Inoue 2007; Maruo 2008, 2012) devel- 
oped a large variety of light-driven micromachines 
based on the concept of an optical trap. Polymeric 
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microstructures such as microgears (Figure 17.2) 
were fabricated using two-photon microstereo- 
lithography techniques and driven by a circular 
scanning laser beam. During operation, the gear 
is optically trapped when the laser beam is focused 
on the center of the tooth. However, if the focus is 
moved slightly to one side of the gear tooth cen- 
ter then the net radiation pressure will produce an 
external force on the tooth moving it toward the 
focus. In this manner, a circular scanning laser 
beam was used to rotate the microgear and vari- 
ous motion patterns were generated by modifying 
the beam trajectory. In addition to driving gears, 
it is also possible to use optical tweezing to con- 
trol multiple micromanipulators. For example, the 
polymeric three-hand micromanipulator shown in 
Figure 17.3 was developed by the research team to 
grasp and manipulate tiny objects. The manipulator 
arms were driven sequentially using a single laser 
beam and a time-divided laser scanning method 
(Maruo 2008). By optimizing the repetition rate 
(~100Hz) between trajectories, it was possible to 
handle the 5 ym diameter glass microbead. 


The 3D microfabrication technology was also 
used by the Yokohama researchers to create an 
optically driven microfluidic pump. The micro- 
pump was comprised of two interlocked spinning 
9 um diameter rotors that could regulate the liquid 
flow in a microfluidic channel (Maruo 2008, 2012). 
Each individual rotor had two lobes and was held 
within the microchannel by a stationary shaft. A 
tightly focused laser beam was then used to cre- 
ate forces necessary to drive the micropump. Once 
more, the pumping mechanism was controlled 
by changing the trajectory of the scanning beam. 
The velocity of tracer particles added to the fluid 
was observed and determined to be 0.2-0.7 pm/s, 
which was directly proportional to the rotation 
speed of the rotors. An optically driven micro- 
pump that used only a single-disk microrotor was 
later introduced by Maruo and his colleagues. In 
this particular design, the single 10 pm diameter 
disk rotor had three columns as targets for the 
optical trap. In addition, the shaft had been elimi- 
nated by confining the disk to a U-shaped micro- 
channel. The laser beam was focused on the center 
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Figure 17.2 A circular scanning laser beam is used to optically drive a polymeric microgear as shown 
in (a). The gear is trapped (b) when the beam is focused on the center of the tooth causing the radia- 
tion pressure exerted on the microstructure to be balanced. If the focus is moved slightly to the 

side of the gear tooth (c), then the net radiation pressure is directed to the focus creating an exter- 
nal force. (With kind permission from Springer Science+Business Media: Nano-and Micromaterials, 
Optically driven micromachines for biochip application, 2008, 291-309, Maruo, S., Figure 12.9.) 
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Figure 17.3 Optical microscopic images of three-hand micromanipulators grasping a 541m diameter 
microglass bead. By optimizing the repetition rate (~100 Hz) between trajectories, it is possible to handle 
the microbead. (With kind permission from Springer Science+Business Media: Nano-and Micromaterials, 
Optically driven micromachines for biochip application, 2008, 291-309, Maruo, S., Figure 12.16.) 


column in an effort to hold the rotating disk with- 
out the need for a mechanical shaft. Experiments 
by the Yokohama researchers showed that the flow 
velocity is proportional to the rotational speed of 
the microrotor. 

Expanding on these concepts, researchers at 
MIT (Tang 2009) were able to perform nanoma- 
nipulation on a microchip using gradient optical 
forces. Specifically, the researchers showed it was 
possible to generate a gradient force in the pN range 
that was sufficient to activate a nanoscale oscillator. 
The investigated device was constructed from two 
parallel optical waveguides and required a light 
source at a known frequency (Li et al. 2008, 2009; 
Tang 2009). Hong X. Tang (2009) and his team at 
Yale continued this work and produced oscilla- 
tions using only one single-mode waveguide. The 
net force was generated by creating an asymmetri- 
cal optical field around the free-standing single- 
mode waveguide. A measureable transverse force 
exists on the waveguide because the guided light 
is evanescently coupled to the dielectric substrate. 
In this configuration, both the effective refractive 
index and the force on the waveguide depend on 
the separation distance between the waveguide 
and substrate. The calculations by Li et al. (2008) 
show that as the separation is reduced from 500 to 
50 nm, the magnitude of the optical force increases 
from 0.1 to 8 pN pm! mW. Although these trans- 
verse gradient optical forces are very small, this 


basic principle can be used to throw switches in 
silicon optical circuits (Tang 2009) and to develop 
nanomechanical beam resonators embedded in a 
photonic circuit with an on-chip interferometer for 
displacement sensing (Li et al. 2008, 2009). 


17.3 PHOTON-DRIVEN 
NANOACTUATORS 


Nanoactuators are pN force-generating structures 
constructed from several independent molecules 
to several thousand interconnect molecules func- 
tioning as a single system under an external energy 
source. An example commonly found in biology is 
a molecular nanomotor comprised of DNA mol- 
ecules or a single protein. The DNA nanomotors are 
able to switch between intramolecular tetraplex and 
intermolecular duplex states by alternating through 
DNA hybridization and strand exchange reactions 
causing the motor to shrink and expand like an 
inchworm (Kang et al. 2009; Li and Tan 2001). From 
an engineering perspective, these molecular devices 
can be exploited for developing novel methods in 
drug delivery, biochip design, and even nanoscale 
manufacturing. 

Kang et al. (2009) describes a light-driven single- 
molecule DNA _hairpin-structured nanomotor 
that utilizes a photosensitive azobenzene molecule 
to induce movement (Figure 17.4). Azobenzene 
undergoes a reversible cis—trans isomerism when 
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Figure 17.4 An illustration of the photoswitchable single-DNA nanomotor. The basic nanomotor com- 
ponents are a hairpin backbone, photosensitive azobenzene molecules, and fluorophore/quencher 
pair for signaling motor movement. The average size of hairpin structure is L,, and the average size of 
extended molecules based on persistence length of a single DNA strand is L,. The single-DNA nano- 
motor exhibits a CLOSE state when exposed to visible irradiation and an OPEN state with irradiated 
by UV light. (Reprinted with permission from Kang, H. et al., Nano Letters, 9(7): 2690-2696. Copyright 


2009, American Chemical Society.) 


exposed to alternating UV and visible radiation. 
Note that the term cis means “on the same side” 
and trans means “on the other side.” When azo- 
benzene is linked to other molecules then the 
switching mechanism can cause a relatively large 
dimensional change in the functionalized material. 
By controlling the azobenzene moieties integrated 
on the DNA bases in the hairpin’s duplex stem seg- 
ment, it is possible to modify the dehybridization 
(open) and hybridization (closed) state of a hairpin 
structure and control movement. Since the open- 
close cycle of the hairpin molecule exhibits revers- 
ible extension and contraction behavior, it can be 
identified as a nanomotor. The photoregulation of 
this simple system is concentration-independent 
and, therefore, suitable for fabricating high-density 
molecular motors. 

The notion of a molecular photoswitch has 
also been used to control the movement of living 
microorganisms. Neil Branda and colleagues at 
Simon Fraser University (Al-Atar et al. 2009; Boyer 
et al. 2010; Carling et al. 2009) have demonstrated 
that the light-induced reactions of photoresponsive 
dithienylethene can be used to trigger paralysis in 
the transparent nemotode worm, Caenorhabditis 
elegans. The C. elegans neotodes are fed a “ring- 
open” mixture of bipyridinium dithienylethene 
and a buffer containing 10% dimethlsulfoxide 


and exposed to a UV light (365 nm) for 2 min. The 
shorter UV wavelength radiation causes the dith- 
ienylethene photoswitch to undergo a ring-closing 
reaction that immobilizes the organism and turns 
it blue. These researchers further demonstrated 
that the paralyzed C. elegans worms could be 
reanimated and returned to their original color- 
less state by exposing the organism to visible light 
(> 490 nm) for 20min. This was possible because 
the visible light triggered the reverse reaction in 
the photoswitch by producing the ring-open iso- 
mer. Although the intent of Branda and his fellow 
researchers was not to create an optically driven 
nanoactuator, they were exploiting light to regulate 
the movement of a biological organism to perform 
a simple directed task. This study and its possible 
extension to in vivo drug delivery are of particu- 
lar interest to researchers who are developing new 
actuation technologies for biomedical applications 
(Al-Atar et al. 2009; Carling et al. 2009). 


17.4 PHOTOMECHANICAL 
RESPONSE OF MATERIALS 


Since the early 1990s, a large variety of materi- 
als have been investigated that exhibit optical-to- 
mechanical energy conversion properties. These 
materials include light-induced shape-changing 
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polymers (Jiang et al. 2006; Mitzutani et al. 
2008), shape-changing liquid crystal elastomers 
(LCEs; Finkelmann et al. 2001; Tabiryan et al. 
2005; van Oosten et al. 2007, 2009; Warner and 
Terentjev 1996, 2003; Yu et al. 2003), photostric- 
tive materials that take advantage of photovoltaic 
and inverse piezoelectric effects (Morikawa and 
Nakada 1997; Poosanass et al. 2000; Uchino 1990, 
2012), photomechanical actuators with charge- 
induced surfaces (Datskos et al. 1998; Lagowski 
et al. 1975; Suski et al. 1990), chalcogenide glasses 
(ChGs) influenced by mechanical polarization 
effects (Krecmer et al. 1997; Stuchlik et al. 2001), 
and photomechanical actuation of carbon nano- 
tubes (CNTs) (Kroerner et al. 2004; Liu et al. 2009; 
Piegari et al. 2002; Verissimo-Alves et al. 2001; 
Zhang and Iijima 1999). The functional behavior 
of several types of light-driven materials is sum- 
marized below. These categories are based on the 
underlying photomechanical response behavior 
and include shape-changing polymers, ferroelec- 
tric materials exhibiting photorestrictive effects, 
electrostatic structures mediated by photogen- 
erated electrons, and light-induced elastic CNTs. 


17.4.1 Shape-changing polymers 


A variety of polymeric gels and polymer films have 
been developed that exhibit relatively large dis- 
placements when exposed to light radiation. For 
example, hydrogels are fluid-filled polymeric net- 
works that can undergo dramatic volume changes 
when exposed to an environmental stimulus such 
as a change in surrounding temperature, pH, elec- 
tric field, specific ion concentration, and light. This 
unique ability to generate relatively large actua- 
tion dynamics and specificity of the triggering 
mechanisms has made smart hydrogels the prefer- 
able candidate for the core sensing and actuation 
transducers in a variety of microsystem technolo- 
gies (Al-Aribe et al. 2012; Knopf 2012; Knopf and 
Al-Aribe 2012). Poly(N-iso-propyl acrylamide) 
(PNIPAM) with triphenylmethane leuco deriva- 
tives is a UV light responsive hydrogel that swells 
when exposed to UV radiation and shrinks back 
to the original shape when the light is removed 
(Hirasa 1993). The swelling of the hydrogel is 
caused by an increase in osmotic pressure within 
the gels due to ionization reactions and ion-pair 
formation initiated by the UV irradiation. It is 
also possible to create hydrogels that respond to 


visible light by introducing trisodium salt of cop- 
per chlorophyllin into the PNIPAM gel. The swell- 
ing behavior of these thermosensitive gels are 
linked to the increase in temperature arising from 
the absorption of light energy. Some PNIPAM gels 
have also been observed to undergo very slow vol- 
ume changes under radiation exposure even when 
no photosensitive molecules are attached to the 
macromolecules (Hirasa 1993). 

An alternative approach is to create hydrogel 
microstructures that are responsive to specific 
wavelengths of light. To accomplish this Sershen 
et al. (2005) introduced an optomechanically 
responsive nanocomposite hydrogel that under- 
goes pronounced and reversible changes in shape 
when exposed to different wavelengths of light. 
The materials are composites of a thermally 
responsive polymer (PNIPAM with a 95:5 como- 
nomer ratio) and nanoparticles that have distinct 
optical absorption profiles. Duff and Baiker (1993) 
used gold colloids, while Oldenburg et al. (1998) 
used gold nanoshells, with an 110 nm diameter 
silica core and 10 nm thick gold shell, for similar 
purposes. The composites were formed by mix- 
ing the nanoparticles with the monomer solution 
thereby trapping the particles within the hydrogel 
matrix after polymerization. These nanocomposite 
materials responded to different wavelengths of 
light. Sershen et al. (2005) further demonstrated 
the independent control of two valves along the 
T-junction as shown in Figure 17.5. One valve is 
constructed from gold-colloid nanocomposite 
hydrogel and the other is gold-nanoshell nano- 
composite hydrogel. The channels are 100 jim wide. 
The entire device is illuminated with 1.6 W/cm2, 
532 nm (green) light source. The gold—colloid valve 
opened while the nanoshell valve remained closed. 
However, when the device is illuminated with 
2.7 Wicm?, 832 nm (near infrared) light then the 
opposite response was observed. In both cases, the 
valves opened within 5s. 

In order to respond directly to light, shape- 
transforming solid polymer films must often con- 
tain photosensitive fillers (Jiang et al. 2006) such as 
triphenylmethane leuco derivatives that undergo 
photoinduced ionic dissociation, photoreactive 
molecules (e.g., cinnamates), and photoisomeriz- 
able molecules (e.g., azobenzenes). For example, 
the triphenylmethane leuco derivatives will dis- 
sociated into ion pairs under UV illumination and 
recombine under dark conditions when the light is 
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Figure 17.5 Two light-activated nanocomposite hydrogel valves inserted at a T-junction in a micro- 
fluidic chip where one valve is made of an Au-nanoshell composite (left) and the other Au-colloid 
composite (right). The channels are 100 pm wide. (a) When the T-junction is illuminated with green 
light (532 nm, 1.6 W/cm?), the Au-colloid valve opened while the nanoshell valve remained closed. 
(b) However, when the device was illuminated with near-infrared light (832 nm, 2.7 W/cm?), the 
opposite response was observed. (Sershen, S. R. et al. Independent optical control of microfluidic 
valves formed from optomechanically responsive nanocomposite hydrogels. Advanced Materials, 
2005, 17, 1366-1368. Copyright Wiley-VCH Verlag GmbH & Co. KGaA. Reproduced with permission.) 


extinguished. If the triphenylmethane leuco deriva- 
tives are embedded in a nonrigid solid-state polymer 
or gel then the reversible variation of electrostatic 
repulsion between the photogenerated charges can 
induce expansion and contraction in the polymer. 
Cinnamate-type groups can also be used to create 
light responsive SMPs because these photochemi- 
cally reactive molecules form photoreversible cova- 
lent cross-links in the material. The most common 
photosensitive molecules are azobenzenes because 
these molecules switch reversibly from the cis or 
trans conformation states when exposed to light ofa 
predetermined wavelength (Finkelmann et al. 2001; 
Jiang et al. 2006; Warner and Terentjev 1996). If the 
azobenzene groups are linked to macromolecules 
then the light-driven switch can cause relatively 
large changes in the polymeric material. 

Some researchers have explored how nem- 
atic LCEs can be used to create photoresponsive 
SMPs. The term nematic refers to molecules that 
tend to align in loose parallel lines. For nematic 
LCEs containing azobenzene groups, the light- 
induced reaction from trans > cis isomerization 
of the azobenzene units produces the movement 
of the liquid crystal (LC) domains and subse- 
quently causes the collapse of the alignment order 
producing a significant contraction. Studies have 
also shown that the bending direction of certain 
LCE films can be controlled by applying linearly 
polarized light (Tabiryan et al. 2005; Yu et al. 
2003). In these cases, the incident light is largely 
absorbed by the surface layer of the polymer film 


because of the strong absorption by the azoben- 
zene moieties. 

Light intensity, wavelength, and the length 
of time that a LCE sample is irradiated will all 
impact the extent and speed of shape recovery. The 
response times of nematic elastomers is slow but 
can be improved by either decreasing the thick- 
ness of the polymer film or increasing the light 
intensity (Yu et al. 2003). LCEs obtain their unique 
shape-changing properties from the interrelation- 
ship between the elastic properties of the polymer 
networks and the ordering of the mesogenic liquid 
crystalline moieties. As a result, nematic LCEs are 
able to change their shape by up to 400% over a rela- 
tively narrow temperature interval straddling their 
nematic-isotropic (N-I) transition temperature. 
Finkelmann et al. (2001) found that at a tempera- 
ture of 60°C, the fractional contraction of nematic 
elastomers was as large as 22%. Yu et al. (2003) 
demonstrated that the large bending ofa single film 
of LC network containing an azobenzene chromo- 
phore can be produced by UV light. Tabiryan et al. 
(2005) reported a reversible bidirectional bending 
of the azo-LC polymer by switching the polariza- 
tion of the light beam between orthogonal direc- 
tions. For these samples, both the magnitude and 
sign of photoinduced deformation were controlled 
by the polarization state of the light beam. 

Yamada et al. (2008) performed a series of tests 
on a laminated structure composed of a LCE layer 
and a flexible plastic sheet in an effort to study the 
mechanical forces generated on the composite film 
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when exposed to UV light radiation. In these tests, 
both ends of the film were rigidly clamped and the 
film was then loaded with a force of 44 mN at 30°C. 
Although a single-layer LCE layer was experimen- 
tally shown to be brittle and crack at high intensi- 
ties due to low mechanical strength, the composite 
film was able to generate significant forces without 
breakage. The researchers also used the laminated 
LCE film as a plastic belt attached to a pulley sys- 
tem (Figure 17.6) to create a light-driven motor that 
directly converted light energy into rotation. The 
pulleys of the system rotate by exposing the top of 
the belt to UV light while irradiating the bottom 
side with visible light. The simultaneous contrac- 
tion and expansion forces at different regions along 
the long axis of the belt induce rotation in the pul- 
leys and enabled them to move continuously. 


Alignment 
direction 


Belt 


Van Oosten et al. (2008) investigated the revers- 
ible nonlinear response of photostimulated LC 
polymer by looking at the bending action of a pla- 
nar uniaxially aligned film. When exposed to laser 
light (351 nm) from one direction, the LC film would 
rapidly bend toward the light source (Figure 17.7). 
After a period of prolonged exposure, the film ini- 
tially relaxes and then curves in the opposite direc- 
tion over time. The speed of the transformation 
was experimentally found to increase as the LC 
film uncurls because of the unbending sample will 
be exposed to more direct light. The artificial cilia 
were similar to the natural cilia found in parame- 
cia in that it may produce a flapping, asymmetrical 
motion causing the surrounding liquid to flow. The 
asymmetric motion was the result of a backward 
stroke different from the forward stroke, which had 


Figure 17.6 A light-driven motor with a plastic belt constructed from LCE-laminated film. The basic 
motor design and relationship between the rotation direction and UV irradiation positions are illus- 
trated in (a). The diameter of the larger left pulley is 10mm and the smaller right is 3mm. The LCE 
plastic belt is 36 mmx5.5mm. The time series of images in (b) show the rotation profiles of the LCE- 
laminated film induced by 366 nm, 240 mW/cm? UV irradiation and >500 nm, 120 mW/cm? visible light. 
(Yamada, M. et al. Photomobile polymer materials: Towards light-driven plastic motors. Angewandte 
Chemie International Edition. 2008, 47, 4986-4988. Copyright Wiley-VCH Verlag GmbH & Co. KGaA. 


Reproduced with permission.) 
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Figure 17.7 Photostimulated bending of light-driven LC actuators driven by the isomerization of 
azobenzene is illustrated in (a). The top row of images show the response of the LC film oriented with 
Side A toward the 351 nm, 150 mW/cm? laser light. The bottom row is the same LC film actuator with 


Side B oriented toward the laser light. (b) The bending radius (1/R) of the sample as it is exposed 
to light for both Side A (C) and Side B (*). (Reprinted with permission from Van Oosten, C. L. et al. 
Macromolecules 41: 8592-8596. Copyright 2008, American Chemical Society.) 


been introduced into the LC azobenzene artificial 
cilia by temporally varying the intensity and wave- 
length of the light over the microactuator surface. 
The researchers envisioned that this simple bending 
principle could be used to create light-driven arti- 
ficial cilia that function as a microfluidic mixer or 
pump (van Oosten et al. 2009). 


17.4.2 Photostrictive ferroelectric 
materials 


When irradiated by light, various ferroelectric 
materials (Uchino 1990) will generate mechanical 
strain that can produce observable movements and 
measureable microforces. The photostrictive prop- 
erties of these materials are the result of combined 
photovoltaic and the converse piezoelectric effects 
(Poosanaas et al. 2000). The photovoltaic property 
causes the ferroelectric material to generate volt- 
ages in the kV/cm range when irradiated, while 


the converse piezoelectric effect causes the same 
material structure to simultaneously expand or 
contract. The most commonly used. piezoelectric 
materials are the lanthanum-doped lead zirconate 
titanate (PLZT) ceramics. The photostriction prop- 
erties of these PLZT ceramics are almost instanta- 
neous when exposed to light and, therefore, have 
been used to develop a number of rapid response 
optical microactuators, relays, and photon-driven 
micromachines (Uchino 1990, 2012). 

A bimorph-type optical actuator created by 
Morikawa and Nakada (1997) was able to pro- 
duce displacements of several hundred microns. 
The light-driven microcantilever beam was con- 
structed from a pair of adhesively bonded PLZT 
ceramic elements oriented in opposite polar- 
ized directions. When the upper PLZT element 
in the cantilever device was illuminated by a UV 
light source, the element stretched in the polar- 
ized direction, while the lower nonradiated PLZT 
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element did not expand. Since the two PLZT ele- 
ments were connected to a common electrode and 
the lower nonexpanding element was oriented in 
the opposite polarized direction of the top, the 
lower element would experience a negative volt- 
age and correspondingly contract due to the same 
piezoelectric phenomena. The combined effect of 
the two PLZT ceramic elements was a significant 
downward displacement of the microcantilever 
beam. By switching the illumination to the bot- 
tom element, it was possible to drive the cantilever 
beam upward. 

A similar photomechanical effect has also been 
used by Suski et al. (1990) to produce photostim- 
ulated vibrations in silicon (Si) microcantilever 
beams that were covered in a thin polycrystalline 
ZnO film. Earlier studies (Gatos and Lagowski 
1973; Lagowski and Gatos 1972; Lagowski et al. 
1975) had shown that thin crystals of polar (non- 
centrosymmetric) semiconductors can exhibit 
piezoelectric properties by light-induced elec- 
tronic transitions. The photomechanical response 
arises from the depopulation and population of 
surface states by sub-bandgap illumination while 
the overall number of bulk free carriers remains 
the same (Suski et al. 1990). The barrier height 
of the depleted layer can also change under these 
illumination conditions resulting in surface stress 
variations. When the light source is modulated, 
the barrier height will vary according to the fluc- 
tuations in photon intensity producing surface 
stresses with the same frequency. Consequently, 
it is possible to create resonant vibrations in the 
silicon (Si) microcantilever beam by optically 
altering the frequency of the surface barrier so 
that it closely matches the natural frequency of 
the beam (Lagowski et al. 1975; Suski et al. 1990). 
The observed photomechanical effect is consistent 
with the surface piezoelectric phenomenon where 
the external stress applied to polar semiconductors 
leads to a modification of the surface barrier height 
and causes pronounced changes in contact poten- 
tial difference (Lagowski et al. 1975). 

Suski et al. (1990) demonstrated the photome- 
chanical effect in the early 1990s by fabricating 
a 10 mmxX1.5 mmx50pm Si/SiO,/ZnO micro- 
cantilever beam and activating it using an argon 
laser (520 nm). The microcantilever resonated at 
a frequency of ~350Hz with a maximum deflec- 
tion of 160 nm under 130 pW of light power. The 
photostimulated effect was also observed in bulk 


semiconductor materials where the photon energy 
irradiating on the Si microcantilever was above 
the bandgap. As the photons become absorbed in 
the semiconductor material the free electrons are 
excited and move from the valance band into con- 
duction band (CB) leaving holes in the lattice. The 
movement of electrons creates local mechanical 
strain in the material. 

Another photomechanical effect has been 
observed in ChG when the material absorbs polar- 
ized light (Stuchlik et al. 2001, 2004). Reversible 
photoinduced anisotropy was the first reported by 
Krecmer et al. (1997) who showed that when a thin 
amorphous film of As.,Se,, deposited on a clamped 
atomic force microscope cantilever was exposed to 
polarized irradiation it would exhibit reversible 
nanocontraction movement. To demonstrate the 
effect, the research team performed measurements 
on a 200pm cantilever beam with a thickness of 
0.6 um. The surface of the beam was covered witha 
thin 250 nm As.,Se., film. When exposed to polar- 
ized light, the beam bent approximately +1 pm 
(Stuchlik et al. 2001). Upon irradiation with polar- 
ized light, a very small movement in the ChG was 
measured parallel to the direction of the electric 
field of the light and a very small expansion was 
also observed along the axis orthogonal to the elec- 
tric field. 

Poly(vinylidene difluoride) (PVDF) is another 
ferroelectric polymer with both pyroelectric and 
piezoelectric properties that can be optically 
driven (Mizutani et al. 2008). When the temper- 
ature of the PVDF material is increased through 
light irradiation, it causes a piezoelectric effect 
to occur and the material mechanically deforms. 
Mizutani et al. (2008) further demonstrated how 
a strip of PVDF could be used to create a small leg 
for a light-driven microrobot (Figure 17.8). In this 
case, one surface of the PVDF film was coated with 
a thin silver (Ag) electrode. When irradiated with 
a He-Ne laser, the PVDF cantilever generated an 
electric field by means of the pyroelectric effect. 
The thickness of the PVDF cantilever was 28 pm 
and the Ag electrode was 6 pm thick. The pyroelec- 
tric effect produced by the PVDF-generated con- 
duction electrons that were dispersed on the Ag 
electrode. The electric field in the cross section of 
the PVDF film induced the inverse piezeoelectric 
effect that caused the microcantilever leg to bend 
toward the light. In these experiments, a 10 mW 
irradiation was used to move the microcantilever 
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Figure 17.8 Photograph of the optically driven PVDF leg actuator (a) and the displacement of 

the leg for two different frequencies (1 Hz and 2Hz) (b). (With kind permission from Springer 
Science+Business Media: Optical Review, Optically driven actuators using poly (vinylidene difluoride), 
15, 2008, 162-165. Mizutani, Y. et al., Figures 6 and 7.) 


250 pm in 0.5 ms. The velocities were 33.3 [pm/s at 
1 Hz and 76.7 m/s at 2 Hz. 


17.4.3 Electrostatic pressure 
mediated by photogenerated 
electrons 


A “warped capacitor” microactuator that uses pho- 
togenerated electrons to change the electrostatic 
pressure on a thin silicon (Si) microcantilever beam 
was introduced by Tabib-Azar and his colleagues 
in 1990 (Tabib-Azar 1990; Tabib-Azar and Leane 
1990; Tabib-Azar et al. 1992; Tabib-Azar 1998). The 
parallel plate capacitor consists of the thin P* silicon 
microcantilever beam attached to a glass substrate 
insulator and suspended over a gold (Au) ground 
plate. Photogenerated electrons occur when the Au 
ground plate is exposed to a light source with suffi- 
cient energy. The photoelectrons travel through the 
air gap to the deformable microcantilever beam, 
reducing the charge on the capacitor, and causing 
the cantilever to bend. The capacitance (C) of the 
flat parallel plates is given by the expression 


C= keA — &oLb 


‘ 17.4 
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where k is relative permittivity of the dielectric 
material between the plates (k ~1 for air), €, is the 
free-space permittivity (8.854 107F/m), b and 
Lare the width and length of the cantilever beam, 
respectively, A is the surface area of the parallel 


plate capacitor, and d is the distance between 
the cantilever beam and the ground plane. From 
the definition of capacitance, the unit Farad (F) 
is equal to C/V. A bias current must be used in 
order to make the microbeam capacitor function 
effectively as a microactuator. A separate voltage 
source, V,, through a resistor is connected across 
the capacitor to form the bias current (Figure 
17.9). 

If the change in capacitance as the cantilever 
beam deforms is neglected, the steady-state deflec- 
tion at the end of the microbeam can be described 
in terms of the charge on the capacitor. However, 
the cantilever beam will bend in a stable manner 
up to the threshold of spontaneous collapse (Tabib- 
Azar 1998) given by 


6eV? L' 
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which is a function of the Young’s modulus (Y) and 
the thickness of the cantilever beam (b). The fast- 
est time that it can smoothly traverse this distance 
is approximately equal to the period of the funda- 
mental mode of free vibration. 

Tabib-Azar (1998) constructed a microactua- 
tor that uses a 600X501 pm? cantilever beam 
with a gap of 12 um to demonstrate this concept. 
A bias voltage of 6 V and optical power less than 
0.1mW/cm? was used to move the cantilever 
4\1m in approximately 0.1 ms. The light-controlled 
actuation is possible by charging the capacitor 
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Figure 17.9 The basic design of the electrostatic “warped capacitor” microactuator originally investi- 
gated by Tabib-Azar (1998). Photogenerated electrons occur when the gold ground plate is exposed 
to light with sufficient energy. The photoelectrons across the air gap and reduction in the charge on 

the parallel plate capacitor cause the microcantilever to bend. 


with a current from the battery circuit. A contin- 
uous photon flux, ®<i/n, where n is the quan- 
tum efficiency, short circuits the capacitor more 
slowly than the battery charges it, causing a charge 
buildup which closes the plates. A photon flux 
@ >i/n causes an opposing photocurrent greater 
than the charging current. The net charge then 
decreases, and the capacitor plates relax open. In 
this manner, the light source is not the primary 
source of energy to the actuating system rather it is 
sufficient to enable the action by pushing the cur- 
rent level above a threshold. 


17.4.4 Light-induced elastic CNTs 


CNTs are a relatively new class of one-dimensional 
carbon nanomaterials that have extraordinary 
mechanical, electrical, and thermal properties 
(lijima 1991; Sun and Li 2007). The Young’s modu- 
lus of a CNT is over 1 TPa and the tensile strength 
is approximately 200GPa. The thermal conduc- 
tivity can be as high as 3000 W/mK. Cylindrical 
single-walled carbon nanotubes (SWCNTs) have 
relatively small diameters (~0.4 nm) and based on 
their structure can be either metallic or semicon- 
ducting. Studies in the late 1990s suggest that the 
structure distortion caused by the van der Waals 


forces can modify the electronic structure of the 
CNT and influence both the mechanical and opti- 
cal properties of SWCNT bundles. 

Light-induced elastic responses from SWCNT 
bundles and fibrous networks were first reported 
by Zhang and Iijima (1999). These research- 
ers observed the elastic movement of bundles of 
~20-50 pm long SWCNT filaments when exposed 
to visible light. The electrostatic interaction of the 
SWCNT bundles was believed to be the cause of 
the elastic filament behavior. The authors con- 
cluded that the effect was the result of photovol- 
taic or light-induced thermoelectric effect related 
to the modification of electronic structure during 
the bundle formation. The orientation and align- 
ment of highly anisotropic CNTs has also been 
shown to be an important factor in determining 
the photomechanical properties. Liu et al. (2009) 
studied the effects of CNT alignment on the pho- 
toconductivity of thin films and showed that a 
high degree of nanotube orientation can improve 
the power conversion efficiency ~10%. In addition, 
films containing partially oriented CNTs pro- 
duced faster response times and attained a higher 
internal photon to electron power conversion 
efficiency than the film made up of nonoriented 
nanotubes. 
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Polymer CNT and nanocomposites have been 
receiving a great deal of attention in recent years 
because of enhanced mechanical properties and 
their unique electrical and thermal properties. 
One of the earliest material-based systems was a 
CNT-filled thermoplastic elastomer (Morthane) 
nanocomposite introduced by Kroerner et al. 
(2004). Morthane is a linear, hydroxyl termi- 
nated polyester polyurethane that exhibits a low 
glass transition temperature (T, = —45°C), near- 
ambient melting temperature of the soft-segment 
crystallites (T,, ,.=48°C), and exhibits huge strain- 
induced deformations (~700%). Significant defor- 
mations at room temperature (T, < Tyoom < Ty, 5) 
causes the flexible polymer segments to crystallize. 
The crystallization process forms physical cross- 
links, which prevent the polymer from undergoing 
strain recovery when the applied stress is removed. 
Subsequent heating and melting of the strain- 
induced soft-segment crystallites releases the con- 
strained polymer chains forcing the material to 
revert back to its stress-free shape. The researchers 
were able to deform the composite Morthane mate- 
rial by 300%, exerting ~19J to lift 60 g weight more 
than 3 cm with a force of approximately 588 N. 

More recently, Panchapakesan and his col- 
leagues (Lu and Panchapakesan 2006, 2007) 
exploited CNTs to create strain-induced crystal- 
lization for SMPs. Shape memory is the ability of 
the polymeric material to reversibly recover inelas- 
tic strain energy when exposed to specific envi- 
ronmental stimuli. The strain energy is captured 
in the material through a reversible morphology 
change induced by shape deformation or by the 
suppression of molecular relaxation. Molecular 
relaxation occurs in SMPs when the material is 
quenched through the glass transition or crystal- 
lization temperature for the SMP. For most cases, 
the material is able to recover the original shape 
when the temperature is raised above the critical 
thermal transition. 


17.5 PHOTOTHERMAL 
INTERACTION WITH 
MATERIALS 


A number of optical actuators have been designed 
that exploit the ability of light to generate heat and, 
thereby, directly influence the thermal properties 
of surrounding gases, liquids, and solids. A focused 
light source can be used to produce sufficient heat 


energy that can introduce thermo viscous effects 
(Hockaday and Waters 1990), transform a con- 
fined liquid into a pressurized gas (Hale et al. 1988, 
1990; Hockaday and Waters 1990; McKenzie and 
Clark 1992; McKenzie et al. 1995; Mizoguchi et al. 
1992; Tabib-Azar 1998), drive microflows (Shiu 
et al. 2007; Weinert and Braun 2008), initiate tem- 
perature induced phase transformation of solids 
(Okamura et al. 2009; Okamura 2012; Yoshizawa 
et al. 2001), or create photothermal vibrations in 
fiber optic waveguides (Inaba et al. 1995; Jankovic 
et al. 2004; Otani et al. 2001). 

Most engineering applications utilize lasers as 
the light source because the light rays from a CW 
or pulsed lasers travel in the same direction (unidi- 
rectional), are monochromatic (single wavelength), 
and are coherent (all rays are in-phase). When a 
stream of photons from a laser is projected onto a 
solid surface, the interaction may result in illumi- 
nation, light reflection, energy absorption, thermal 
and thermodynamic changes, melting, vapor- 
ization, or plasma effects (Steen 1998; Steen and 
Mazumder 2010). The precise nature of the inter- 
action depends on the optical power of the laser, 
duration of exposure, reflective and absorption 
properties of the target surface, thermal properties 
of the exposed material, and local environmen- 
tal conditions. The absorption properties of the 
target surface determine how efficiently the pho- 
ton energy is transferred into the material. This is 
dependent on the material’s absorption coefficient 
(relates to the amount of optical energy transferred 
per unit depth), reflectivity (if 100% reflective then 
no light is absorbed), and material surface finish 
(smooth or rough). 

The laser light striking a nonmirrored surface 
will produce a measureable localized thermal 
change in the material. Irradiance (E) is a measure 
of the incident laser power per unit area (W/cm? or 
W/m?) and can be given as 


E=_, 
A 


(17.6) 
where P is the power of the laser source in watts 
(W) and A is the area of the beam spot in (m7?). 
The amount of energy delivered to an object sur- 
face is, therefore, a function of the beam diameter 
where a smaller beam diameter produces a higher 
irradiance and, thereby, greater amount of energy 
striking the material. For example, a 2kW laser 
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focused to a 0.2mm diameter beam produces 
an irradiance of 6.310!°W/m2? (Steen 1998). In 
other words, a kW laser with a small beam size 
will vaporize an element in a fraction of a mil- 
lisecond and, therefore, a concentrated (focused) 
light beam can be used as a powerful “heat source” 
in the microworld. 

Applying laser light to the material heating 
process also requires a basic understanding of the 
thermal properties of the target material (Steen 
1998; Steen and Mazumder 2010). Thermally con- 
ductive, insulating, and semiconducting materials 
all behave differently when exposed to a focused, 
highly concentrated light beam. Since no energy 
bandgap exists between the valence and conduc- 
tion bands of a thermally conductive material, a 
large amount of the electrons in the CB can eas- 
ily absorb the photon energy. These CB electrons 
transfer their energy to the material through 
electron-lattice collisions. With very low energy 
from a laser, the photons are easily absorbed by 
conductors and the acquired energy is turned to 
heat. In contrast, the large bandgaps cause the 
insulator materials to have essentially no CB elec- 
trons and, therefore, exhibit no thermal conduc- 
tion. A large energy bandgap in the material will 
require a significant amount of energy from the 
laser for photon absorption. The thermal behav- 
ior of semiconductors is between conductors (e.g., 
metals) and insulators. A very small bandgap 
exists between the conduction and valence bands 
for semiconductor materials and the energy can 
be transferred fairly easily between the CB and the 
lattice structure. However, the amount of energy 
necessary from the laser for photons to be absorbed 
by the material is greater than conductive metals. 

Thermal properties also determine how the 
heat energy delivered by the laser will flow into the 
target material. Important engineering properties 
include thermal conductivity and thermal diffusiv- 
ity. Thermal conductivity describes how fast the 
heat flows through the material whereas thermal 
diffusivity reflects how fast the material will con- 
duct the thermal energy. The heat flow through 
a material depends on the thermal conductiv- 
ity (k) and on the specific heat (c,) of a material. 
Thermal diffusivity (a) tells us how fast materi- 
als will accept/conduct thermal energy and can 
be used to approximate the depth that heat will 
travel per pulse (with time ¢) through the material. 
Materials with a low value for thermal diffusivity, 


such as stainless steel and some nickel alloys, will 
limit the penetration depth into the material. Note 
that the heat flow is dependent on the specific heat 
of a material as it is used to determine the rate of 
change of temperature. 

Laser intensity and pulse duration also influ- 
ence heat penetration in a material. For some 
metals like stainless steel that have low thermal 
diffusivity, a lower-powered laser with a long pulse 
is used. In contrast, for metals such as copper that 
have a high thermal diffusivity, a higher power laser 
with shorter pulses to overcome the losses can be 
used. In addition to conductivity and diffusivity, 
the thermodynamic properties relate to the amount 
of energy required to heat, melt, or vaporize the 
material. This depends upon the target material’s 
density, heat capacity, melting and vaporization 
temperatures, and the latent heats of fusion and 
vaporization (Steen 1998). Heat capacity is a mea- 
sureable physical quantity related to heat, mass, 
and change in temperature. A basic understand- 
ing and appreciation of light material interaction 
is necessary for designing viable optically driven 
microactuators based on the photothermal effects. 


17.5.1 Light-driven diaphragms 


Many indirect optical microactuators take advan- 
tage of the heat generated by the light source 
to create the desired force or pressure to move 
the actuator structure (Hale et al. 1988, 1990; 
Hockaday and Waters 1990; McKenzie and Clark 
1992; McKenzie et al. 1995). When a simple gas is 
heated, it expands according to the ideal gas law 
pG,=nRT, (17.7) 
where p is the gas pressure, G, is the gas volume, 
n is the number of moles, R is the gas constant 
(0.0821 L atm/mol K), and T is the absolute tem- 
perature. Equation 17.7 states that a fluid in a con- 
fined space will expand and increase the pressure 
exerted on the walls of the enclosed structure as 
the temperature increases. The pressure in the 
enclosure may be sufficient to mechanically deform 
one of the walls if it is constructed from a flex- 
ible membrane. As the membrane, or diaphragm, 
expands under pressure, it produces the desired 
deflection, 5, and performs the desired mechanical 
work. The displacement produced by a diaphragm 
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Figure 17.10 An illustration of a simple light-driven micropump composed of five microcells 
(Mizoguchi 1992). Each microcell has a carbon wool absorber sealed in a chamber filled with liquid 
Freon 113 (boiling temperature ~47.5°C). The light delivered through the optic fiber is used to heat 
the absorber and, thereby, raise the temperature of Freon 113. The phase change from liquid to an 
expanding gas forces the diaphragm to deflect upward and push the working fluid that resides in the 
gap between the flexible membrane and rigid glass plate. 


microactuator at the center from its equilibrium 
position, 5, can be described as (Tabib-Azar 1998), 


p 


= 8b ob SELCOO| if js (17.8) 


where L is the length, o, is the residual stress, 
(Y/1—Y) is the biaxial modulus, and b is the thick- 
ness of the diaphragm. The dimensionless param- 
eters a,, a,, and f(y) depend on the geometry of 
the diaphragm. Tabib-Azar (1998) describes a 
square diaphragm given as a,=3.04, a,=1.37, and 
f(y) =1.075—0.292y. These microfabricated flow 
controllers have speeds of 21 ms in air flow and 67 
ms in oil flow, with sensitivities of 304 Pa/mW and 
75 Pa/mW, respectively. 

Mizoguchi et al. (1992) used this same simple 
concept to create a micropump that included an 
array of five closed diaphragm-actuated devices 
called microcells as shown in Figure 17.10. Each 
microcell consisted of a predeflected 800 x 800 pm 
square membrane that was micromachined in 
0.25 mm? of silicon and filled with Freon 113, a liq- 
uid with a boiling point of approximately 47.5°C. 
A carbon wool absorber was placed inside the cell 
to convert the incident light from the optic fiber 
into heat. The microcell exhibited a relatively large 
deflection, approximately 35 tm, when the cell’s 
content was heated and the Freon 113 undergone a 
phase change from liquid to a gas. The fluid being 


transported by the pump is fed into a flow channel 
between the glass plate and deflecting membrane 
using very small harmonic movements. The har- 
monic order of the membrane’s deflection deter- 
mines the fluid flow rate and direction. The small 
quantities of Freon in each cell allowed relatively 
low optical powers to be used to change liquid to 
gas, giving the large membrane deflections needed 
to operate the pump. The microcell was fabricated 
and operated by a laser source with no more than 
10mW. The micropump achieved a head pres- 
sure of approximately 30 mmag and flow rate of 
30 nL/cycle. 


17.5.2 Light-driven microflows 


The miniaturization and integration of various 
analytical laboratory processes on a single plat- 
form is a key design requirement for microfluidic 
and LOC devices used for medicine and envi- 
ronmental monitoring (Dittrich and Manz 2006; 
Eijkel and van den Berg 2005). The reduction in 
physical size of these functional components has 
increased the speed of analysis, lowered operating 
costs due to the consumption of small quantities 
of reagents, and enabled novel system designs that 
avoid biological sample cross-contamination. 

One of the most critical components on the 
microsystem platform is the microfluidic mixer. 
Efficient mixing of chemicals and biological sub- 
stances in order to create the desired reactions is 
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an essential step in preparing samples for analy- 
sis. The typical mechanism for mixing fluids in 
the larger macroworld is to create turbulent flows 
at high Reynold’s numbers. In the microdomain, 
however, mixing liquids becomes more difficult 
because laminar flow dominates. Asa consequence, 
the much slower process of molecular diffusion 
becomes the primary mixing mechanism (Hessel 
et al. 2005; Nguyen and Wu 2005). To improve the 
mixing rate over shorter length microchannels, 
researchers have developed a variety of micro- 
mixer designs (Nguyen and Wereley 2002). Passive 
mixers do not require external power sources and, 
typically, use specially designed microfluidic chan- 
nel microstructures to improve the overall mixing 
rate. T-and Y-micromixers are the simplest passive 
micromixer designs for combining two adjacent 
streams of liquid (Kamholz et al 1999; Kamholz 
and Yagar 2002). Unfortunately, a relatively long 
microchannel is required because the mixing 
mechanism is completely depended on the molec- 
ular diffusion rate. In contrast, active micromixers 
use an external energy source to create distur- 
bances in liquid streams to improve mixing rates. 

Weinert and Braun (2008) developed a light 
pump approach for active mixing that moves liquid 
through LOC devices. The authors demonstrate how 
small volumes of fluid could be optically pumped 
along a predefined path of a moving warm spot that 
was created using an infrared laser. The repetitive 
motion of the laser beam is used to remotely drive 2D 
microflows of liquid with a resolution of 2 1m. The 
experiments produced pumping speeds of 150 jrm/s 
with a maximum 10°C temperature increase at the 
localized light spot. The study also confirmed that 
the fluid motion was the result of the dynamic ther- 
mal expansion in the gradient of liquid viscosity. The 
viscosity of the liquid at the light spot is reduced by 
an increase in local temperature, resulting in a bro- 
ken symmetry between the thermal expansion and 
thermal contraction in the front and wake of the 
spot. Consequently, the fluid is observed to move in 
an opposite direction to the movement of the heated 
spot because of this asymmetric thermal expansion 
at the spot front with respect to the corresponding 
asymmetric thermal contraction in the wake. 

In comparison, pressure-driven microfluid- 
ics will operate at significantly faster flow speeds. 
However, to achieve these flow speeds, it is neces- 
sary to incorporate millimeter-sized interfacial 
connections to the outside of the pumps. The light 


pump approach enables highly localized fluid 
actuation. Since the pump pattern is program- 
mable in real time, it can be adapted to a variety 
of geometric patterns. Interestingly, the light pump 
based on thermoviscous expansion can be used to 
move fluids through an unstructured environment 
(Weinert and Braun 2009; Weinert et al. 2009). In 
this work, an infrared laser was used to melt liquid 
channels into a sheet of ice. Since the entire chan- 
nel was not melted at once, the liquid would rapidly 
refreeze behind the laser spot. The repetition of the 
laser spot motion with high frequencies enabled the 
water to undergo a series of melting and freezing 
cycles, thereby increasing the velocity of the pump- 
ing action. As the warm spot formed by the laser 
was moved through a thin ice sheet, the ice was 
observed to thaw at the front of the spot while simul- 
taneously freezing in the wake. The solid ice bound- 
aries confine the liquid flow such that the movement 
is from the back of the molten spot to the front. 
The pumping action was performed with a repeti- 
tion rate of f=650Hz and a chamber temperature 
of T,=—10°C (Weinert et al. 2009). With densities 
Pwater= 1000kg/m? and @,..=917 kg/m, the pump 
velocity is Vjump=9.5mm/s. Experimentally, the 
authors were able to measure the pump velocity at 
11 mm/s. The length of the molten spot depends on 
the temperature of the ice sheet. At low ice tempera- 
tures, only a short molten spot is formed. At higher 
ice temperatures, the molten spot can reach lengths 
beyond 500 tm with the pump velocity exceeding 
50 mm/s. 

From a design perspective, the light pump 
approach implemented on “ice sheets” as intro- 
duced by Weinert and Braun (2009) does not 
require separate valves to switch between pump 
paths, thereby significantly reducing the hard- 
ware necessary for the LOC devices. Further, the 
approach to fluid transport does not require pre- 
fabricated channel walls to define the fluid motion 
path or permit external pressure control to drive 
flow. In other words, it is possible to locally drive 
fluids without structural changes to the substrate, 
eliminating some of the steps in microfabrication. 


17.5.3 Temperature-induced shape- 
changing metal alloys 


The photothermal effect arising from light striking 
a solid material can also be used generate measure- 
able forces (Okamura et al. 2009; Okamura 2012; 
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Yoshizawa et al. 2001). Shape memory alloys 
(SMAs) such as 50/50 nickel-titanium (Ni-Ti) 
and 50/50 gold-cadmium (Au-Cd) are a group 
of metal alloys that experience a discontinuous 
change in their physical structure near their crys- 
talline phase transformation temperature. The 
dimensional change arising from this phenom- 
enon is significantly greater than the linear vol- 
ume change that occurs under the normal thermal 
expansion. Further, the phase transformation tem- 
perature can be often modified by varying the alloy 
composition. 

The SMA Ni-Ti is commonly used to transform 
thermal energy into mechanical work (Gilbertson 
1993). The mechanism for microactuation is the 
forward and reverse martensite-to-austenite 
phase transformations that occur as Ni-Ti mate- 
rial is heated and cooled. The transformation pro- 
duces a hysteresis effect where the temperature 
at which the material undergoes a phase change 
during heating is different from the temperature 
that causes the same material to return to the 
martensite state during cooling. The alloy can 
be formed into a wire or strip at high tempera- 
tures when it resides in an austenite condition. 
Increasing the temperature applied to a preloaded 
Ni-Ti wire, originally at ambient temperature, 
will cause the material to undergo a phase trans- 
formation and move the position of the attached 
load a distance of approximately 4% of the overall 
wire length. In other words, the small force cre- 
ated during the contraction period can be used to 
perform mechanical work. The reduction in the 
wire length can be recovered by cooling the mate- 
rial back to ambient temperature. The number of 
times the Ni-Ti material can exhibit the shape 
memory effect is dependent upon the amount 
of strain, and consequently, the total distance 
through which the wire is displaced. The amount 
of wire deflection is also a function of the initial 
force applied. Thicker wires will generate greater 
forces but require more heat and longer cooling 
time. For example, a 200 pm Ni-Ti wire produces 
4x the force (~ 5.8 N) than a 100 pm wire but takes 
5.5x as long (~2.2s) to cool down once heating has 
ceased (Gilbertson 1993). 

Although SMA materials exhibit unique and 
useful design characteristics such as large power/ 
weight ratio, small size, cleanness, and silent actu- 
ation, the successful application of the material 
has been limited to a small number of actuation 


devices that produce small linear displacements 
(Okamura et al. 2009; Okamura 2012; Yoshizawa 
et al. 2001). Okamura et al. (2009) investigated 
a number of different pretensioned light-driven 
SMA actuators for grasping and manipulating 
macroscopic-sized objects. One effective design 
was a simple torsion spring with the arm extensions 
formed into a tweezer as shown in Figure 17.11. 
The 50 pm Ni-Ti wire wrapped around the arms 
provided the compression to close the end tip and 
the spring provided the necessary bias force to 
return the SMA back to its original length after 
cooling. In this work, a 1.0 W Argon-ion laser 
was used to heat the actuating wire. However, the 
design challenge is to find an efficient method to 
optically heat a very thin wire. 


17.5.4 Photothermal vibration of 
optical fibers 


Inaba et al. (1995) investigated how the photother- 
mal vibration of the quartz core of an optical fiber 
by laser light can be used to construct a vibration- 
type transducer. The microcantilever beam in this 
design was the quartz core of the fiber and fabri- 
cated by etching the clad layer from the optical 
fiber tip. The resonance frequency is dependent 
upon the physical qualities of the cantilever such 
as size, material density, and Young’s modulus. The 
effect is also partially dependent upon the density 
of the gas or liquid, which surrounds the cantile- 
ver because the resonance shapeness of the beam 
is a function of the viscosity coefficient for the gas 
or liquid. The resonance frequency for the micro- 
cantilever was observed to decrease from 16.69 to 
16.59kHz with an increase in pressure from 1 to 
100 Pa, and a reduction in the resonance sharpness 
with an increase in pressure from 100 Pa to 10kPa 
(Inaba et al. 1995). 

Based on the concept of photothermal vibra- 
tion, Otani et al. (2001) proposed a dynamic optical 
actuator that was driven solely by light. The device 
is a walking miniature robot constructed from 
three optical fibers, which represent legs, attached 
to a base. Each fiber was cut for a bevel and the sur- 
face was painted black so that it could absorb light 
and convert it to heat. The photothermal effect 
occurred in response to a flashed incident beam 
onto one side of the optical fiber leg. The flashing 
light source with a constant cycle time produced a 
stretch vibration on the tip of the fiber that enabled 
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Figure 17.11 A schematic drawing of a light-driven tweezer originally proposed by Okamura et al. 
(2009) for grasping small objects (a). The SMA actuator is based on the crystalline structure transfor- 
mation of the Ni-Ti alloy as it is heated laser light and cooled (b). The torsion spring provides the bias 


force that extends the Ni-Ti wire when cooled. 


it to operate like a flat spring. The authors also 
experimentally demonstrated that the diameter 
of the fiber has a direct influence on the amount 
of deformation. It was observed that a 10mm long 
fiber with a diameter of 250 um would deform by 
30 pm, whereas a 1000 pm diameter fiber of the 
same length would deform by as much as 50 um. 
Furthermore, Otani et al. (2001) studied the effect 
of fiber length on the amount of displacement 
generated. A 1-mm-long fiber with a diameter of 
250 pm was found to deform 10 pm, whereas a 
15mm fiber of the same diameter deformed 90 pm. 

In the microrobot design, the optical fiber 
became bent due to the thermal expansion that 
occurred on the beveled side of the fiber when 
the light was turned on. If the light is turned off, 
it returns to its original shape. The switching fre- 
quency of the 12.1mW HeCd laser (442 nm) was 
4 Hz. The light was delivered to an individual opti- 
cal fiber by a mirror and object lens mounted on 
a moving stage that followed the movement of 
the photothermal actuator. The size of the optical 


actuated walking robot was 3mmx10mm and 
moved 2.3mm at 25 pm per second using a pulsat- 
ing light source. 

A micromanipulator based on the photother- 
mal bending effect experienced by a beveled opti- 
cal fiber was described in a paper by Jankovic et al. 
(2004). The micromanipulator design incorporated 
four fingers, two bendable fibers for actively grasp- 
ing small objects and two stationary fibers to pro- 
vide structural support while holding the object 
as shown in Figure 17.12. Each finger was a 1mm 
diameter acrylic optic fiber with a 25mm beveled 
edge near the tip. The beveled edge as coated with 
a thin layer of black paint where the thickness has 
a measurable impact on the amount of tip deflec- 
tion. A light beam, from a 150 W halogen illumina- 
tor, was directed into the fixed end of the sculpted 
optic fiber causing the tip at the free end to deflect 
by approximately 501m. Several experiments were 
conducted to demonstrate that this simple micro- 
gripper is able to grasp, hold, and release a variety 
of small metal screws and ball bearings. 
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Figure 17.12 A micromanipulator based on the photothermal bending effect of optical fibers as 
described by Jankovic et al. (2004). The device consists of two bendable fingers and two stationary 


fingers. 


17.6 SUMMARY 


The fundamental aspects and unique character- 
istics of light-driven actuators were discussed in 
this chapter. In general, nano- and microscale 
actuator technologies evolved rapidly in the past 
decade because of unprecedented advances in 
innovative materials, new fabrication processes, 
and a multidisciplinary approach to product and 
process design. These optically powered devices 
are typically small remotely activated transduc- 
ers that transform the spectral, intensity, or phase 
properties of light into very small structural dis- 
placements and forces. Radiation pressure and 
optical gradient forces have been used to manip- 
ulate minuscule objects, whereas the spectral 
properties of light have changed the mechanical 
behavior of various stimulus responsive poly- 
mers. Alternatively, optically driven photothermal 
effects have been used to heat liquids and gases in 
an effort to increase the fluid pressure acting on a 
flexible diaphragm. Many of these induced effects 
are not readily observable with the human eye and 


may not appear to be significant in the larger more 
familiar world, but within the micro- and nanodo- 
main of the very very small these light-driven 
mechanisms become viable solutions. 

As predicated by Richard Feynman in his 1959 
lecture to the American Physical Society at Caltech, 
“There’s Plenty of Room at the Bottom” (Feynman 
1960), the world of the very small will play a sig- 
nificant role in future technology innovations. The 
future, he believed, lies in “manipulating and con- 
trolling things on a small scale” (Feynman 1960). 
With respect to light, a tightly focused beam pro- 
vides a highly versatile precision tool for reaching 
out and manipulating tiny cells, or providing a 
feathered touch to rearrange molecules, or creat- 
ing the gentle forces necessary to drive a submicron 
motor. Specific wavelengths of electromagnetic 
radiation can also induce physical changes in the 
shape of preformed photoresponsive polymers and 
gels. The smaller the polymer structure, the faster 
and more pronounced the observed reaction to the 
incident light. These light-induced shape-changing 
materials can be exploited by creative engineers 
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to develop embedded mechanisms for regulating 
microfluidic flow or manipulating solid objects in 
Feynman’s world of the very small. An interesting 
illustration of exploiting the unique characteristics 
of photoresponsive materials is the light-driven 
single-molecule DNA hairpin-structured nanomo- 
tor introduced by Kang et al. (2009). The nanomotor 
incorporates photoisomerizable azobenzene mol- 
ecules that enable the DNA structure to undergo a 
reversible light-controlled switching operation. 
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18.1 INTRODUCTION 


To create great paintings, an artist not only needs 
a magnificent vision but must also know how to 
stretch canvas and mix paint. This chapter dis- 
cusses practical skills for the design and imple- 
mentation of optoelectronic systems. Because art 
has no agreed boundaries, technical depth may 
appear inconsistent, with items perhaps appear- 
ing trivial to one reader being useful insights for 
another and vice versa. A specific optoelectronic 
system is chosen as an example to provide conti- 
nuity while illustrating many techniques. 

Departing from the more formal style of oth- 
ers, this chapter stresses practicalities, examples 
from painfully acquired experience, and methods 
of reducing risk. Ideas both familiar and less com- 
mon are offered to permit the reader, at whatever 
level, to explore beyond the obvious, and to antici- 
pate problems that may otherwise compromise 
success. While earlier chapters give depth and 
essential numerical specificity, practical arts are 
predominantly hints, recipes, and general guide- 
lines. This chapter attempts to offer useful ideas to 
the beginner, while including sufficient subtleties 
to interest those with extensive experience. 

The need for specifications and definitions of 
success (Section 18.2), understanding require- 
ments (Section 18.3), and modeling (Section 18.4) 
apply generally. Practicalities of design, planning, 
and organizational discipline (Section 18.5) are 
important to the success of any complex optical 
and electronic endeavor. Rather than repeating 
quantitative details readily available elsewhere, 
emphasis is given to properties and things that 
go wrong at system design and component levels, 
with some ideas of how to avoid or recover from 
them (Section 18.6). Later sections introduce test- 
ing (Section 18.7), some effects of hostile environ- 
ments (Section 18.8), checklists (Section 18.9), 
aphorisms (Section 18.10), and a further reading 
list selected for its practicality. 


18.2 SPECIFICATIONS 


Optoelectronic programs can be put at risk by insuf- 
ficient definition of the problem, the environment, 
the constraints, and/or the available tools. A domi- 
nant precursor to success in the art of optoelec- 
tronics is therefore to get a comprehensive written 
specification. Introducing our example, we quote as 


Table 18.1 the abbreviated target specification for an 
optical air data system (OADS), intended to replace 
the pneumatic system currently in the control loop 
of intrinsically unstable high-performance aircraft, 
to allow operation closer to the limits of the flight 
envelope than otherwise possible. Outputs are air- 
craft speed and attitude with respect to the ambi- 
ent air, whose temperature and pressure were also 
required but provided by an independent optical 
system. 

The following sections exemplify methods used 
to meet these specifications and how success may 
be defined. Table 18.2 shows a hierarchy of success, 
enhanced by proper specifications, top-down con- 
ceptual design, and control of risk. Available com- 
ponents and circumstances may occasionally force 
bottom-up modifications. 


18.3 REQUIREMENTS 


The art of practical optoelectronics is about com- 
bining components, whose specifications we 
believe are plausible, into instruments and systems 
to solve problems for which we will be rewarded. At 
least four main factors are necessary: motivation, 
specifications, aptitude, and components. First, 
because optoelectronics is a challenging discipline, 
motivation must be sufficient, usually personal sat- 
isfaction or even financial gain. Second, we must 
demand explicit specifications and accept implicit 
boundary conditions, over both of which we have 
only limited control. Third, aptitude enhances the 
ability to create and configure an effective system. 
Frequently appearing as experience, it guides the 
designer and prevents unnecessary innovation. 
Fourth, the choice of components is based upon 
optimization of complex interrelationships of what 
is available, and what quoted claims are trust- 
worthy. Possible consequences of the omission 
of one or more of these four factors are shown in 
Figure 18.1. The practicality of any optoelectronic 
system also depends strongly on the control of risk, 
and the optimization of many complex interac- 
tions, not all of which may be fully understood or 
can be wholly controlled. 

Supplementary to these requirements, but at 
least as important for success, is time, money, anda 
champion. Even with ample resources, things take 
time, but neither time nor money will be enough 
unless someone has made the personal commit- 
ment to succeed. Occasional digressions into areas 
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Table 18.1 Optical air data system specifications 


Performance 

Speed (true air) 15-1500 ms-" 
Pitch (angle of attack) +25 to —10°arc 
Yaw (angle of sideslip) +22.5°arc 
Update rate 40 Hz 


Pressure (altitude) 


Temperature (amb.) —70°C to 50°C 


Environment 
Altitude 

Temperature 
Acceleration 
Ambient light 

Sensed particle radius 


-3 to+12g 


Particle concentration 


Physical 

Throw 1.2m 
Weight <50kg 
Lifetime 100 000h 
Final cost <$100k each 
Power consumption <1kW 


Other 


—500m to 16km 


Ground to 25km 
-70°C to 140°C 


Darkness to sun stare 
0.2m and larger 


+0.1ms7" 

+0.1°arc 

+ 0.1°arc 

64Hz preferred 

+8 m (higher preferred) 
Unspecified (best) 


37 km preferred 

Local ambient 

Peak 

Darkness to 1.5kW m-2 
<0.12m preferred 


<1 m~' to dense cloud 


More preferred 
Less preferred 
MTBF 

>1000 units 

28V + 50% (noisy) 


Performance must be acceptable with naturally occurring particles anywhere in the 


global atmosphere in all weathers. 


The system must be ‘eye-safe' and ‘stealthy’. 


Table 18.2 Hierarchy of success 


Ideal All specifications are met 
Adequate 

modification 
Incomplete 

recovered 
Malfunction 

recovered 
Failure 

went wrong 
Catastrophe 
Disaster 


possibly personnel 


Specifications are met and performance confirmed after acceptable 

Not all specifications are met, but useful information and diagnostic data are 
The system produces no useful scientific results, but diagnostic data are 

No science is recovered and the system diagnostics do not indicate what 


The system destroys itself; we have no idea why 
The system destroys itself, causing collateral damage to the facilities and 


less susceptible to scientific analysis are intended 
to help distinguish what is important from what is 
not, because much risk lies here. 

Earlier chapters of this book give precise 
numerical facts about optical, electronic, and 
mechanical components, subsystems, and behav- 
iors. Many are complex, interactive, and frequently 


counterintuitive, typically more so in optics than 
in electronics. Familiarity with these subtleties is 
necessary, as is studying what others have done, 
and why, and whether these led to success or inge- 
niously redefined failure. 

Almost all measurements and systems for which 
optoelectronics is conventionally used may be 
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High risk 


Motivation 


Delay 


Empty hope 


Specifications Futility 


Success 


Aptitude 


Inaction 


Components 


Figure 18.1 Venn diagram of success factors. 


described as instrumentally assisted observation 
of a phenomenon. Ideally, the phenomenon can be 
controlled, where it essentially becomes part of the 
system. It may however only be probed with neg- 
ligible perturbation, as with the OADS example, 
or merely observed passively as in astronomy. The 
challenge is to create a specified instrument or 
experiment to enhance quantitative knowledge of 
the phenomenon, confirm existing wisdom, and 
perhaps suggest new ideas. 


18.4 MODELING 


Making an optoelectronic system to the custom- 
er’s satisfaction demands that specifications and 
requirements are understood. An initial approach 
may be to construct a mental model of a possible 
system, from concept through design, assembly, 
testing, operation, and interpretation of the output. 
Table 18.3 lists eight major functions of the initial 
thought experiment. At each design level new tasks 
are spun off, although questions are asked and 
answered to permit continuation of the planning 
and design sequence. 


Table 18.3 Functions of a model 


OnN Aun BWhDHY = 


Eliminating or solving problems uncovered 
while addressing these items frequently redirects 
attention. For example, maintaining adequate 
temperature stability for OADS originally seemed 
impossible. Combining air circulation and heat 
pipes offered a possible if inelegant solution per- 
mitting design progress to continue, and although 
this might have been acceptable, the final choice to 
circulate a temperature stabilized liquid through 
a sealed labyrinth held temperatures with 0.5°C, 
maintaining optical alignment and _ electronic 
performance. Eliminating such obstacles as early 
as possible reduces redesigns, which get more dif- 
ficult and expensive the later their necessity is 
accepted. The thinking process implicit in Table 
18.3 depends strongly upon feedback to achieve 
performance satisfactory to the customer and con- 
sistent with available resources. 

Figure 18.2 maps a possible design sequence, 
where arrows represent potential information 
flow upon which changes may be contingent. 
The starting point is the customer’s specification. 
If this appears infeasible or even unreasonably 
demanding, for example by violating the laws of 


. Quantification of the physics and phenomenology 

. Verification of compatibility of specifications and constraints 

. Understanding quantitative interactions between parameters 

. Optimization of architecture, design configuration, and components 

. Prediction of ideal performance possible with the planned design 

. Prediction of performance deterioration when implementation is not ideal 
. Diagnosis of anomalies anticipated during construction, testing, and use 

. Reduction of risk and enhancement of confidence 
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Figure 18.2 Operation of design model. 


physics or similarly extreme constraints, early and 
amicable negotiations can yield great advantages 
in simplicity, cost, and timeliness—less com- 
monly in performance. Quantitative examination 
of the proposed physics and phenomenology © 
may confirm acceptability or demand changes. 
For OADS, the 3-D velocity was calculated from 
the measured transit times of naturally occurring 
particles crossing each of three inclined pairs 


No expected problems 


OK 


® Proceed with 
caution 


of parallel light sheets projected 1.2m from the 
optical unit looking sideways from the aircraft. 
Pressure, altitude, and temperature were obtained 
from simultaneous pulsed measurements of total 
elastic scattering and fluorescence of molecular 
oxygen over a common volume of air outside 
the boundary layer of the vehicle. Simple calcu- 
lations @ with tests for scientific plausibility and 
technical capability can confirm that nothing 
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silly is being planned ©. Plausible and self-con- 
sistent numerical criteria, which for OADS were 
difficult but not infeasible, may then allow a pre- 
liminary © design. The projection of 8mm light 
sheets with a near diffraction limited full width 
to e? intensity of 70m, separated by 11.2mm, 
looked reasonable, but required transit time mea- 
surements accurate to 10-4, challenging but not 
impossible—see below. External and internal 
interfaces with a coherent logical and structural 
architecture (Figure 18.3) iterate to a self-con- 
sistent set of available parts ©, again. Clear and 
unambiguous names for each module help to 
avoid confusion among participants. Defined 
hardware and software modules must be isomor- 
phic, with each other and also with the necessary 
functions and interfaces to allow testing and pos- 
sible remediation. Within this structure an ideal 
performance can be predicted ©. Independent 
knowledge, measurements, and/or testing allow 
consideration of effects of ageing, deterioration 
during use, or even failure of parts or subsystems 
to anticipate likely real performance ©. In critical 
applications where single point failure cannot be 
tolerated, initial choices for system architecture 
may require adaptation. Expanding the rigor of 
preliminary thoughts and testing at this level may 
show detrimental problems ©, requiring more 
serious remediation. For OADS, the pressure 
and temperature system (PATS) technology was 


ALTA window 


changed from the originally planned differential 
absorption to Schumann-Runge fluorescence of 
O, because of installation constraints. However, 
tedious and expensive preliminary modeling 
might appear, it is incomparably cheaper than 
later fixes, even should these prove possible ©. 
Design errors are better corrected on paper or the 
more modern electronic equivalent documents 
than in hardware components or computer code. 
Three outcomes from this exercise may be (1) no 
obvious problems, (2) problems with reasonable 
remedies, or (3) problems requiring negotiation 
with, or at least reporting to, the customer. Such 
a sequence of increasingly demanding analytical 
processes for OADS identified five major risks: 
(1) Would there be enough particles everywhere 
in the atmosphere? (2) Is the velocity measure- 
ment geometry acceptable? (3) Can the system 
be calibrated? (4) Is everything stable enough? 
(5) Would PATS work? In the event enough par- 
ticles were found everywhere flown, the velocity 
measurements were sufficiently accurate—more 
so than systems used to calibrate them, stability 
was just acceptable, and PATS performance was 
marginal. Ironically the major risk was not tech- 
nical, and the program, of which OADS was but 
a tiny fraction, was cancelled by the customer’s 
customer for wholly unconnected reasons. 
However, rather than insisting that everything 
be modeled it is often more sensible to evaluate 
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Figure 18.3 OADS system components. 
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where effort is most wisely spent in terms of the 
overall requirements of the customer. Run-away 
costs, or late delivery, can be just as serious as an 
inadequate instrument. 


18.5 DESIGN 


Design, like art, is a creative process involving 
flashes of insight. However, the first practical step in 
building a cathedral is pounding rocks into a ditch, 
and in this sense the preparation of foundations is 
an important, ifa less apparently rewarding, precur- 
sor of more detailed design work. Using appropriate 
tools for design planning amply repays the effort; 
examples follow. A more colloquial style is adopted 
here to lighten the overwhelming complexity that 
can be engendered by rigorously performing the 
following tasks for which a fairly logical sequence is 
established below, and followed later in Section 18.6. 

A top-down design methodology helps to assure 
that specifications can and will be met. Important 
aspects of the system, although initially existing only 
in imagination, can be solidified into pathways along 
which the physical construction can later proceed. 
Nine ideas introduced in Table 18.4 are expanded in 
the following paragraphs. General program manage- 
ment and design activities must often be augmented 
for specific systems by special software packages for 
optical design, signal analysis optimization, com- 
puter aided design (CAD), and finite element analy- 
sis (FEA) for mechanical and thermal optimization. 


18.5.1 Riding-the-ray 


Riding-the-ray tracks information-bearing pho- 
tons, later transformed into electrons, through 
successive components. Examining the effects 
of each component, including the phenomenon, 


Table 18.4 Tools and purposes 


. Riding-the-ray 

. Assembly methods 
. Calibration 

. Operation 

. Risk reduction 

. Signal estimation 

. Information retrieval 
. Interfaces 


SO ON A OO BRWHDY = 


. Management 


from generation of photons to the recovery of 
the numerical and graphical representations of 
the final measurement, can explore the proper- 
ties of many devices in the system. This approach 
is particularly useful for examining the effects of 
aberrations, polarization, spatial and temporal 
coherence, optical power budget, pointing stabil- 
ity, alignment criteria, and various other com- 
ponent-related properties. Where there is more 
than one optical path, each may be analyzed 
independently or in various combinations. This 
approach is the essence of ray tracing, estimating 
aberrations and ghosts. It may also show unan- 
ticipated effects. 


18.5.2 Assembly methods 


In addition to describing the structure, design 
must specify how the system may be constructed, 
how it will be aligned, and the alignment main- 
tained. For example, mechanical support may 
be (1) fixed by design and manufacture, (2) ini- 
tially adjustable over a specified range and then 
locked, or (3) dynamically compensated during 
operation. In mass production, where adjust- 
ments are neither necessary nor permitted, 
functionality may be compromised to accommo- 
date manufacturing tolerances. A more versatile 
design avoids such performance reduction at the 
expense of increased complexity. The extreme is 
to have everything adjustable, with ranges and 
sensitivities sufficient to accommodate almost 
any manufacturing imperfection. This can be a 
dangerous and unwise choice. A better solution 
optimizes reasonably tight manufacturing toler- 
ances with a few well-chosen adjustments, whose 
ranges are best limited to the smallest neces- 
sary. Each adjustment should be independent 


Analyze all optical components 

Provide adequately sensitive adjustments and criteria 
Build trust in the planned measurements 

Use and experience the virtual system 

Consider the unknowns, and the unknown unknowns 
Predict signal, noise, accuracy, precision, and errors 
Simulate signal processing and information recovery 
Estimate power, environment, and communication needs 
Track status, changes, costs, and consequences 
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Table 18.5 Adjustments and alignments 


Available adjustments 


Provide independent control for each variable 


Arrange suitable sensitivity of adjustment 


Assure that every adjustment may be rigidly locked 


Alignment criteria 


Agree a definition of acceptable alignment 


Specify a quantitative criterion for its achievement 


Completion 


Confirm that the alignment criterion is finally met 


Mark the final position and document all activities and observations 


and appropriately sensitive. After meeting a clear 
criterion of acceptability, each should be locked, 
and not be susceptible to residual creep. If such 
stability is impossible, perhaps because of chang- 
ing external conditions, then adjustments may 
be driven by a closed-loop feedback system con- 
trolled by a prespecified algorithm to an estab- 
lished alignment or performance criterion. The 
most likely candidate for real time control is tem- 
perature stabilization, but other adverse effects 
may be similarly mitigated. 

Table 18.5 summarizes guidelines for mount- 
ing and aligning the chosen components, with 
specific techniques given later (Section 18.6.3). For 
any particular system, some line items may appear 
either obvious or too detailed. It is important to 
know which, and to assign effort intentionally, 
rather than by crisis intervention. 

Every intentional, or even accidental, vari- 
able should be reviewed in the light of Table 18.5. 
Listing all variables allows the definition of accept- 
ability criteria for each alignment. Early attention 
to surprises that often emerge from this exercise 
greatly enhances assurance of success. 


18.5.3 Calibration 


Although calibration is often necessary to assure 
the desired accuracy of measurements, it may be 
simplified by careful manufacture, by ingenious 
choice of parameters, or by specifying measure- 
ments of more stable output quantities, such as 
time and frequency, rather than intensity or volt- 
age. If possible, the reported measurement should 
be independent of quantities that are difficult to 
measure or guarantee constant, such as luminance, 
wave front flatness, stray light, noise level, and oth- 
ers. Table 18.6 suggests a sequence of informal 
questions to which proper answers can improve 
confidence. 

Calibration implies that the system is both stable 
and well characterized and that its initial behavior is 
understood and close to what is intended. Residual 
disparities are assigned numerical values to correct 
or compensate measurements, possibly even while 
the instrument is operating. Implicit in calibration 
is the planning of methods to test the system for 
efficacy, performance, reliability, constancy, and 
other properties discussed in Section 18.7. 


Table 18.6 Calibration and characterization 


Do the numerical results mean what they seem to mean? 


What precision can be expected, or achieved? 


What accuracy can be expected, or achieved? 


What evidence links the current results to previously known results? 


Can errors be bounded using comparisons of known and measured data? 


Do the measurements depend exclusively on the phenomenon of interest? 


If not, what other effects must be allowed for? 


Can these effects be measured and the data corrected? 


If not, can the errors be otherwise bounded or some compensation performed? 
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18.5.4 Operation 


Anticipating what instructions may be necessary 
and what responses are expected from an operator 
is desirable. A proper balance between automatic 
operation and user intervention can accommodate 
unforeseen situations, without making routine 
tasks so complex that they invite errors. Routine 
operation should be automatic, perhaps run by 
prewritten scripts, requiring user intervention 
only if something unexpected arises. If results or 
monitored quantities differ from expectations, the 
operator can request additional diagnostic displays, 
preempta script in favor of a new one, make changes 
to existing commands, or take over progressively 
more complete control of the system. This hierarchy 
of increasingly operator-intensive actions must be 
fairly robust to operator errors induced by fatigue 
or stress caused, for example, by something unex- 
pected. Even if the operator must assume complete 
control, with increased risk of mistakes, online error 
checking can prevent incompatible instructions. 


18.5.5 Risk reduction 


Table 18.7 lists five main areas of risk, although 
optoelectronics carries special types of risk with 
each element potentially leading to further ques- 
tions. Experience helps identify critical areas, and 
often a barely remembered thought about some 
device or approach can intimate something that 
has perhaps been neglected, at future peril. 

One great risk may be not staying aware of what 
is wanted, perhaps by misunderstanding the cus- 
tomer’s original desire or intention, or even a change 
of priorities during design and/or development. 
Sharing risk with the customer by frequent timely 
exchanges of current status information, thoughts, 
feelings, concerns, intuitions, and expectations 


Table 18.7 Risk areas 


Customer 
Resources 
Technical 


enhances the probability of a mutually successful 
outcome. Rarely, will a customer become impatient 
with such attention, but neglect of personal relation- 
ships and communication is a sure way to fail. Of 
particular hazard in optoelectronics is suboptimiza- 
tion, achieving flawless performance in areas with 
which one is most familiar, while less well under- 
stood areas receive less, or perhaps even inadequate, 
attention. The following paragraphs address areas of 
risk not always stressed in optoelectronics texts, but 
critical nevertheless. 

Ideal and real component behaviors may differ. 
Buying or borrowing critical parts for early testing 
increases confidence. Design reviews are vital and 
once a design is accepted, changes should be evo- 
lutionary rather than revolutionary, with ripple- 
through effects reduced by the early definition of 
modularity and interfaces. 

Testing to limit functional risk can only be 
complete when physical hardware exists, too late 
for conceptual and design changes without great 
inconvenience. However, components, software 
and hardware modules, and supporting structures 
can be independently tested as soon as they are 
available. To be effective, testing requires a clear 
definition of what is expected, a valid criterion for 
its manifestation, and quantitative documentation 
of how and to what extent this is observed. 

Robustness to external conditions, misuse, and 
misunderstanding is necessary once control of the 
system passes to a user who may typically have dif- 
ferent attitudes and capabilities from those of the 
designer. The term foolproofis probably optimistic— 
fools are more creative than can be imagined. 

Complexity increases the technical risk of not 
performing as wished, or of doing inadvertently 
something neither expected nor understood. 
Having fewer parts increases the chance of success. 
The least trustworthy components are complex or 


Are there enough timely, frequent, and honest exchanges? 
Are planned schedule, funding, personnel, and expertise available? 
Is the physics right to show the desired effects? 


Will the experiment design allow proper measurements? 


Is the system implementation acceptable in all areas? 


Implementation 


Can what is designed be built? 


Can adequacy be confirmed by well-specified testing? 


Deployment 


Will it stay that way during all conditions of use? 
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nonstandard parts for which little prior experience 
exists. Using common commercially available parts 
where possible, even with some slight performance 
compromise, is often recommended. Although this 
might apply particularly to optics and/or mounts, 
commercial mounting assemblies for optics may be 
intended for environments different from the cur- 
rent need, and some commitment to modification, 
such as adding locking screws, must be accepted. 
An almost trivial example of simplification is the 
specification of screw sizes. As far as practicable, 
the screws should be the same type and size; that 
way the ability to find the right wrench is much 
enhanced, especially if you buy many of that size. 
The ideal of toolless assembly and maintenance is 
rarely achievable in optoelectronics. 

In optics, common sense is often corrected by 
experience, and optical designs that seem accept- 
able on paper, or even in the laboratory, can fail in 
use in various subtle ways. Time and effort devoted 
to anticipating failure modes is better directed to 
the more likely ones, identified seemingly only by 
painfully acquired experience. Here, design for 
graceful degradation can permit remedial action 
before complete failure. Time invested with sup- 
pliers before committing to final design, parts, or 
assembly methods is never wasted. 

Although durability is not always a direct con- 
cern, many optoelectronic systems are so diverse 
and complex that it is well to build them as 
though they must last forever, or at least for much 
longer than the specification demands. Three 
approaches may be useful: (1) design the system 
to tolerate all anticipated conditions, (2) preserve 
the system from unplanned conditions that could 
cause deterioration, and (3) arrange continuous 
or frequent monitoring to assure that no relevant 
parameter is, or has been, outside its permissible 
range. 

An optoelectronic system should be robust 
to any change in conditions within its specifica- 
tion. This has sometimes been called hardening, 
which is commonly used with three meanings, 
the robustness of (1) the design, (2) the resultant 
instrument, and (3) the ability to operate without 
serious deterioration in radiation environments. 
Even allegedly benign conditions can have haz- 
ards. From experience, the author once expected 
to lose one or two digital camera charge coupled 
device (CCD) pixels to radiation damage on every 
long-haul commercial airline flight until the 


technology of substituting less vulnerable com- 
plementary metal-oxide-semiconductor (CMOS) 
sensors became almost universally adopted. 
Although rarely in the specifications, a robust 
design not only establishes initial operation, but 
also accommodates inevitable later changes. 
Design quality is enhanced by having experienced 
people review it for potential weaknesses before 
commitment to procurement and manufacture. 
Reviewers are best selected from those who are 
skeptical that it can be made to work at all, and are 
looking for justification of their prejudice. Heeding 
their observations and improving the objects 
of their concern improves the robustness of the 
design to many unquantifiable properties of real 
life. Four aspects merit review: (1) general concept, 
(2) interface and layout drawings, (3) design frozen 
prior to commitment to hardware, and (4) soft- 
ware responsible for the operational experience of 
the user. Once testing is begun, the only permit- 
ted design changes should be those demanded by 
problems uncovered by or during that testing. 


18.5.6 Signal estimation 


Quantitative estimates of expected system proper- 
ties can be obtained by representing each compo- 
nent by a cell on a spreadsheet, entering a plausible 
estimate of the power or signal properties at that 
point, from source to detection, allowing for all 
the plausible or possible effects. Following columns 
contain best-case and worst-case approximations 
that may raise confidence, suggest remedial action, 
or demand redesign. This technique can identify 
potential problems arising from many sources, 
from stray light to amplifiers or subsequent pro- 
cessing. Commonly, modern optoelectronic sys- 
tems make at least two sequential transitions: 
optical radiation to analog electrical signals, and 
digitization of the analog signal. Unless the system 
is based upon individual photon detections that 
are in some sense already digital, converting from 
analog to digital transmission as close to the detec- 
tor as practicable minimizes further increase of 
noise from parasitic and transfer effects. 


18.5.6.1 OPTICAL NOISE SOURCES 


However good, detectors and electronics can never 
recover properties of the optical signal lost or 
diluted by unintended effects. Examples vary from 
a mismatched aperture to windows that deteriorate 
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or get dirty, but can be improved by care and atten- 
tion to component choice, environment, and oper- 
ating methods. Stray light from various internal 
and/or external sources can dilute the optical sig- 
nal. Most commercially available lens and optical 
design programs have the facility to analyze ghosts, 
which although reducible by multilayer antireflec- 
tion coatings, may still be detrimental in laser- 
based systems. With high power pulsed lasers even 
ghost reflections can focus enough of the beam to 
cause ionization along the optical path or perma- 
nent damage if the focus is near or within a com- 
ponent. Cleanliness is always essential. Although 
superpolishing is sometimes recommended, it is 
expensive and parts require special handling. 

Internal flare can be improved by machining 
fine annular grooves with a matt black coating on 
all enclosing barrels. Conventional optical stops 
may be augmented by a few well-placed discs 
with internal knife-edges to reduce stray light. 
Despite potential external attractiveness, shiny 
or polished surfaces should be avoided internally, 
substituting surface finishes such as black anod- 
izing for matt aluminum, and various other oxi- 
dation processes for materials such as brass or 
steel. Black plastic shrouds also help reduce reflec- 
tions and flare, and enclosing the light path often 
suppresses convection, an occasional source of 
image degradation where density gradients occur. 
Carbon-based matt black paints can be useful and 
cobalt oxide pigments survive and stay black at 
high temperatures. 

The advent of lasers as illumination sources has 
introduced the phenomenon of cavity feedback, 
which is less rare than its cause might suggest. 
Even a tiny percentage of light fed back coherently 
into the laser cavity can destabilize cavity gain and 
hence the output intensity or mode shape. Even a 
minutely fluctuating feedback from optical com- 
ponents can be significantly amplified in the reso- 
nator to become a serious instability. Sensitivity to 
mechanical or thermal instabilities becomes more 
serious as the instrument is better aligned, or con- 
tains surfaces conjugate with the laser output face. 
For example, when launching the laser beam into 
a single-mode fiber, the best match of numeri- 
cal aperture, focus, and centering is also the most 
likely condition for laser destabilization. Tilting the 
input face sometime helps, but wedging is better. 
The opposite end of the single-mode fiber is also 
conjugate, and if the fiber is shorter than the laser 


coherence length, or a near multiple of the beat 
length between close modes if more than one is 
stimulated, then this also will make the laser flicker. 


18.5.6.2 DETECTION AND NOISE 


Currently the only directly detectable property of 
an electromagnetic field at optical frequencies is 
its intensity, although techniques such as coherent 
mixing can visualize phase, and spectral character- 
istics can be derived via various dispersive or non- 
dispersive optical processes. Square-law detectors 
measure light intensity in one of three regimes: (1) 
low intensity, where quantum properties prevail, 
(2) high light levels, with classically continuous 
intensity levels, and (3) a “gray” area between, 
where observed behavior depends on the power 
level and detector type. Noise effects differ signifi- 
cantly between regimes, easily estimated by noting 
that a green photon has energy of about 4e~°J, giv- 
ing about 2.5e!* photons sin a 1 W beam. 
Detectors capable of resolving individual pho- 
tons often show defects such as after-pulsing, 
fluctuating sensitivity, and photoelectron pulse 
pile-up, which can distort the statistics upon which 
information depends. If these effects are reduced 
to be negligible, the detector output at sufficiently 
low light levels, and/or high bandwidths, consists 
of a series of amplified photoelectron pulses. Even 
if many photons are missed because of reduced 
quantum efficiency, the statistics can remain an 
unbiased estimate of incident intensity, usually as 
modifications to the Poisson distributed arrival 
times from an unmodulated coherent source. 
Noise sources in the detector itself, the following 
amplifier(s), and resistive loads, usually become 
significant only in the classical regime. Certain 
types of detector, such as photomultipliers (PMTs) 
and avalanche photodiodes (APDs) in the Geiger 
mode, have enough internal gain to produce a 
discriminable electrical pulse from each released 
photoelectron. The statistical fluctuation, com- 
monly called shot noise, which increases as the 
square root of the optical signal power, is an effect 
of quantization of the optical field, and thus is not 
really noise at all. For detectors of lesser capability 
internal noise forces approach to the gray area. 
For sufficiently high light levels, particularly for 
restricted bandwidth systems, the dominant noise 
in a uniform level signal is the shot noise, typically 
approaching Gaussian statistics, and although the 
signal-to-noise ratio (SNR) improves, the actual 
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noise increases with the square root of the power 
and the system bandwidth. 

The optical power incident upon a detector 
is proportional to the square of the electric field, 
whereas photon noise typically depends on the 
square root of optical power or the electric field 
itself. In later circuitry, the electrical power is typi- 
cally described by the product of the current gen- 
erated by the incident light and the voltage across 
a load, becoming the fourth power of the electric 
field. A transimpedance amplifier, presenting a 
high input impedance to the detector with a low 
impedance to following devices, can provide wide- 
bandwidth high-gain linear amplification with- 
out seriously worsening the noise inherent in the 
signal. 

After optical detection, electronic condition- 
ing using pulse-height or time discriminators 
as photon counting circuits, or box-car analyz- 
ers and lock-in amplifiers to average repetitive 
signals, can extract a signal seemingly buried in 
noise. Further improvements may be possible 
using auxiliary timing or other information that 
is either independently known or can be derived 
from the signal. Techniques such as photon cor- 
relation can make good use of the photon arrival 
statistics, using information a light beam carries 
in the intervals between successive detected pho- 
tons. For OADS, this permitted accurate speed 
measurements from correlated pairs of pulses as 
particles crossed two parallel sheets, whether the 
pulses were from individual or clusters of scat- 
tered photons, from particles of <0.2 1m radius up 
to snowflakes. 

Many real experiments fall between photon 
resolved and classical, and measurements are 
required where noise is a combination of statisti- 
cal effects, whose relative importance depends 
on properties of the optics and detectors, as well 
as the phenomenon of interest. In this regime it is 
essential to consider all sources of noise, the rela- 
tive importance of each, and their effect on the 
accuracy of the measurements. Simple calculations 
supplementing answers to carefully worded ques- 
tions to device manufacturers improve confidence. 

For single-photon detection, the signal is, in 
some sense, already digital, allowing various 
ingenious time dependent pulse processing tech- 
niques to examine the information encoded as 
variable intervals between one-bit level changes. 
Processing methods available here can be complex, 


as intimated by the information in the further read- 
ing list. Where detection yields an analog signal, 
transformation to a digital form is preferable before 
further signal manipulation. An analog-to-digital 
converter (ADC) produces a sequence of multi-bit 
words, immediately exploitable by digital proces- 
sors. Transformation of an optical measurement to 
a usable digital signal usually demands low-noise 
linear amplification, DC offset control, and suffi- 
cient resolution for acceptable quantization noise, 
arising from the number of discrete levels, 256 with 
8 bits, 4096 with 12 bits. Only for an extremely 
noise-free input or a large dynamic range is a 16-bit 
ADC or higher occasionally justified. Where suit- 
able averaging is possible, quantization noise can be 
reduced by adding known noise to provide random 
jitter between discrete levels, a technique originally 
used in RADAR. Most ADCs tend to be power 
hungry, and careful choice is necessary for applica- 
tions where power is scarce, such as from batteries 
or in spacecraft. Noise specifications in 2-D arrays, 
such as CCDs or addressable CMOS arrays, can be 
less optimistic than at first apparent. Cooling can 
reduce the readout noise that dominates for CCDs, 
but excessive cooling can reduce sensitivity. All 
relevant properties of any detector must be well 
understood before committing to its use. 


18.5.7 Information retrieval 


Two important questions are “What is to be 
inferred from the measured optical signal?” and 
“How is the inference to be drawn?” Relating the 
value of the quantity to be measured to the detected 
light is not always obvious, as the subject and study 
of inverse problems attests. 

Care is taken to optimize the signal in terms of 
intrinsic or additive noise, nonlinearity, overrang- 
ing, etc. It is convenient to distinguish ‘signal and 
data processing’ from ‘information retrieval’. The 
former is typically intended to manipulate, rear- 
range, and/or often selectively reduce raw numeri- 
cal data. The latter, is weighted toward extracting 
the values of the required quantities, and their 
boundaries, accuracy, precision, constraints, and 
even significance. Its final aspects are to present 
these values to the user in an easily understood 
form, and to archive all pertinent information and 
provenance in a format that may be accessed for as 
long as it may be required. 
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18.5.7.1 SIGNAL AND DATA PROCESSING 


Signal and data processing may include precursor 
analog manipulation, and the later rearrangement of 
one or more rich 1-D raw data streams into a more 
useful form. Whether from a single-point detector, 
or serially accessed pixelated sensor array, the data 
stream must typically be processed to emphasize the 
required measurement over irrelevant factors. Figure 
18.4 is a symbolic diagram of the Weiner-Khintchine 
relationship linking the conservative Fourier trans- 
form (horizontal) with the selective discard of data 
(vertically down). Although both routes from upper 
left to lower right are mathematically identical, prac- 
tically the route via correlation can be preferable 
because unwanted data are discarded sooner. 
Conservative transformation may make data 
more intelligible, whereas selective discard of the 
irrelevant may improve recovery of useful informa- 
tion. For the illustrative example of OADS, the analog 
APD pulse from each particle transit was processed 
by automatic gain control and dynamically matched 
filtering. Successive subsamples of each pulse were 
digitized and buffered by a first-in-first-out (FIFO) 
memory with conditional exclusion of digital sam- 
ples not part of any useful pulse, obviating the need 
to process information-free noise between pulses. 
The FIFO was drained and processed to produce an 
amplitude and interpolated time for each probable 
particle transit, as a discriminated event. Infeasible 
sequences were removed by a conditional editor, 
again reducing processing load. A table of transit 
time pair probabilities, analogous to a correlogram, 
was accumulated in random access memory (RAM). 
Particles not crossing both sheets contribute a noise 
floor, while useful transits give a clear peak, whose 
centroid is parabolically interpolated to better than 
the 5 ns resolution available from the time tagging of 
pulses. This sequence of processes, combined with 
the parallel sheet geometry and its control, achieved 


the 1 part in 104 accuracy demanded here, but not 
usually accepted as possible with conventional opti- 
cal anemometry. Accuracy was also aided by the 
absence of residual turbulence since the instrument 
was moved through relatively undisturbed air in 
which are embedded near stationary particles. 

Several processes must often be performed 
sequentially, and choices between real-time and 
off-line processing can be the subtle. If either the 
data input rate or the processing overhead vary sig- 
nificantly, then asynchronous interfacing with local 
buffering or even storage can be appropriate (Section 
18.6.6). OADS used the FIFO to tolerate wildly fluc- 
tuating input pulse rates, dual port RAM to accumu- 
late correlograms, and conditional peak finding so 
that interrogation by the aircraft bus always accessed 
the most current measurement. Unless the mea- 
surements are required in real time, or with as little 
latency as possible, more accurate or comprehensive 
information may often be recovered by complete 
recording of the raw data streams, with sufficient dig- 
itization rate and bit depth, for later off-line process- 
ing, perhaps by several different techniques or even 
including information from independent sources. 

Estimating computing resources predicts bot- 
tlenecks to be alleviated by appropriate hardware. 
Machine architecture is tailored to provide sequen- 
tial, pipelined, or parallel task execution, and assign- 
ment of the various synchronous or asynchronous 
tasks. However, the cost, risk, and suitability of avail- 
able software can force hardware choices. 

Many systems rapidly generate enormous num- 
bers of data as one or more signal streams too rich 
to be recorded in real time as suggested above. 
Examples are a laser-based velocity sensor applied 
to a wind tunnel, an industrial flame, a ship’s wake 
at sea, transonic flow, or inside an aeroengine. 
While waiting for hardware performance capa- 
bilities to catch up with the ever more demanding 
ideal of raw data archiving, techniques such as cor- 
relation can compress the incoming event stream, 
selectively discarding only that which is less inter- 
esting, in this case, relative phase information. 
Where tens of thousands of correlograms may be 
acquired, often at great expense, a balance must be 
established between archiving all that could later be 
necessary and a reasonable storage expense, in time, 
money, or hardware resources. As a minimum, data 
as raw as possible, together with all system health 
and status monitors, should be stored and backed 
up as comprehensively as can be arranged. 
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Built and flown between 1989 and 1993, OADS 
acquired six incoming data streams each at 20 
Mbytes s! to be processed by various dynamic and 
conditional discriminators to give three simultane- 
ous channels, each of forty 32-bit correlograms per 
second with sub nanosecond delay time discrimi- 
nation, for each typically 2-h flight costing several 
hundred thousand US$ (1993). This system used all 
of the techniques outlined above, and indeed many 
more. Data processing choices of how to display 
updates to the experimenter for real-time inspec- 
tion and system control during research and devel- 
opment were also extensively different from those 
necessary for a final deployable system. 


18.5.7.2 INFORMATION RETRIEVAL 


Although distinguishing between data and informa- 
tion may seem merely philosophical, it can help to 
establish hardware requirements, and both usability 
and intelligibility of the reported numbers. Although 
some systems are passive, perhaps even with data 
only available after the experiment, others may yield 
a real-time indication permitting user intervention, 
or possible redirection of the experiment. 

By information retrieval is implied the recovery 
of the measurement in a format for archiving, with 
experiment provenance and monitors, and which 
is also immediately accessible to the observer. This 
implies that the presentation is meaningful to a 
human, important when real-time control or inter- 
vention may be necessary, or where proper experi- 
ment progress must be monitored or confirmed. 


18.5.7.3 SOFTWARE 


For any instrument or system involving optoelec- 
tronic disciplines, an organizational methodology 
is necessary to make it all play together. Functional 
and structural design must be augmented by a 
software specification as early and as complete as 
possible. Without such discipline, a monster is cre- 
ated, a hostage to fortune, later to move out of con- 
trol into cost overruns, chaos, and disillusionment. 

Typically with optoelectronic instrumenta- 
tion, signal and data processing and informa- 
tion retrieval will require software, as will the 
user interface, with its control, monitoring, and 
archiving needs. Software must be modular, test- 
able, and appropriate for the planned hardware, 
although more usually nowadays the hardware is 
chosen on the basis of computing capabilities and 


resources demanded by software, which becomes 
more expensive and less flexible than hardware. 


18.5.8 Interfaces 


One critical area in any optoelectronic activity is 
the establishment and comprehensive definition 
of external and internal interfaces. Each interface 
must be compatible with both the elements that 
it is intended to connect. Isomorphic mapping of 
software and hardware modules (Section 18.4 and 
Figure 18.3) introduced the need for ideas expanded 
in Table 18.8, without which testing and diagnos- 
tic analysis are made more difficult. An initial top- 
down design process is attractive, although it may 
often be necessary to accommodate bottom-up 
changes driven by available components, modules, 
subsystems or blocks of reusable software or code. 

Accurate definition of all interfaces is impera- 
tive. Even, or perhaps especially, when other crite- 
ria may superficially appear to be more important 
in the establishment of system architecture, it is 
essential to resolve priorities to create consistent 
structure. Interface definitions start with a con- 
nectivity map of the proposed system, includ- 
ing its interaction with the outside world (e.g., 
Figure 18.3), proceed via required functionality, 
ultimately to details of actual wires or numeri- 
cal quantities to be exchanged, and the formats to 
which they are constrained. 

The importance of these documents centers 
upon their practical use. Interface drawings are 
best as portable files in a format that can be read 
by everyone interested, and printed at least once 
as hard copy on a paper that will accept bold red 
marks, for peer or final review. The existence of 
interface diagrams as working documents, spread 
upon a conference table or a videoconference 
screen for examination and critical discussion, 
can greatly enhance the effectiveness of the final 
system, instrument, measurement, or scientific 
understanding obtained from its use. 

Commitment of definitions of interfaces and 
connectivity to a formal document improves con- 
fidence that the system can perform its intended 
task, often suggesting simplifications and more 
logical ways of doing things, but more importantly 
defining modular boundaries. The specification 
of every module allows each part of the system 
to be tested independently of others, an ability 
whose importance cannot be stressed too highly. 
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Table 18.8 Interface maps 


1. Operator interface 


Command, control, access, reports, displays, archiving 


2. Modularity 


Block diagram and naming of functions, hardware, software structures 


3. Information flow 


Control, data, health and status monitoring, communication, formats, timing 


protocols, archive, retrieval 
4. Power and wiring 


Input power availability, conditioning, connections 


Internal power supplies: average and peak voltages and currents, acceptable noise, 


cross-modulation limits 
Wires: type, gauge, insulation, color 


Connectors: type, gender, shells, cable harnesses; runs, and makeup 


5. Optical train 


Power, wavelength, spectrum and bandwidth, polarization, acceptable aberrations, 


spatial and temporal coherence, noise, sources of stray light, ghosts 


6. Physical structure 


Mechanical: hardware systems and mounting structure, detail drawing index, 


assembly methods and sequence, available adjustments, alignment criteria 


Thermal: sources, sinks, pathways, effects, stability 


A second advantage is that critical components may 
be tested as soon as they become available, without 
waiting for everything else to arrive, not only for 
functionality, which is mostly reasonable for com- 
mercially purchased parts, but more importantly 
for fitness and agreement with expectations. 
Within the larger context of the initially specified 
environment of the system, Table 18.8 lists six gen- 
eral types of interface or definable structure. During 
design, each must be described down to a level suf- 
ficiently detailed to prevent inevitable later changes 
from rippling through the system with unantici- 
pated effects. A hierarchy of mapped and docu- 
mented interfaces reduces later problems, although 
some interference is to be expected between the 
tabulated items, expanded somewhat below. 


18.5.8.1 OPERATOR INTERFACE 


The highest level, and in many cases the only one 
visible to the operator, includes the appearance 
of the system, the user friendliness of the avail- 
able commands, the methods of reporting and 
displaying the required measurements, either 
in real time or recalled from archive, the confi- 
dence given by suitable values of health and status 
monitors, and a method of deciding what to do if 


something unexpected occurs. Despite the obvi- 
ousness of this, many user interfaces could be bet- 
ter designed. Past examples were once exemplified 
by the arcane programming of video recorders. 
Even the continuing quest to make user inter- 
faces more intuitive, especially as devices have an 
increasingly extended functionality, is threatened 
by lack of standardization and a distressing need 
to learn a new “operating system” for every new 
device, once the prerogative of the engineering 
world but now as universal as telephones, refrig- 
erators, transport and entertainment. Typical 
optoelectronic instruments are complicated and 
are required to make difficult measurements. As 
far as possible the operator should be protected 
from this complexity, but nevertheless have 
immediate access to more detailed information 
should anything unexpected occur. Successful 
operation should need minimal or preferably no 
intervention, presenting measurements succinctly 
and intelligibly, with a comforting status indica- 
tion that everything is within its design range. If 
more control is desired, then it is available imme- 
diately on interrupt, which need not pause data 
acquisition, but only make more information or 
control available to the operator. This includes the 
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ability to request more detailed monitoring of any 
function, to modify the existing control scripts, 
to recall archived data for comparison, and above 
all to have sufficient feedback to know what is 
going on either as a result of operator instruc- 
tion or unanticipated factors. It can be expensive 
and embarrassing to have an experiment yield a 
record that is either unintelligible or has uncer- 
tain provenance. 


18.5.8.2 MODULARITY 


Most purchased optoelectronic parts or subsys- 
tems already have well defined interfaces. Even if 
they are not always the most convenient, it may 
be better to accommodate something that already 
exists. This can save time and reduce risk, but 
conversely much design effort may be consumed 
exploiting standard components and interfaces in 
a system being designed to satisfy unique needs. 

The system overview document should be a 
connected block diagram where each block is 
labeled by function, hardware name, and software 
requirement, as an expansion of Figure 18.3. On a 
yet more detailed view of this same diagram will be 
labeled the types of interconnectivity representing 
module control, information flow, health and sta- 
tus monitors, and supplied power. Even at the sys- 
tem level not all blocks need have all connections, 
and as the same discipline is applied to lower lev- 
els of modularity down to individual components, 
the connections per component will become fewer, 
simpler, and more specific. At the lowest levels of 
the functional hierarchy, many components will 
be passive. For example a lens, even though passive 
and appearing first in item 5 in Table 18.8, must be 
included in item 6, so that a later drawing may be 
created to specify its physical mounting structure 
and alignment requirements. 

At each level the specifications of interfaces with 
the real world environment are again reviewed, 
understood, confirmed, and fully documented. 


18.5.8.3 INFORMATION FLOW 


Interface drawings for control, information flow, 
and health and status monitors are powerful tools. 
The thought devoted to creating these diagrams 
becomes the design work necessary to assure the 
complete and proper functionality of the final sys- 
tem. This is also a valuable method of checking 
internal consistency, and raising concerns that may 


even require customer choices. The optical part of 
this can specify everything about the radiation, 
the optical train, and optical components. For the 
electronic parts, except for standard communica- 
tion interfaces, the actual wires that carry different 
types of information should be clearly identified to 
avoid confusion. Three separable overlays of con- 
nectivity are useful, (1) the command and control 
information passed to modules and components, 
(2) the health and status information returned 
from modules and components, and (3) signals, 
which also include the light, pertaining to the 
desired parametric measurements. In addition to 
their separate functions, the first two types allow 
closed loop control where this is desirable. 

Active components have command and control 
capabilities with responses and properties moni- 
tored as often as necessary. Optical signals typi- 
cally end at the detector, transforming to electrical 
signals, and thence to numerical signals for inter- 
pretation and archive, with inspection possible at 
various intermediate positions. Wherever possible, 
information exchange should be via standard- 
ized interfaces of which many are available to be 
selected according to the application. The five stan- 
dardized communication layers of physical, data 
link, network, transport, and application should 
be understood, with special care necessary for real- 
time instruments. 


18.5.8.4 POWER AND WIRING 


Power supplies deteriorate or fail in various ways, 
devolving from improper specification or perfor- 
mance all the way to poor mounting or inadequate 
cooling. A connector can be the wrong gender, 
sometimes the wrong connector, occasionally even 
with the wrong number of pins, and the actual 
voltages delivered in service may be anybody’s 
guess. Careful specification of input power avail- 
ability, instrument requirements, and bounding 
conditions can mitigate such mishaps. Auxiliary 
aspects must not be taken for granted—for exam- 
ple the occasional need for a trickle current to 
sustain settings—and are better discovered early 
to prevent the interconnect wiring from declining 
from a traceable color coded minimal set into a 
rat’s nest of post-assembly fixes. 

Voltages and currents to various parts of the 
system may be quantified on the power interface 
diagram. Conditioning the supplied voltage and 
minimizing noise is the main function of a power 


18.5 Design 663 


supply, of which there will frequently be more than 
one. Each must accommodate maximum currents 
under all operational combinations and must be 
stable, with adequate freedom from noise, either 
intrinsic or introduced by switching or operation 
of some other part of the system. Where several 
power supplies are involved, they may interfere 
with each other to introduce mysterious noise 
properties whose spurious  cross-modulation 
signals can masquerade as legitimate measured 
effects until properly diagnosed. Problems trace- 
able to power supplies are not rare (Section 18.6.7). 

Specification of wire gauge, type, insulation, 
color, and especially connectors should be consis- 
tent throughout the system, but will probably be 
compromised in purchased subassemblies. Heat- 
shrink insulation of each termination may be desir- 
able, but does prevent electrical probing. Choosing 
the appropriate gauge, type, and insulation is obvi- 
ous, and specifying an appropriately consistent 
color coding is extremely useful. The author once 
had the opportunity to diagnose a cross-modulation 
problem within a customer’s power supply where 
well over 230 wires supplied more than 54 different 
subassemblies from seven different types of alleg- 
edly independent power supply connected in par- 
allel to the same aircraft bus. The compact wiring 
looms were tightly laced. Every wire was unlabeled, 
16-gauge, Teflon coated, and white. 

Specifying connectors at the top level of mod- 
ularity, subsequently percolating down to every 
component for which electrical connectivity is 
required, as early as possible, prevents much pain 
later. Many connectors have long delivery times, 
and early definition of the type and gender of con- 
nectors, and the length and construction of cable 
harnesses, is valuable. 


18.5.8.5 OPTICAL TRAIN 


The experiment in Section 18.5.1 implicitly creates 
the interface diagram for all optical paths, and this 
evolves into formal documentation of power bud- 
get, wavelength spectrum and bandwidth, polar- 
ization, spatial and temporal coherence, noise, 
acceptable aberrations, sources of stray light, 
ghosts, and many other properties. In fact these 
may not all appear explicitly on the interface dia- 
gram, but are included here since a rethinking of 
these properties in the context of interfaces and 
compatibilities almost always identifies previously 
unconsidered items. 


While poorly understood or excessive aber- 
rations can seriously compromise performance, 
effort can be wasted reducing aberrations more 
than necessary. Monochromatic wave front or 
ray aberrations can be generally represented by 
a polynomial expression of increasing order in 
aperture, field angle and azimuth and their prod- 
ucts. The five third-order Seidel aberrations not 
correctable by a focal shift are primary spherical 
aberration, coma, astigmatism, field curvature, 
and distortion. Occasionally higher orders must 
be considered. Chromatic aberrations must also be 
considered in multiwavelength systems. Unless the 
system demands special devices with exotic opti- 
mizations, good-quality components optimized 
for general use should be examined for suitability. 

With many modern components, tolerancing is 
of more interest for performance prediction than 
correction, although as with ghost analyses it merits 
sufficient consideration to justify its subsequent dis- 
missal. Where optical or physical path length is at a 
premium, substantial gains may be realized by fold- 
ing the path. Although this may also improve rigid- 
ity and general robustness, there may be hidden 
penalties in design costs, performance, and flare. 

Because in recent years, smaller, brighter, and 
more coherent optical sources have become com- 
monplace, calculations based upon Gaussian 
beam behavior are often necessary, and the appro- 
priate diagram and formulae are repeated here. 
Figure 18.5 and the inset box show the relation- 
ships of the beam radius and wave front radius of 
curvature at a distance z from a Gaussian source of 
e~ intensity radius w,, with the asymptotic approx- 
imations in the far field. For a converging lens of 
focal length fat distance z, from the effective waist 
of the source, the remaining two equations give the 
aberration-free waist position and radius. 
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Figure 18.5 Gaussian beam parameters. 
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Beam radius w, at distance z is 


Axial position of waist w, 
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18.5.8.6 PHYSICAL STRUCTURE 


Mechanical layout drawings are necessary for 
optical component mounting, later detailed by 
a draughtsman for a machinist to make. Mostly 
now these processes may be automated by CAD 
systems and optical design software easing the 
more tedious tasks of the designer and allowing 
semiautomated extensions to thermo-physical 
behavior, mechanical rigidity, strength, align- 
ment retention and many other necessary features 
of any optical system. Beyond these capabilities 
and specifically for optoelectronic systems they 
can quantify ranges of necessary adjustments, 
confirmation of adjustment orthogonality, with 
appropriate locking techniques, establish a ther- 
mal interface map, including sources and sinks, 
effects of conduction, convection, and radiation 
on the heat budget and temperature distribu- 
tion. Thermal effects can compromise success, 
and show different behavior in service from that 
expected during design or noted during assem- 
bly, testing, calibration, and evaluation. In the 


absence of gravity, for example, convection is 
suppressed, and under different operating condi- 
tions very different thermal distributions can be 
expected, perhaps changing calibration or caus- 
ing irreversible effects. Gradual drift from ther- 
mal cycling is common but not always suspected 
in its early stages. Beyond these automated capa- 
bilities lies the necessity to apply personal expe- 
rience to assure that the design has not included 
some ridiculous feature as a result of an unsus- 
pected error or omission. If experience suggest 
that “it does not feel right, then it probably is 
not,” whatever the beautifully produced auto- 
mated design might suggest. 

Occasionally an optoelectronic system is so 
complex that a FEA of the mechanical structure 
is a good investment, permitting mechanical 
and thermal loads to be simulated throughout 
any environmental range. Although design pro- 
grams are increasingly able to interface with 
each other, the inclusion of all aspects necessary 
to guarantee a successful system remains far 
from trivial. Occasionally, creating a supposedly 
effective design can consume too many of the 
available resources, and may be better assured by 
modularization, testing and frequent empirical 
closure. 


18.5.9 Management and 
documentation 


An individual scientist or engineer responsible 
for creating a simple optoelectronic device can 
occasionally track everything well enough to 
retain control using only anotebook and memory. 
However, this is rarely sufficient. The notebook 
is essential, and will likely evolve into a design 
document in electronic format. Four things are 
emphasized: that documentation should (1) be 
subject to configuration control from the first 
keystroke, (2) be backed up frequently and at 
more than one site, (3) be in a portable format 
that can be read by everyone who needs it, and (4) 
not become unreadable with time, either because 
it is stored on obsolete media, or because the 
generating software is no longer available. Few 
can now read 5% in. floppy disks, once a useful 
standard. 3% in. disks rapidly followed them, 
and CDs and DVDs became as quaint as vinyl 
records. Whatever medium is chosen for archive 


18.5 Design 665 


one can guarantee that it will have a finite life, 
sometimes quite short. Even in comprehensive 
corporate networks, programs used to create 
and store information, particularly drawings, 
become obsolete surprisingly quickly, and trans- 
lators disappear. Fortunately, server farms and 
the distributed “cloud” claim to overcome these 
problems, but be suspiciously cautious for at least 
two reasons, (1) there is so much growth of stored 
information that it may be impossible to find the 
source, and (2) decoding the information may 
become infeasible. 

Documentation is key to success, whether by 
allowing design reviews to preempt problems, 
or by providing the necessary audit trail to diag- 
nose and fix them. Table 18.9 suggests documents 
that support the success of the program and the 
measurements. 

The minimum program plan is that which, in 
addition to its technical merit, will satisfy the 
customer about cost and schedule. Full-service 
project management software is usual for general 
programs, but for optoelectronic activities and 
systems the match is rarely perfect, representing 
overkill in some areas and under-specification in 
others. A simple spreadsheet can be a valuable 
tool for keeping track of optoelectronic system 
development. For example, the first column can 
contain the nonorthogonal set of all components, 
activities, and concerns, followed by exemplary 
column headings such as specifications, source, 
supplier, address, universal resource locator 
(URL), phone, name of personal contact, cost, 
order date, promised delivery date, status, actual 
delivery date, tests to be performed, test results, 
assembly status, problems, comments, contin- 
gency plans, second sources, alternatives, mys- 
teries, and more as design and implementation 
progress. More still will be added during build- 
ing, testing and evaluation. This spreadsheet is 


Table 18.9 Documentation 


Program plan 
Configuration control 
Interface specifications 
File backup 

File portability 

Design document 


a convenient place to keep current information, 
which might otherwise be spread among many 
people, documents, mailrooms, desks, file draw- 
ers, memories, and wastebaskets. 

Configuration control assures that informa- 
tion is current. Editing an obsolete file, repeating 
a task already completed, misplacing an unla- 
beled lens, or machining a part from a drawing 
that was changed in review but not sent to the 
machine shop, wastes resources that may not 
easily be replaced. Of particular concern is that 
purchased parts are often not exactly what the 
catalogue offered, and ripple-through effects can 
only be minimized by good configuration con- 
trol. Everyone involved needs the most current 
information. 

Comprehensive and powerful documents are 
wasted unless people have access to them, can read 
them, can understand them, and will act upon 
them. File portability and file backups are mostly 
handled by modern computer networks, but a poor 
choice of commercial software or unfamiliar mod- 
eling program can inhibit dissemination of, and 
access to, necessary information. 

The primary responsibility for a design docu- 
ment is that of the technical champion who has 
assumed personal responsibility for satisfying the 
customer. Although in covering all levels from 
overview to the smallest detail, it can seem diffi- 
cult to create and maintain, the power in compre- 
hensive understanding of the final system justifies 
the investment. A convenient sequence of chapters 
might be (1) introduction and physical principles, 
(2) specifications, (3) system architecture, (4) 
interfaces, (5) performance modeling and predic- 
tion, (6) module function and physical descrip- 
tions (several chapters), (7) software, (8) testing 
and calibration, (9) power supplies, (10) table of all 
relevant and known numerical values, (11) method 
of contacting suppliers or other expert sources, 


Cost and schedule plan, tracking of all activities 

What documentation is current, and where to find it 
Modularity, information flow, monitoring, power 

All documents immune to loss from single-point failure 
All documents readable by all, now and in the future 
A single source for all current data 
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(12) reasons for design choices, and why alterna- 
tives were rejected, (13) log of evolutionary changes, 
(14) unresolved mysteries, and (15) several specific 
technical appendices. If this document is brought 
into existence at the beginning of the optoelec- 
tronic project and maintained merely by ordered 
accretion of all pertinent information, then it will 
prove invaluable. Abstracted sections can become 
the final report, comprehensive provenance for a 
scientific publication, the operator’s handbook, the 
service manual, the diagnostic methodology, the 
platform for evolutionary improvements, and a 
delight to the customer. 


18.6 COMPONENTS 


Planning and design prepare the way for the choice 
and acquisition of components. In optoelectron- 
ics, the design process naturally iterates with what 
is available, and the loop between design and 
component choice is always tight. Components 
improve almost daily and innovations can some- 
times solve problems considered challenging at 
the beginning. This is often magnified by a steep 
learning curve, where the uncertainty of initial 
unfamiliarity gives way to confidence as knowl- 
edge increases. 

While Section 18.5 is too brief to include all 
design concerns, this section addressing physical 
components must be even less comprehensive and 
current. Deterioration of relevance with time is 
exacerbated because quantities, qualities, nature, 
and even functional principles of the compo- 
nents available to the scientist or engineer practic- 
ing optoelectronics evolve almost daily. To avoid 
becoming even more outdated than is unfortu- 
nately inevitable, rather than current device speci- 
fications, this section emphasizes questions and 
classes of properties that may perhaps continue to 
be relevant for the future. 

Component choices are often guided by famil- 
iarity, cost, and/or common sense, any of which 
can be disastrous. Typically in any optoelectronics- 
based activity the cost of the components, however 
high it may appear, is largely irrelevant compared 
with the total program cost. Buy the best and cry 
only once. In optoelectronics, and often optics 
in general, the common sense approach is often 
derailed by the intrinsically counterintuitive nature 
of the subject. Even after many years of familiarity, 
what should be a naturally obvious improvement 


can often makes things worse. Conversely a less 
familiar component may seem attractive, if expen- 
sive. Rather than basing a choice on the manufac- 
turer’s recommendation, a better technique is to 
talk with someone who has used that component, 
ask the vendor for a demonstration, or borrow one 
and try it. Of course in some cases final choices 
may depend upon what we already have, or can get, 
or think we understand. Pragmatic examination of 
every aspect, reaffirming the validity of even well- 
formalized beliefs, carries high dividends. Check 
every assumption; check it again. 


18.6.1 Light sources 


In a passive experiment, the phenomenon provides 
photons from which information is to be extracted. 
In active experiments using light to probe or inten- 
tionally stimulate a phenomenon of interest, the 
light source must have known, constant, and well- 
understood properties. The performance of the 
source may depend on properties of its final envi- 
ronment not necessarily suspected when the equip- 
ment is characterized in the laboratory. Where 
many types of light source are available, the sim- 
plest device satisfying the requirements is usually 
the best. Lasers have high intrinsic luminance, a 
desirable property for most applications, but where 
coherence is either not required or even undesir- 
able, a superluminescent diode may be better. Light 
emitting diodes (LEDs), cheap and available over 
a wide range of wavelengths, may be adequate. 
Increasing source power alone improves the signal 
less often than might be wished, and accidental 
modification of the probed phenomenon by a poor 
choice of light source is not impossible. 


18.6.1.1 TYPE OF SOURCE 


The light source must have sufficient power and 
be stable under all likely conditions. Its properties 
must be matched to the system. Spatial coherence, 
implying the capability for good optical beam con- 
trol, is usually desirable, whereas temporal coher- 
ence, essential for applications such as holography 
and interferometry, in others can be a serious dis- 
advantage, giving rise to unwanted fringes or signal 
fluctuations unrelated to the measurement. Often 
not all the desirable properties may be available 
from a specific light source, and a trade-off may be 
necessary between, power, wavelength, coherence, 
polarization, modes, and their distribution and 
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stabilities. The availability of a commercial product 
may even dominate the trade-off. Occasions where 
a thermal source is best are becoming less common 
with the increasing availability of many other ways 
to generate light, from innovative discharge lamps 
such as microwave excited sodium, through room 
temperature solid-state sources whose properties 
may be well specified over an increasingly wide 
range, to ingeniously contrived quantum-well 
emitters, and many more available both now and 
in the unpredictable future. Although much signif- 
icant optical knowledge was gained with sunlight 
or wax candles, new principles of light production 
and photonic materials continue to drive the opti- 
cal arts at an ever-increasing pace. 

For OADS each of six thin sheets of light was 
imaged from a 1W GaAlAs laser at 812 nm with 
20 adjacent 11m X3,m emitters on the same die, 
with weak coupling to give constant output phase 
with a longitudinal intensity uniformity better 
than 20%. This was in the 1980s—the world moves 
on at an ever increasing pace. Keeping up is dif- 
ficult but essential. 


18.6.1.2 WAVELENGTH 


Optimum wavelength is often an early consider- 
ation. The choice is rarely trivial since both the phe- 
nomenon and the optical system can force complex 
trade-offs. The spreadsheet technique introduced 
earlier to predict signals and noise levels can also 
be used to optimize wavelength, or indeed, any 
aspect of the system, by setting up a figure-of-merit 
involving all the parameters considered important 
and exploring the effects of various values, avail- 
able components, and plausible changes. 


18.6.1.3 POWER AND STABILITY 


Since optical power must be both sufficient and 
stable, it is highly desirable to either monitor the 
intensity (Section 18.7.1) or better still use the 
monitor signal to stabilize the power with a suit- 
able closed-loop controller. Despite manufactur- 
ers claims, it is essential to characterize the source 
under the conditions of anticipated use, including 
ambient temperature and changes, power supply 
modulations, drifts and transients, mechanical 
vibrations and shocks, contamination, and age- 
ing of construction materials. Although visual 
confirmation of spatial properties of the source, 
enhanced by special techniques if it is outside the 
visible spectrum, is often adequate, it is sometimes 


desirable to quantify the spatial distribution and 
transverse mode structure by a beam profiler or 
by image capture and analysis, requiring an addi- 
tional 2-D sensor. If direction of polarization or 
longitudinal mode structure, particularly their 
effects on coherence, for example, can change the 
measurement, then these too must be monitored 
and/or controlled. 

Beam-pointing stability is particularly impor- 
tant in optical systems with high magnification or 
with strict directional requirements. A sufficient 
change of laser beam pointing direction can seri- 
ously compromise launching into a fiber, spatial 
uniformity of illumination, position of a measure- 
ment volume, or receiver alignment. Experimental 
quantification of effects increases confidence. 
Diode lasers typically must be temperature stabi- 
lized to retain power, wavelength, mode structure 
and pointing stability, with residual stabilization 
of polarization purity and orientation particularly 
important when launching into single-mode and/ 
or polarization-maintaining fibers. The OADS 
lasers exhibited a negative wavelength dependence 
on temperature of about 0.3nm °C7! and all six 
were kept near the center of the 3.5nm full-width 
half-maximum (FWHM) narrowband filter, also 
temperature stabilized. Because each laser was 
slightly different the base temperature of each was 
individually controlled differently to give the same 
wavelength. A better technique would have been to 
use six slightly different wavelengths and six sepa- 
rate filters to reduce signals from particles crossing 
other channels. The potential advantage was con- 
sidered but outweighed by the necessary increase 
of size and physical complexity. 


18.6.2 Optics 


The term “optics” conventionally describes the 
components whereby light is transmitted between 
source and detector via the phenomenon of inter- 
est. Wavelength is conserved in the most common 
cases, but inelastic processes such as fluorescence, 
or other nonlinear effects, change wavelength in 
complex ways. Optical components include lenses, 
beam splitters, polarizers, fiber-optics, graded- 
index devices, coatings, stops, filters, attenuators, 
acousto-optic, Faraday or Kerr effect devices, 
windows, transparent adhesives, holographic ele- 
ments, photonic devices and structures, and many 
others. The choice from so many components 
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should always be the minimum set of the simplest 
parts that will meet the specification. Avoid mov- 
ing parts; avoid unusual or nonstandard parts; 
where possible, avoid parts. While the usual goal 
is to transmit as much light as possible that carries 
useful information, the reduction of flare can be 
as or more important, because excessive flare can 
mask desired effects and also consume bandwidth 
or processing resources. The optical properties 
of the system are crucial, for once information is 
lost it can never be recovered. Even if it is merely 
swamped recovery may be impracticable. 


18.6.2.1 LENSES AND OTHER 
COMPONENTS 


Competing criteria affect the choice of optical 
components. These include maximum trans- 
mission at the wavelength of interest, control of 
aberrations, implying not only reduction but also 
exploitation such as apodization, surface place- 
ment and curvature, and treatments designed to 
reduce unwanted scattering and reflections such as 
ghosts and flare. However good a standard design, 
it must be evaluated for each application, and any 
sufficiently innovative requirement may demand 
special consideration. A particular example of this 
is to think that zoom lenses, whose compromises 
seem adequate for photography, might be appli- 
cable to optoelectronic instruments. Despite the 
tempting cost and apparent versatility the results 
are often disappointing and sometimes disastrous, 
because of the complexity and unsatisfactory com- 
promises introduced to solve a problem not present 
in the system of interest. 

Sometimes an unconventional component may 
promise, or even deliver, great rewards. However, 
excess attention to the improvement may hide 
shortcomings not apparent with a more conven- 
tional approach. All that may be said about lenses 
applies to flatware such as mirrors and prisms, or 
a variety of more complex and unusual compo- 
nents such as diffractive or holographic optics, 
nonimaging concentrators, or other less common 
devices and techniques that may yet solve a spe- 
cific problem effectively. Smaller components have 
better mechanical integrity and are less prone to 
distortion with mounting and conditions of use, 
but they can be more expensive and difficult to 
handle, and are more prone to impaired proper- 
ties close to the edges. Even with well-made optics 
it is good to have the component at least 10%-20% 


bigger than the active dimension. For Gaussian 
beams an operational aperture radius of at least 
twice the e* intensity contour of the beam is 
highly recommended to avoid intensity profile or 
wave front structure changes. For conventional 
optical systems the presence of implicit apertures 
forced by component clear dimensions or edge 
imperfections may introduce effects in addition 
to, and different from, the intentionally designed 
stops. Vignetting, intensity roll-off with radius, 
can impair imaging systems. Attempt at com- 
pensation in software by radial gain scaling can 
reduce the usable dynamic range of the sensor and 
increase the effective noise. 


18.6.2.2 STRAY LIGHT 


Reducing stray light is a black art. For monostatic 
systems, where common optics illuminate the phe- 
nomenon and receive the returned light, control of 
stray light is always a challenge. Finding all sources 
of flare demands a full design understanding of 
the optical geometry and component properties, 
which can still yield surprises. For bistatic systems 
the problem is alleviated by physical separation of 
illuminating and receiving optical trains, but even 
here high multiple-order reflections and scattering 
in any common parts, such as windows, can con- 
tribute unwanted scattered light comparable to the 
often extremely low signal from which information 
must be extracted. 

Stray light from sources external to the instru- 
ment may often be reduced by baffles, optical laby- 
rinths, stops, and/or filters. Spectral control using 
narrowband or sharp-edged filters usually requires 
acollimated beam, even for suchas the temperature- 
tunable and no-longer fashionable Christiansen 
filter, where a cell contains transparent liquid and 
powder with different dispersion curves intersect- 
ing at the transmission wavelength. OADS used a 
temperature-tuned dielectric stack in a well col- 
limated beam. If necessary, spatial and spectral 
methods may be augmented by temporal methods, 
such as a modulated source with phase-sensitive 
detection. This can vary from simple ac coupling 
with a tuned or lock-in amplifier, with phase-locked 
or frequency-locked loop capabilities, to driving 
the source with a pseudorandom code generator 
and correlating it with the detector output. Where 
flare does not have the same time characteristics as 
the signal, many orders of magnitude of rejection 
become possible, and in the last case range-gating 
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is an additional bonus. Modern solid-state sources, 
for whose physical properties a nanosecond is a 
long time, especially permit these techniques and 
the associated improvement of SNR. 

Internal flare is often more difficult to suppress, 
but is amenable to the usual techniques of high- 
quality components, properly chosen and applied 
coatings, cleanliness, stops, and the choice of sur- 
face shapes. A tunnel diagram can help understand 
systems with reflective components, where the 
optical system is unfolded about each reflection, 
and all possible stray light paths become straight 
lines. This is good for minimizing ghost reflections 
and finding where best to put stops and baffles, and 
quantitatively augments the powerful technique of 
actually looking down the system from the detec- 
tor end. This visual assessment of the system often 
finds sources of stray light, particularly in systems 
designed for visible wavelengths, where the dark- 
adapted human eye is almost as sensitive as typi- 
cal detectors. Even looking through systems not 
designed for the visible wavelengths can still warn 
of unexpected consequences. 

Ray-tracing programs extend the tunnel dia- 
gram concept, allowing for surface curvatures dis- 
torting the tunnel. Almost all offer ghost analyses, 
and in many cases help to identify sources of stray 
light and allow its reduction by proper choice and 
placement of stops, for example. Optimization is 
less obvious than software designers might sug- 
gest and there is no substitute for thorough optical 
knowledge and experience. Using optical design 
programs without having a realistic expectation 
for what the result should look like can be danger- 
ous. It is easy to mis-set a single variable, and pro- 
duce a design that may be a poor optimization or 
even nonfunctional. One aspect of the art is that if 
something ‘feels’ wrong, then it probably is. For the 
150mm diameter receiver common to all six chan- 
nels of OADS, quite elaborate spatial and spectral 
restriction was essential to discriminate against 
direct sunlight. 


18.6.2.3 COATINGS 


Coatings are applied to most optical components 
to minimize surface reflection, to confer spectral 
selectivity, or even to protect. Using multiple lay- 
ers improves efficiency and antireflection effective- 
ness but can increase scatter. The trade-off depends 
upon a calculated figure-of-merit for the chosen 


system. Well-chosen coatings can increase the 
nongeometrical aspects of light gathering power, 
reduce ghosts and increase SNR, particularly in 
monostatic systems. Conversely, they can increase 
scatter, reduce SNR(,) and/or be sensitive to angle. 
Optical coatings are sometimes more fragile than 
the base material, preventing cleaning, but con- 
versely can sometimes be chemically or physically 
protective. Less commonly considered coatings, 
for example, electrically heated indium tin oxide, 
decrease transmission but may prevent worse 
damage from condensation, as with a cool window 
in a condensing environment. Where surfaces are 
superpolished, coatings require extra quality con- 
trol and handling to avoid worsening the expen- 
sively achieved improvement. 


18.6.2.4 WINDOWS 


Preservation of optical access with clean, undam- 
aged, birefringence-free windows is often man- 
datory, but is rarely trivial. Windows and optical 
access in general can be ruined by local melting, 
stress cracking, surface crazing, ablation, thermo- 
phoretic deposition, condensation damage, defects 
induced by the phenomenon, improper mounting, 
or even by physical accidents. Occasional manual 
or automatic cleaning may be essential, as may 
cooling to avoid deterioration, or heating to avoid 
condensation. Table 18.10 suggests some factors of 
interest when specifying optical access, but other 
techniques are sometimes necessary, for example a 
high-pressure gas curtain or liquid film may some- 
times be superior to a solid material as a window 
to protect the instrument from the phenomenon 
or its environment. Sacrificial membranes can be 
occasionally or continually replaced, such as those 
to prevent spalling damage in laser machining. 
Transmission losses arise both from bulk properties 
and from surface effects. Temperature dependence 
of bulk absorption effects, stress birefringence or 
mechanical variability are determined exclusively 
by the material and specified accordingly. Surface 
properties including adsorption are more variable 
and less well controlled. Good material, polish- 
ing, coating, and the assurance of their constancy 
are the best ways to reduce attenuation, scatter- 
ing, polarization effects, and changes with time. 
Superpolishing under liquid reduces scattering 
from surface defects, but later coating or improper 
handling may compromise this. A trade-off must be 
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Table 18.10 Window properties 


Material 
Physical 
Optical 
Coatings 
Mechanical 
Thermal 
Stability 


Transmissivity, suitability, availability, preparation, cost 

Strength, hardness, damage resistance, distortion 

Refractive index, homogeneity, aberrations, surface finish, birefringence 
Effectiveness, durability, scatter, cleaning capability 

Thickness, width, shape, stress, strain, retention, sealing 

Expansion, conductivity, ablation, cooling, heating, embrittlement 
Change of state, etching, chemical or biological attack, crazing, ageing 


made between scatter, reflectance, and vulnerability 
to damage, as it is often impractical to achieve the 


best of all three simultaneously. 

Transmission through one or more windows 
must not unacceptably change intensity, wave 
front curvature, polarization, wavelength distri- 
bution, nor introduce scattering or detrimental 
reflections during operation. Note, for example, 
that in an uncollimated beam even a perfectly flat 
plate has aberrations. 

If requirements are either mutually exclusive or 
demand an expensive material, then double win- 
dows with an intervening balancing medium some- 
times offer a cheaper and more practical solution. 
For example, simultaneous exposure to extreme 
pressure and temperature is possible using a thin 
magnesium oxide window exposed to the heat, fol- 
lowed by a cooled gas at high pressure, and a sub- 
sequent thick float glass window. Edge polishing 
and annealing reduce mechanical stress-raising 
weak points. Support and sealing are best distrib- 
uted over one or more faces with a suitable material 
that retains its flexibility. Elastic materials are good 
for repeated cycling over temperatures tolerated by 
the material. Annealed copper is good for one time 
sealing, and reasonable temperature cycling. For 
higher temperatures and cycling ranges, internally 
pressurized stainless steel bellows work well, as do 
thermally compensated materials. For many appli- 
cations an adhesive with slight residual elasticity 
is good, but a rigid ledge may be better to support 
mechanical load, provided that stress is acceptably 
distributed. Thermal as well as physical shock can 
break materials. 

More expensive and exotic materials are not 
necessarily better. For example, specific condi- 
tions in a flame chamber test rig broke all its sap- 
phire windows immediately. Short of spares, time 
and money I had a local spectacle lens maker cut 
replacement windows from a piece of float glass. 


Surprisingly, a simple set of these cheap windows 
survived through the several month test program. 


18.6.2.5 OPTICAL FIBERS 


Optical fibers are commonly used both to transmit 
information, and to sense various physical proper- 
ties as a technique for measurement. Because these 
capabilities are mutually exclusive, optical fibers 
are less easy to optimize for any given applica- 
tion than might be expected. Most investment has 
been funded by telecommunication requirements 
in the 1.3-1.5,1m wavelength region. Although 
this knowledge and experience has wide utility in 
optoelectronics, the properties of interest, either 
required or accidentally exhibited, are not always 
well controlled nor characterized, since they are 
not necessarily those of most concern to the man- 
ufacturers. Great care is essential in examining 
the properties affecting the specific application. 
Table 18.11 lists a few questions best answered 
before commitment to a specific optical fiber. 

Optical fibers essentially transmit intensity 
alone, but extremely complex effects can alter the 
relationship between input and output so that the 
fibers act like intricate optical systems, behaving 
both as integrated optics, and also capable of appli- 
cations, such as amplification, formerly occupied 
by electronics. The existence of photonic band- 
gaps and special optical properties, derived from 
physical structure on the order of the wavelength 
or even much less, has an increasingly wide field of 
potential applications. 

Transmission of information is possible in many 
ways, including simple intensity, polarization, 
wavelength or frequency division multiplexing, 
solitons, and individual photons. Two applications 
of fibers are common, as a sensor, requiring that 
the output shall depend uniquely on the sensed 
quantity, and as a channel, requiring that the out- 
put shall indicate only the required properties of 
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Table 18.11 Optical fiber questions 


Is single- or multimode fiber necessary? 


Which of many fiber types is optimal, or even adequate? 


What materials are appropriate and available? 


Are transmission losses acceptable? And will they remain so? 


Does transmitted or external hard radiation cause deterioration? 


Should the fiber maintain, select, or control polarization? How? 


How is injection mode-matching to be established and maintained? 


Is the fiber sheath opaque to ambient light? Does it matter? 


Must cladding modes be suppressed? If so, how? 


Can properties of the fiber introduce signal fluctuations? 


Does high light intensity induce nonlinear effects or cause damage? 


Will the fiber be properly sensitive to the measured properties? 


Will the fiber be properly insensitive to ambient properties? 


What adhesives are usable or necessary for construction? 


How shall terminations be arranged, aligned, and controlled? 


How do vibration, bending stress, temperature, and ageing affect performance? 


the input. In the second case, filtering using prior 
knowledge can improve the signal. For example, a 
single-mode fiber of sufficient length to attenuate 
cladding modes can operate as an excellent spatial 
filter. Since a typical single-mode fiber remains sin- 
gle mode over a factor of two in wavelength, inject- 
ing the output from a superluminescent diode gives 
an excellent temporally incoherent point source, an 
important device once only available as an inferior 
approximation. Replacing the source with a laser 
permits retention of temporal coherence and again 
gives a versatile point source with small mass to 
align accurately. In the latter case, however, earlier 
comments must be augmented, and the fiber may 
have to be terminated with a wedged face to prevent 
coherent reflections from generating instability in 
the laser cavity or elsewhere, since the fiber can 
act as a sensitive Fabry-Perot interferometer. An 
inclined face changes fiber emission geometry from 
circular to slightly elliptical, with a pointing direc- 
tion no longer parallel to the fiber axis. Dome pol- 
ishing the fiber end can also mitigate detrimental 
effects of back-reflections, and facilitates compres- 
sion coupling, but sometimes has other limitations. 

Launching into a single-mode optical fiber 
requires matching convergence angle or numerical 
aperture and waist size to preserve the Lagrange 
invariant, the product of aperture, field angle and 
refractive index, similar to the Abbé sine condi- 
tion or the brightness theorem, which determines 


the highest possible freedom from loss of light 
gathering power (étendue). It is rarely possible to 
meet this condition totally, even with accurate and 
stable alignment, and birefringence and _polar- 
ization properties of fiber and other components 
may worsen it further. Once light is launched into 
an optical fiber, it is not desirable to let it out and 
then try to relaunch it—the power loss is always 
worse than expected. Contamination of end-faces 
exposed to an open atmosphere can massively 
reduce the amount and quality of transmitted 
power, or even destroy the component. Emergent 
power density near the fiber face for one transmit- 
ted watt of green light can be 10!' W m”, eight 
orders of magnitude greater than sunlight. This 
may be sufficient to attract dust by thermophoresis, 
causing enough local heating to craze or ablate the 
fiber material. Such problems are easily recognized. 
Output damage is identified by the appearance of 
Airy-type rings or other intensity structure within 
the ideally Gaussian output beam in the far field. 
Input damage causes only loss of power. Optical 
fibers are typically fragile, requiring one or more 
tough concentric sheaths outside the cladding for 
protection. Their use also demands high-precision 
mechanical designs and mounts, typically to sub- 
micron and microradian tolerances. Even standard 
connectors for single-mode optical fibers have 
complex and exacting requirements, and transmis- 
sion efficiency can fall with each reconnection. 
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Transmitting high optical power, especially 
through single-mode fibers, requires careful speci- 
fication of fiber material, such as a germanium- 
free core that will not form color centers, and 
special optical adhesives that do not denature. 
Transmission may reduce with exposure to high 
transmitted power or cosmic or other radiation, 
micro-cracking with vibration or under sus- 
tained strain from too small a bend radius, or 
inadequate physical support. Unless intentionally 
exploited, sensitivity to external effects is detri- 
mental. Microphonics may sometimes be reduced 
by special care with alignment, and sometimes 
not, depending on fiber and environment, either of 
which may be insufficiently well specified. The only 
way to assure adequate performance is by testing a 
sample of the specific optical fiber from the batch 
available. 

The conjugacy of end faces of coherently bun- 
dled fibers allows imaging of relatively inaccessible 
situations from body cavities to turbine blades. 
However, beware of the potentially ambiguous 
meaning of ‘coherent’ as applied to fibers. A coher- 
ent bundle is a loom of fibers whose ends map spa- 
tially so that an image may be transmitted—like 
the naturally occurring mineral ulexite, whereas 
a coherent fiber typically conserves spatial and/ 
or temporal coherence of the light. Optical fibers 
based upon coherent constraint of the propagating 
modes, by step or graded refractive index control 
in regions of a few microns diameter, are distin- 
guished from incoherent optical light guides, typi- 
cally with step index confinement within a few tens 
of microns up to millimeters in solid material or 
liquid-filled tubes. 

A special case is the GRaded INdex (GRIN) lens 
exploiting the radial control of refractive index to 
perform the operation of miniature lenses, using 
the techniques of optical fibers. Control of refrac- 
tive index gradient in fibers may also inhibit dis- 
persion and increase bandwidth. In a step-index 
single-mode fiber an evanescent wave extends 
into the cladding material. Ambient conditions 
may be sensed by their effect on this evanescent 
wave, and hence on the propagated intensity or 
polarization. This same sensitivity manifests as 
detrimental effects for pure transmission, where 
bending radius, temperature changes, stress micro- 
cracking, vestigial inhomogeneity from irregular 
sleeking near the core-cladding interface, and 
other manufacturing imperfections can give rise 


to a range of curious effects only fully appreciated 
by exhaustive empirical testing. Polarization rota- 
tion, temporal fluctuations over a wide range of 
frequencies from seconds to months, and cladding 
mode coupling are all possible. Used with GRIN 
lenses that exhibit residual birefringence, the fiber 
can behave as a temperature dependent wave plate. 
Sometimes fluctuating light leaks are visible where 
the sheath is not opaque. Even with polarization- 
maintaining fiber, using bow-tie stress birefrin- 
gence or elliptical cores, a slight error in launch 
orientation can yield large and unexpected sensi- 
tivity to fiber environment, and not necessarily in 
a predictable way. Polarization preservation where 
one of the two orthogonal axes is attenuated gives 
a purer polarization output, but occasionally at the 
expense of intensity instability. As fiber manufac- 
turing control and methods of characterization 
improve, many of these effects may be reduced, 
or at least their sources better understood and 
avoided. Meanwhile specification for a given appli- 
cation must allow for experience specific to the 
individual fiber type, manufacturer, and indeed 
the production batch. Six 1.2mm diameter clad 
quartz fibers were used in OADS as light pipes fol- 
lowing a single chrome etched field stop to direct 
each of the six sheet images to its own APD. 


18.6.3 Physical mounting 


Physical mounting of components must maintain 
correct position without stress or distortion. To 
avoid stress concentration, support must spread 
the load either by flexible media or sprung clamp- 
ing. The hole, slot and plane mount, with gravity 
or light spring loading, provides kinematic loca- 
tion but cannot be locked without over-constraint. 
It is thus useful for exact relocation, for example 
of a hologram for reconstruction, or a component 
that must be removed and then replaced exactly, 
but is not robust and is typically confined to the 
vibration-isolated stable table of a temperature 
controlled laboratory. Even in that environ- 
ment, where optics are mounted on posts, the 
posts should be as short as possible with the larg- 
est possible diameter. Where a mount is fixed to 
a supporting metering structure with screws, the 
component face should be dished to provide the 
largest footprint, and controlled locking torque 
specified to constrain distortion. While the meter- 
ing structure may be as simple as a flat table with 
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tapped holes or magnetic clamps for greater ver- 
satility, for specialized instruments it may be a 
complex casting, machined frame, or composite 
assembly of diverse elements. The retention of 
its physical properties, especially dimension and 
rigidity, is important. In OADS the necessary 
beam pointing stability of <5yrad was achieved 
by a metering structure of low-expansion stainless 
steel, with weight and rigidity optimized by FEA 
of the complete structure, isothermalized by eth- 
ylene glycol circulating in labyrinthine manifolds. 
Each laser was precisely located in one of the six 
circularly symmetric transmitter lens assemblies 
with five lockable adjustments, two tilt, two trans- 
verse, and one focus. Each completed transmitter 
assembly was directed to the correct point in space 
by rotation in a hollow spherical bearing near the 
output nodal plane, using one rotation and two 
more tilt adjustments. Alignments were made in a 
set sequence to meet specified criteria. The adjust- 
ments were locked by screws to a specified torque 
and all critical sliding or pressure points encapsu- 
lated in rigid adhesive. Most applications require 
neither this precision nor its associated complexity. 

Whether in benign or more demanding envi- 
ronments, there is a trade-off between the rigidity 
of accurate placement and available adjustment to 
accommodate changing conditions. Active control 
is complex, to be avoided if possible, but some- 
times essential. Large or fragile components may 
be held in a sine wave mount, a perforated annu- 
lus of elastomer. To seal and/or distribute stress, 
O-ring seals or well-chosen flexible adhesives often 
suffice. Metal clamps are hazardous unless appro- 
priately sprung. Where no movement of the com- 
ponent can be tolerated, rigid sealing may exploit 
materials with graduated properties to distribute 
the stress. 

Ideally, optical mounts must be robust and 
rigid, without affecting the optical properties. For 
materials with optical activity or birefringence 
induced by stress, this can be confirmed by view- 
ing the component between crossed polarizers, 
often an instructive test, with surprising results if 
applied, for example, to spectacle lenses. For high- 
quality components even small mounting stress 
can introduce aberrations, and rigid retention for 
high performance competes with more flexible 
location for field survival. 

Adhesives are common in optical packaging, 
from component retention to optical function. 


Several popular transparent epoxy-based materi- 
als can be cured by ultraviolet exposure; others 
require two components, a catalyst, heating or time 
to reach the required state. When used for reten- 
tion, the final mechanical properties dominate, but 
when adhesives are in the optical path more careful 
choice is advisable. Transparency is usually neces- 
sary, and this may be compromised in use by age- 
ing, nonoptical radiation, stress cracking, change 
of polymer properties, crystallization, or high 
light level. The last is especially relevant where, for 
example, an illuminator has a GRIN lens glued to 
a single-mode fiber. 


18.6.4 Detectors 


Ideally, any square-law detector yields an electri- 
cal signal corresponding to incident intensity, 
manifest in the semiclassical approximation as 
successive quantizations, which may not be resolv- 
able. For sufficiently low light levels, an appro- 
priate detector reports a given fraction, based on 
quantum efficiency, of individual photon arrivals, 
missing some but ideally not changing the arrival 
statistics. As the light level increases, this becomes 
a semicontinuous signal whose shot noise increases 
with the intensity, approximately as the square 
root of the equivalent number of photon arrivals, 
improving SNR. An audible analogy of this might 
be hail on the metal roof of a noisy workshop. This 
is true for a single-point detector such as a PMT 
or APD or for one pixel of a CCD, CMOS or other 
array. Criteria for optimizing detector type assume 
different relative importance depending on the 
application. For photon correlation, the ideal is a 
single-point detector free from dead time, after- 
pulsing, pulse pile-up, internal correlations, and 
with a sensitivity and quantum efficiency as high 
as practicable, to produce single resolvable pulses 
from every field quantization, with no trigger- 
ing events from any other source. This may be 
approached quite closely for high-energy ultravio- 
let quanta with a rubidium telluride photocathode, 
as with OADS pressure and temperature sensing. 
For red photons, both energy and cathode sensitiv- 
ity fall, allowing more false triggers and impairing 
the correspondence of pulse statistics with those 
of the optical field. Single-photon performance is 
approached with reverse biased silicon APDs in 
the Geiger mode. To obtain highly accurate mea- 
surements of high intensity at low bandwidth, the 
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criteria may shift from shot noise intrinsic to the 
signal to other sources such as Johnson noise from 
the load resistor or preamplifier, and keeping the 
noise low moves from optical to electronics design 
expertise. Cooling the detector and its electronics 
below a certain temperature often helps with noise, 
more so where the detector relies upon absorbed 
energy as with a thermopile or bolometer. 

For single-channel detectors, PMTs are fast 
and can have huge sensitive areas, with the pen- 
alty of being physically large and requiring high 
voltages for the accelerating grids. However they 
offer amplification of many orders of magnitude 
up to thousands of amperes per watt, without add- 
ing excess noise to the original quantum realiza- 
tion variance. Wavelength sensitivity typically 
falls from mediocre (~25%) in the blue and green 
to rather poor (~1%-4%) in the red, although 
newer implementations claim significant improve- 
ment. APDs have much better quantum effi- 
ciency (~90%) in the red, and are physically small. 
Although their noise figure can be as low as 10-4 
W Hz-1, when stably cooled to around 0°C, they 
also require a moderately high (~ 200+ V) and 
stable voltage to achieve a gain from a few to a few 
tens of amperes per watt. The sensitive area is often 
inconveniently small (<1 mm?) with a longer dead 
time than PMTs. Lower-performance bulk detec- 
tors of various materials are often cheaper, and 
may be adequate for less demanding applications. 
Operating in either photovoltaic or photocon- 
ductive modes without intrinsic gain, their noise 
typically increases with surface area and tempera- 
ture. Most detectors are fairly robust to optical 
overload, but take varying times to recover. For 
example, a PMT will typically recover its sensitiv- 
ity lost from transient optical overload as soon as 
the dynode decoupling capacitor chain recharges, 
but to recover its noise characteristics may take 
several days in darkness with full voltage applied. 
APDs flip into an avalanche overload state if the 
optical input exceeds a level not much larger than 
the average, but are not damaged if the current is 
externally limited. Recovery after the removal of 
excess light can be quite rapid, but must sometimes 
be encouraged by the temporary complete removal 
of the reverse bias voltage. Thus both PMT and 
APD detectors can be protected by self-limiting 
mechanisms, and are not necessarily permanently 
damaged until the optical input melts the sensing 


or amplifying material. This is also true for con- 
ventional bulk material sensors, although, without 
limiting the electrical current, thermal runaway 
can cause permanent damage. 

Array detectors operate on similar photoelec- 
tric mechanisms but store charge locally for an 
equivalent exposure time. CCDs read the charge 
out serially via a “bucket brigade,” which is a major 
source of noise at low light level. Individual pixels 
rarely have uniform sensitivity and a calibration 
table is desirable for quantitative work. Ageing and 
radiation damage also change the characteristics 
of individual pixels, demanding calibration and 
update of lookup tables. Types of array detector 
are available with different characteristics such 
as pixel size and shape, dead space between pixels 
(fill-factor), well depth (how much charge can be 
stored at an individual pixel site without nonlin- 
ear effects), readout speed and method, and even 
manufacturing technology. For example, CMOS 
requires less power and can be constructed so that 
each pixel may be separately addressed, clocked, 
and amplified. This conditional interrogation can 
greatly increase the dynamic range of the array, 
because those pixels that are more brightly illumi- 
nated may be read out more rapidly, accumulat- 
ing the total from each pixel in external memory. 
Pixels in the darker areas are merely read less fre- 
quently or even at the end of the image acquisition 
time. In this mode, low pixel crosstalk is essential. 
Although excessive cooling can cause a CCD to 
stop functioning, the effect on a CMOS detector is 
desirably to reduce the noise. Typically, the output 
is read as a filtered analog signal to an ADC, either 
on or close to the detector chip. Although the digi- 
tization noise might appear to be no more than the 
inverse of the number of bits, real devices rarely 
perform this well and a bit or two of additional 
noise is common. Claims for the relative merits 
of each type of device change frequently and for 
any application it is important to understand how 
the manufacturer’s claims relate to the proposed 
experiment. 

The art here is to choose a detector that seems 
appropriate and become as familiar with it as 
possible, including how best to drive and control 
it, how to compensate for its shortcomings, and 
what its limitations dictate for the accuracy of the 
experiment and the quantitative validity of the 
conclusions. 
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18.6.5 Miscellaneous observations 


Light gathering power can easily fall below the 
upper limit set by the Lagrange invariant unless 
the numerical aperture is matched everywhere. 
Errors can occur readily when coupling single- to 
multimode fibers, or at the entrance to a detector, 
under the impression that an alignment criterion 
may be relaxed. Typically the receiving optics, col- 
lecting light scattered from the probed phenom- 
enon, is merely a photon-bucket, but this does not 
necessarily mean that aberrations can be tolerated; 
for example, the proper performance of a field 
stop may rely upon its high-definition image at 
some other location. Aperture stops may be simi- 
larly critical. The performance of dielectric stack 
narrowband filters is typically compromised in a 
poorly collimated beam. 

A given specification may be met and imple- 
mented by many different candidate components, 
devices and the phenomena upon which they 
rely. New techniques and innovative devices are 
constantly becoming available. Naturally occur- 
ring materials are now being augmented by arti- 
ficial structures, including sequences of differently 
doped layers, physically and/or chemically con- 
trolled photonic bandgaps, and many other ways 
of spatially or temporally manipulating light from 
classical to quantum domains. The formerly popu- 
lar division of experiments was into classical, where 
the signal is assumed to be continuous analog with 
additive noise, and “photon resolved,” where all the 
information is contained in the intervals between 
photon arrivals, assuming monochromatic light, 
but with device limitations such as dead time, pulse 
pile-up—endemic with Poisson statistics—and 
other limitations. This separation represents rare 
extremes, and increasingly the interesting typically 
low-light-level experiments are in the “gray” area 
between the two, demanding a more carefully con- 
sideration of the nature of the signal and how best 
to use it to measure the phenomenon of interest. 
New manufacturing methods, extending beyond 
chemical vapor deposition and molecular beam 
epitaxy, are being introduced. Although fashion, 
implying ready availability and perhaps low cost, 
favors the new, many older ideas also have merit. 
Having at least heard of as many obscure or archaic 
devices and techniques as possible can be very use- 
ful, where old ideas may find renewed application 


because of other perhaps unrelated advances. For 
example, etching diffraction gratings from laser 
created fringes exceeds the capability of the once 
necessary complex and precise mechanical ruling 
engines. A broad knowledge of natural history is 
beneficial, since many subtle optical and electronic 
ideas have been common in nature for millions of 
years. For example, the Bayer mask used to obtain 
color information from sensor arrays with mono- 
chromatic pixels, without too serious a resolution 
penalty is similar to the implementation of color 
vision in the pigeon by color masking of retinal 
cones. The eye of Anableps has bifocal lenses to give 
clear images above and below the water surfaces 
it favors. Contraction of a cat’s pupil to a slit per- 
mits a larger dynamic intensity range than a circle 
would, without impairing the motion sensitivity of 
an already fovea-deprived organ. However, other 
than as examples, such details are beyond the pres- 
ent discussion of arts and ideas to enhance success. 


18.6.6 Electronics 


The transition from detected photons to sen- 
sible information is mostly performed by elec- 
tronic devices, starting with amplification, via 
digitization, signal processing, and data reduc- 
tion, to retrieval of information for archive and/ 
or presentation as a human-readable display. The 
hardware implementation of ideas introduced in 
Section 18.5.7 has expanded to once unimaginable 
capabilities. Since this trend seems likely to con- 
tinue, a discussion of specific devices is less use- 
ful than brief comments about general techniques, 
although even those become more comprehensive 
with each passing publication. An important trend 
is that optical techniques and devices are increas- 
ingly able to perform tasks once exclusive to elec- 
tronics, from the extreme example of optical fiber 
replacing copper wire, to the integration of optical 
components with architectures formerly used only 
for electronic integrated circuits. The discipline 
of photonics parallels that of electronics, exploit- 
ing nanostructures with capabilities as yet barely 
imagined. 

Once the detector has responded in whatever 
terms are available, the transition to a digital for- 
mat should follow as soon as possible, usually via a 
pulse discriminator or ADC. Although in principle 
the same bandwidth product of data throughput 
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may be represented as a rapidly varying 1-bit sig- 
nal or as a less frequently updated multi-bit word, 
available hardware often suggests an optimal oper- 
ation between these two extremes. 

For real time systems where the input is syn- 
chronized with an independent activity, bottle- 
necks impose undesirable limits, and techniques 
such as dedicated hardware with a real-time oper- 
ating system may be necessary, perhaps demand- 
ing low-level coding and testing, even with integer 
arithmetic and/or bit-sliced architecture. However, 
hardware devices change so rapidly that a once 
clever way to overcome a specific limitation can 
become an irrelevant complication. It seems that 
advances in hardware may shift risk areas toward 
increasingly complex software. However, that has 
not always been so and situations change quickly. 
All recent choices made to achieve system func- 
tion are susceptible to critical review in the light of 
innovations. Sadly, past experience may be a detri- 
ment here. 

Where delayed throughput is allowed, asyn- 
chronization can be achieved by a FIFO buffer, 
through single-or dual-port RAM, to hard disk 
files in a suitable format for later access, depend- 
ing upon the required buffering and accumula- 
tion rates. More than one asynchronous interface 
may be needed to retain clean modularity in 
some complex systems, and to complete specified 
tasks within the time available. Hardware choices 
vary widely, from application specific or very 
high-speed integrated circuits (ASIC or VHSIC), 
through digital signal processors (DSPs), via field 
programmable gate arrays (FPGAs), to generalized 
multicore serial microprocessors. Each has its pros 
and cons. ASICs and VHSICs are fast, expensive 
and inflexible. DSPs are fast, versatile, and typi- 
cally externally preprogrammed. FPGAs can be 
slower, but are dynamically, even conditionally, 
reconfigurable. Graphics processors, developed 
for online gaming, can off-load many of the more 
calculation-intensive tasks to enhance perfor- 
mance. Microprocessors are completely versatile, 
but with the most common operating systems may 
not be able to process signals in real time because 
of intrinsic and uncontrolled task timing, preclud- 
ing interrupt driven actions. Real-time operating 
systems mitigate this, but application software is 
less readily available. At each stage of the process- 
ing one wishes to have fewer data to handle, but 
can perhaps tolerate more complex processing 


algorithms. Hardware and software tools evolve 
rapidly. They become faster, more powerful, and 
more versatile, but also more complex, more dif- 
ficult to use, and often have reduced backward 
compatibility. 

Processing algorithms and hardware must 
be optimized as a system, retaining modularity 
and testability. Analysis is based on information 
intrinsic to the signal, but can be augmented by 
supplementary data to extend the limits of what 
information can be extracted, and the boundaries 
of attainable resolution and accuracy. Exploitation 
of independent auxiliary data, or any of a wide 
range of mathematical and/or statistical tech- 
niques, such as maximum entropy, analytic con- 
tinuation, and super-resolution, may improve 
results. Understanding the output is not always 
trivial and iteration between the design concepts 
in Section 18.5.7, and the hardware choices here, 
may be necessary and can be challenging, par- 
ticularly where the physics and phenomenology 
of the available measurement make extracting the 
required information an “inverse problem” with 
all its pitfalls of multiple values, measure theory 
errors, nonlinearities and numerical instabilities. 

For OADS, the 200MHz raw signal was 
dynamically scaled and filtered before the ADC, 
followed by discrimination and conditional edit- 
ing implemented in discrete components prior to 
loading into the FIFO, subsequently accessed by 
a TMS320C25 DSP, using scaled integer arithme- 
tic with a real-time operating system. Its program 
was downloaded on initialization, permitting easy 
upgrades during development, and later evolu- 
tion of function. For example, velocity, pressure 
and temperature were reported during twenty- 
four 1-2hour flights of the F-16B aircraft in 1990. 
Velocity and particle properties were measured 
during later research flights on F-104 and SR-71 
aircraft, with final emphasis changed towards 
particle size and concentration measurements on 
some final DC-8 flights. 


18.6.7 Power supplies 


Without sufficient attention, power supplies can 
be a high risk item. Not only can they fail, but 
they may also induce into the signal instabilities, 
correlated fluctuations, noise in one or more fre- 
quency bands, and inconstancy with input volt- 
age, temperature, load, and/or ambient conditions. 
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Although these are mitigated by careful design, 
specifications must be confirmed by testing, cali- 
bration, characterization and monitoring of any 
artifacts in the light source or signal. For thermal 
sources, inertia prevents significant fluctuations 
faster than a few tens of kilohertz, but solid-state 
sources can have sub-nanosecond fluctuations. 
Accordingly, great care must be given to the power 
supplies driving the light sources, detectors, and 
sensitive electronics. Although nontrivial, feed- 
back stabilization should always be considered 
and may often be necessary. Note also that while 
a single power supply may perform acceptably, 
simultaneous operation of several even apparently 
unrelated supplies in the field environment may be 
susceptible to greatly worsened noise performance. 


18.7 TESTING AND CALIBRATION 


Much can be learned from just looking at the sys- 
tem and phenomenon, and even incidental obser- 
vations may indicate unexpected behavior, whose 
rectification builds confidence in subsequent 
measurements and the validity of conclusions. 
Components should be rigorously tested as soon as 
available. Curious effects are often noticed near the 
limit of awareness, and although they may be nei- 
ther repeatable nor completely understood, their 
dismissal as insignificant may invite later catas- 
trophe. Table 18.12 augments Table 18.6 suggest- 
ing further questions whose proper answers may 
improve confidence. 

No amount of testing can ever prove that soft- 
ware is infallible. The likelihood of unknown 
error is high. Good designs allow upload of new 
code modules to correct anomalies found in test- 
ing, permitting analysis and rectification before 
damage occurs. Power supplies must not only be 
tested as individual units in isolation, but also in 
their final operational configuration. It is well to 
impose test conditions outside those expected, and 
for longer periods, logging all relevant parameters 
with higher accuracy than seems necessary. It is 


Table 18.12 Testing and calibration 


Initial function 
Anomalies 
Deterioration 
Contingencies 


important to examine these test results in various 
graphical formats and actually think about what 
trends might mean. Surprisingly many failures 
have origins in data measurable, or even measured, 
long before, but neither understood nor even prop- 
erly examined. 

Tests should simulate operation as closely as 
possible. In OADS as a subset of system testing, the 
lasers could be independently modulated and the 
response of each APD to its own and other chan- 
nels examined for various targets. Complete test 
documentation allows later insights about why the 
results are not identical with what was expected— 
they never are—and can allow compensation or 
correction to improve the measurements or per- 
formance, or at least to predict the boundaries of 
accuracy. 

Calibration is based on knowledge gained by 
testing. Quantitative characterization may permit 
adjustment of output values to reflect their input 
origins during operation, or to correct measure- 
ments later. The two aspects of calibration are the 
quantitative aptitude of the apparatus for its design 
purpose, and the traceability of the recovered 
information to required properties of the exam- 
ined phenomenon. 

For OADS, the only important physical calibra- 
tion was the separation and parallelism of the sheet 
pairs, performed by exposing a dimensionally sta- 
ble sensitive film to the illuminated pattern at sev- 
eral axial stations and using a traveling microscope 
to measure the geometry. 


18.7.1 General health and status 
monitoring 


Extensive monitoring is desirable. Power on system 
(or self) test (POST), and routines to exercise the 
equipment as it approaches operational status, are 
conventionally provided, but should be augmented 
by health and status monitors for all quantities that 
could affect performance or the values of retrieved 
measurements. Monitored values should be 


Does everything work as planned? Does it meet design expectations? 
What do unexpected observations mean? 

In a simulated environment, does it get worse at an acceptable rate? 
How is less-than-ideal operation to be accommodated? How is the 


unexpected to be handled? 
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recorded as a time-logged data stream at whatever 
rate is appropriate for the parameter concerned. 
Table 18.13 lists examples of usefully monitored 
quantities with supplementary notes following. 

For temperature, pressure, and thermal con- 
trols, a readout rate of 1Hz is mostly adequate. 
Power supplies require either a more rapid readout, 
or a separate assessment of root mean square (rms) 
fluctuation, noise, crest factor, or other warning of 
unexpected operational noise or transient spikes. 
If the illumination intensity can fluctuate on a tim- 
escale shorter than the inverse monitoring rate, 
then additional properties should be assessed, as 
with power supplies, above. Most serious may be 
fluctuations and drifts on a timescale similar to 
that of the detector or camera exposure or read- 
out, but not necessarily in phase. This can lead to 
quite subtle effects in later processing or, for 2-D 
sensing, picture-to-picture variations, and moiré; 
or other patterns in single pictures. If such obvious 
effects persist despite design approaches intended 
to remove them, then the bad data may be rejected 
later only at the cost of efficiency, if they may be 
rejected at all. However, small attendant changes 
in SNR in the detector signal or picture are not 
so easily discovered nor compensated. Indeed, 
these effects may be almost impossible to compen- 
sate, so it is important during design and testing 
to eliminate or at least reduce problems associ- 
ated with the interactive effects of power supplies 
and the devices they drive. For example, calibra- 
tion may change if only parts of a multifunction 
apparatus are currently activated, and this must be 
reported in the health and status archive attached 
to the provenance file for the specific experiment. 
Monitoring helps support claims for the validity of 
measurements. 


Many internal systems merit stabilization by 
negative feedback, with the loop signal monitored. 
It is relatively cheap and easy to characterize such 
behavior. Experimental data may be only poorly 
understood without at least a time record of rms 
and peak-to-peak fluctuations, together with the 
amplitude and phase, or even power spectrum of 
the light source noise and drift. 


18.8 HOSTILE ENVIRONMENTS 


Optoelectronics can be challenging enough even 
in benign conditions. Hostile environments that 
threaten success are of two kinds, hostile to the 
instrumentation, and hostile to the observer. These 
include all aspects of the measuring process during 
operation, and the program that brings the mea- 
suring device to the current situation. 

In the first class, conditions such as extremes 
of temperature, pressure, radiation, acceleration, 
vibration, shock, chemical attack, electromagnetic 
fields, noise, power source fluctuations, cosmic 
radiation and perhaps other specific conditions 
lead to potentially less than satisfactory results. 
Even if the system is designed to undergo grace- 
ful degradation, successive levels of deterioration 
may include (1) loss of calibration leading to poor 
or untrustworthy measurements, (2) loss of func- 
tion, preventing any measurements, (3) destruc- 
tion of the instrument, or finally (4) collateral 
damage, implying that the instrument is not only 
destroyed but also causes associated damage that 
may be much greater. The loss of a $1.5b Mars mis- 
sion because the same type of vulnerable compo- 
nent was used in all three of the triply redundant 
timing circuits was a sad example of instantaneous 
and unexpected failure. 


Table 18.13 Candidate health and status monitors 


ISO date, time, and all experimental provenance 


Power supply input and output voltages and currents 


Illumination level and stability of the light source(s) 


A sampling of temperatures throughout the apparatus 


Internal pressure, humidity, and other vapor pressures 


Electrical and thermal control voltages and currents 


Fluid pressure, temperatures, flow rates, leaks 


Loop signal in negative feedback stabilization wherever used 


Independent views of the phenomenon and instrument (video cameras) 
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Table 18.14 Ideal properties 


Simple, low technology 
Rigid, robust, small 
Modular, reliable, cheap 
Automated by default 
Low power demand 
Calibrated, verified 
Tested, characterized 


Easy to understand 
Resistant to damage 

Easy to diagnose and fix 
Reduces operator mistakes 
Stays cool, widely applicable 
Constant and trustworthy 
Known and understood 


Table 18.15 Warnings 


Chemical attack ruins 
Gravity changes disturb 
Vibration changes things 


Electromagnetic radiation gives errors 


Stress reduces performance 
Temperature drifts misalign 


Changes affect things unexpectedly 


Noise reduces capability 
Hard radiation damages 
Power supplies fail 
Windows get dirty 
Software crashes 
Pressure distorts 

Fluids leak 


Three aspects of environments hostile to the 
designer of the system are (1) not enough money, 
(2) not enough time, and (3) not having a cham- 
pion. Although the first two are common and 
obvious, the last is the most serious. To make 
anything happen takes someone dedicated to 
its successful completion. During instrument 
deployment the observer may also be exposed 
to hostile environments, such as extremes of 
temperature, pressure, vibration, acceleration, 
radiation, noise, antagonism, unreasonable 
expectations, or stress. 

Four classes of hostility merit consideration 
during the design of the instrument or methods for 
its deployment and use. These are where (1) con- 
ditions in the phenomenon are extreme, (2) access 
may be constrained, (3) the instrument is exposed 
to an extreme environment, and (4) the observer is 
under difficult or stressful conditions. Table 18.14 
introduces a few of the ideal properties whose con- 
sideration may improve success. 

Obvious though they may appear, warnings in 
Table 18.15 can apply in benign as well as hostile 
environments and may apply to the instrument or 
to the operator, or both. 

Outdoor or field installations are often exposed 
to diurnal temperature cycling or steep tempera- 
ture gradients, whether in use or not. Direct or 
reflected sunlight, rainwater, and unsuspected 
fixed or pulsed magnetic fields can introduce 


problems. Exposure to moisture or high humid- 
ity can age optical surfaces and dielectric coatings, 
also possibly causing fungus buildup or other bio- 
logically driven damage surprisingly rapidly. Any 
one of these is easily averted or accommodated: 
avoiding them all, and also the unanticipated ones, 
can be challenging. 

In summary, optoelectronic measurements in 
hostile environments are difficult. Analysis and 
thoughtful design mitigate risk. Planning graceful 
degradation, anticipating the unknown unknowns, 
talking honestly and often with the customer, and 
having plenty of money, time, support, and good 
luck are all beneficial. 


18.9 CHECKLISTS 


Table 18.16 lists some component properties 
deserving examination, followed by Table 18.17 
stressing environmental aspects. 


18.9.1 Analyzing unwanted effects 


Any part of a system or its environmental expo- 
sure may have unanticipated effects on any other 
part, and to minimize later surprises a useful 
technique is to create a spreadsheet whose first 
column contains all possible affective proper- 
ties and whose first row contains all aspects that 
could suffer effects. Examples of both properties 
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Table 18.16 Component considerations 


Optical radiation source 


Type 


Geometry 
Wavelength 
Spectrum 
Intensity 
Power 
Polarization 
Stability 


Optical components 
Lenses 


Beam splitters 
Filters 
Aberrations 
Geometry 
Refraction 
Materials 
Coatings 


Optical system 
Mechanical 
Alignment 
Polarization 


Detectors 


Type 
Principle 


Mechanical 
Detection 
Speed 
Noise 


Thermal, laser, diode, flash, arc, discharge, synchrotron, fluorescence, 
phosphorescence, explosion, biological, others 

Size, 2- and 3-D beam shape, divergence, pointing, spatial coherence 

X-ray, far and near ultraviolet, visible, near and far infrared 

Bandwidth, distribution, temporal coherence, tuning 

Intrinsic luminance, brightness, spatial and temporal stability, noise 

Efficiency, electrical and thermal stabilization, cooling 

Type, orientation, degree 

Frequency, pointing, intensity, beam shape 


Type, material, design, specification, compromises, speed, resolution, 
surface finish, tolerances, handling and mounting 

Flatness, thickness, multiple reflection, parallelism 

Bandwidth, tuning, edge roll-off, efficiency, reflections, absorption 

Seidel, chromatic, higher order 

Complexity, aperture, stops, vignetting 

Refractive index, angular dependence of reflection, dispersion 

Birefringence, transmissivity, homogeneity, strength, chemical stability 

Complexity, efficiency, materials, durability, scatter 


Design, rigidity, robustness, beam control, adjustments (and ranges) 
Methods, sequence, sensitivity, criteria, stability, locking, interactions 
Necessity, preservation, type (“S," “P," circular or elliptical) 


Array, line, single point, static (e.g., film), dynamic (e.g., electronic) 

Photoelectric effect, permanent or reversible chemical change, thermal 
or mechanical effect 

Sensitive area, packaging, thermal stabilization, protection 

Capabilities, quantum efficiency, sensitivity, gain, noise 

Dead time, read rate, fatigue, recovery time 

Shot, dark, amplifier, statistical, after-pulsing 


Table 18.17 Environmental considerations 


Mechanical Vibration, shock, g-loads, rigidity, static and dynamic stresses, strength, materials, 


construction, pressure, extraneous damage, weight, size, cost 


Thermal Heat, cold, static/dynamic gradients, changes, distortion, misalignment, insulation, 


Radiation Internal and external stray light, stops, filters, surface treatments 


Aero-optics 


Variable refractive distortions of wave front 


Noise Electromagnetic or radio frequency interference (EMI, RFI), grounding, hardening, 


screening, isolation, spurious signals, power supplies 
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and components are given in Table 18.18. In the 
body of the spreadsheet, a manual entry is made 
into each and every cell to verify that any mutual 
effects between row and column heading have 
been considered. Effects here include any possible 
interaction or sensitivity, which may then be quan- 
tified, or less tangible warnings, such as excessive 
complexity. The exercise may trigger a memory of 
a similar or analogous situation with good or bad 
former outcome. The cell entry creates an audit 
trail later to become a powerful diagnostic tool. If 
you think this is a tedious and unnecessary exer- 
cise, try fixing things later. Most entries will indi- 
cate that there is no cross effect. Other possible 
entries are that the effect has been considered and 
is acceptable or that the effect has been analyzed 
and remedial action taken, as indicated by a trace- 
able reference. Perhaps most seriously, and a major 
justification for this exercise, is that an effect is 
found to be likely but unknown, requiring further 
effort directed either towards its understanding 
and rectification, or its quantitative inclusion in 
the risk analysis. 

Table 18.18 is not intended to be exhaustive, but 
merely contains an exemplary set of possible sub- 
ject items to be reduced, refined and/or augmented 
according to the specific optoelectronic system. 


Table 18.18 Possible effects matrix 


Examples of properties 


As an example, a reduction in ambient pressure 
once seriously compromised the behavior of an 
argon ion laser. With the reduced refractive index 
of the lower-pressure air, the emission angle from 
the Brewster windows inside the cavity changed 
sufficiently to detune the resonator. Unexpectedly, 
retuning did not restore the power since the new 
direction in the cavity passed through a slightly 
thicker section of the wavelength selection prism, 
also intracavity. Even though the increase in 
absorptive attenuation was slight, the many passes 
within the resonator substantially reduced the 
cavity gain. The first lesson is that reduced ambi- 
ent pressure changed the pointing direction and 
power in a way that was not immediately recover- 
able by realigning the cavity—the prism also had 
to be physically displaced. The second lesson is that 
since such ion lasers are no longer in common use, 
the first lesson is largely irrelevant. The real lesson 
therefore is that a situation must be examined in 
greater depth and with more imagination than 
might at first seem necessary. 

In a working career thousands of such examples 
appear, but using the cross-reference effects matrix 
postulated above can improve the probability of 
creating a working design with minimal evolution- 
ary iterations. 


Aberrations, absorption, adjustments, ageing, alignment, blooming, 


breaking, calibration, changes, charging, chemistry, collimation, 
constancy, corrosion, cost, damage, design, detectivity, dimensions, 
dispersion, distortions, durability, efficiency, electromagnetic 
interference, errors, external stray light, field, flare, friction, fungal 
growth, ghosts, g-loads, ground loops, hard radiation, heat sources and 
sinks, insulation, internal stray light, isolation, Lagrange invariant, 
mounting methods, noise, nonlinear effects, optical feedback, 
outgassing, packaging, pointing, polarization, positioning, pressure, 
quality, quantum efficiency, quantum statistics, radio-frequency 
interference, reflections, refractions, rigidity, robustness, screening, 
sensitivity to everything, shock, signal and data formats, size, 
specifications, spectrum, stability, static and dynamic stresses, stiction, 
strength, temperature, testing, thermal gradients, transmission, 
vibration, vignetting, weight 


Examples of components 


Adhesives, apertures, baffles, beam splitters, bearings, coatings, 


construction, detectors, electronics, Faraday and Kerr effect devices, 
fibers, filters, GRINs, health monitors, lenses, light sources, materials, 
mounts, locking, moving parts, polarizers, power supplies, software, 
stops, structures, surface treatments, thermal control, windows 
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Table 18.19 Design homilies 


¢ Software and power supplies give more trouble and consume more resources than can possibly be 
imagined; plan not to be disappointed by this 

¢ Performance and ruggedization are better achieved by good initial design than by any number of 
later attempts at remediation 

¢ Underspecified science leads to impossible engineering 

¢ Communication, error checking, and conflict resolution are equally applicable to systems, 
hardware, software, scientists, engineers, managers, and customers who provide the funds 

* Knowledge alone is not sufficient ... choosing people with proper experience is essential 

* Configuration control from the start, where design evolves most rapidly, may prevent errors; 
unconscious neglect makes later disasters almost inevitable 

¢ A top-down straw-man design, starting at system level, and percolating down to isolate any area of 
uncertainty, is desirable as soon as possible 

¢ Hardware and software must be modular, and should map isomorphically 

¢ Optical, mechanical, interconnection, and functional layouts must be available early 

¢ Everything is made of India rubber—nothing is truly rigid 

¢ The Internet is an extremely valuable source of ideas and information; however, since content is not 
typically peer reviewed, it must be independently corroborated 

e Fifty hours in the lab will easily save 1h in the library 

¢ Simplicity is a worthy goal, but oversimplification can invite problems 

¢ Innovation is only acceptable where no possible method currently exists 

* Quality is meeting the specifications 

¢ If in doubt, make it stout, out of things you know about 


Table 18.20 Attitude homilies 


¢ Talk to the customer 

¢ Intend the excellence of your design 

¢ Be honest, pragmatic, and courageous 

¢ Study how others have approached and solved similar problems 

¢ Test potential solutions and guidelines for relevance to your purposes 

¢ Understand why something was done the way it was done, before committing to doing it 
differently 

e Seek advice, comments, and recommendations; detractors are especially creative, and will be most 
helpful, if somewhat indirectly 

¢ Consider all guidance carefully: suspect that which does not fit with intuition 

¢ Never trust opinion, however informed, without supporting evidence 

¢ If you are being brilliantly innovative, think about it harder, and insist others check it 

¢ Question everything; analyze and understand the answers 

¢ Check every assumption; check it again 

¢ Historical reasons can vary from wise cautionary guidelines to unexamined prejudice 

¢ Distinguish between intelligent compromises and short cuts; there are no short cuts 

¢ Avoid moving parts; avoid nonstandard parts; if possible, avoid parts 

e Buy the best and cry only once 

¢ Hope is not a strategy 
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18.10 HOMILIES FROM 
EXPERIENCE 


A. Good choices 
A. Experience 


Q. How do you succeed? 

Q. How do you make good 
choices? 

Q. How do you get A. Bad choices 


experience? 


Many lessons may be encapsulated in aphorisms, 
whose illusion of triviality belies the pain associ- 
ated with truly apprehending their significance. 
As light relief, two tables give examples of maxims 
encapsulating arts acquired by bitter experience. 
Table 18.19 passively addresses system design issues, 
whereas Table 18.20 advances less scientifically 
tractable ideas and attitudes whose comprehension 
helps to reduce risk. These more psychologically 
based concepts are cautiously offered to mitigate 
the anguish incurred by those with the temerity to 
embark upon optoelectronic projects. 
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modes, 208-209 
antiferroelectric liquid crystals, 
209 
deformed helix mode 
ferroelectric liquid 
crystal, 209 
electroclinic effect, 209 
N*-SMC*, 208-209 
Analogue optical processing, 
542-543 
Analogue processors, 554-557 
Anamorphic lenses, 338f 
Angular disparity, 295, 295f 
Angular divergence, 434 
Anisotropic Bragg diffraction 
geometry, 552f 
Anisotropic properties, in LCDs, 
142-145 
Anode, 285 
Anthraquinone, 149 
Antibonding, 578f 
Antiferroelectric liquid crystals, 
209 
Anti-Stokes Raman thermometry, 
397-399 
AO Bragg cell, 246t 
Aperture, 116, 117f, 556 
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repeaterless systems, 9 
Arrayed waveguide grating router 
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Assembly methods, systems, 
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Astigmatic lens, 520-521 
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sensing, 439, 444-458, 
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Attenuation, 393-395 

Augmented reality (AR), 206 
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Automatic focusing, 520-521 

Automatic track following, 520, 
521-522 

Automotive windshields, 415-417, 
416f 
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300-301, 311, 312f 

Aviation, 458-467, 477-478 

AWGER, see Arrayed waveguide 
grating router 

AWG, see Array-waveguide gratings 
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mode LCD, 196 
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Backbone networks, 81-91 
cost approach, 92-93 
market introduction, 92-93 
multiplexed architectures, 73 
Backplane, 284-285 
Backscattering, 392-393, 400-401, 
445, 447-448, 485-487 
Backward-pumped Raman fiber 
amplifiers, 24-25 
Ballistic air drops, 477-478 
Bandshifts, 581 
Band structure, 609f, 611 
Bandwidth, 26-27 
amplification, 26-27 
Bathymetry, 482 
Beamforming, 548-550, 555 
Beam index cathode ray tubes, 119, 
120f 
Beam-riding configuration, 324 
Beam size, 434 
Bearer services, 47-48, 48t 
Benes switching matrix, 39-40, 40f 


BER, see Bit error rate 
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522-526 
Binocular depth cues, 295-296 
Binocular vision, 295f, 298 
Biochemical spatial light 
modulators, 251-252 
Biological warfare (BW), 478 
Bioluminescence, 583 
Birefringence, 146 
electrically controlled mode, 
155-161, 159f, 160f 
Bistability, in LCDs, 200 
Bistable cholesterics, 198-199, 199f 
Bistable nematic LCDs and ZBD, 
199-203 
Bistable twisted nematic (BTN) 
mode, 200 
Bit error probability (BER), 20-21 
Bit error rate (BER), 10-13, 20 
Bit-oriented holographic storage, 
534-537 
Bitrates, 47-48, 48t 
Bit stream encoding, 522-526 
Blocking target types, 100-101 
Blue phase TFT displays, 209-211, 
210f, 211f 
Blu-ray Disk, 512 
Bonding, 578f 
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Boojum, 203 
“Bookshelf” geometry, 188 
Bragg cell geometry, 551f 
Bragg diffraction, 551-553 
Bragg gratings 
coherence interferometric 
sensors, 410 
distributed optical fiber sensors, 
410 
fixed-channel OADMs, 45 
intensity-based optical fiber 
sensors, 361 
multiplexed optical fiber 
sensors, 386 
Bragg-selectivity, 532-533 
Bragg wavelength, 361 
Braun, K.F., 113 
Breaking camouflage, 298 
Brightness, 299 
Brillouin backscattering, 400-401 
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Brillouin scattering, 400-401, 
408-410 

BIN mode, see Bistable twisted 
nematic mode 

Bubble cross-connect arrays, 251 

Burn-in, 283 

OLED aging and, 273-274 
Bursts, 83 
BW, see Biological warfare 
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Caenorhabditis elegans 
neotodes, 625 
Calibration 
lidar, 439-442 
pressure error, 458-462 
systems, 654, 677-678 
Cameras, 97-110, 99f 
CCD, 101-102 
imaging, 97, 101-102 
Camera tube, 97-110, 99f 
appearance of ultrahigh- 
sensitivity HARP, 105f 
basic configuration and 
operating principle 
of, 98 
injection and blocking types 
of targets, 100-101 
types of scanning electron 
beam systems, 100 
development of HARP camera 
tubes 
basic structure of targets for 
practical use, 105 
current-voltage 
characteristics, 104 
experimental details, 
103-104 
investigation of high 
sensitivity and high 
picture quality imaging 
devices, 102-103 
origin and operating 
principle of, 104-105 
target types and 
amplification effects, 103 
evolution of HARP camera 
tubes, 105-106 
focusing and deflection system 
of, 100f 


principal characteristics of 
ultrahigh sensitivity 
HARP camera tube 
and applications, 108-110 
lag characteristics, 107 
noise, 107-108 
resolution characteristics, 
107 
sensitivity, 106 
spectral response 
characteristics, 107 
ultrahigh sensitivity 
photoconductive camera 
tube, 101-102 
Canon, 174f 
monitor, 189 
Capacitance, 270-271 
charging, 276-277 
Capacity 
fiber-coax networks, 57, 58f 
fiber-wireless networks, 56 
multiwavelength networks, 56 
transmission lines, 22-23 
Carbon dioxide lasers, 453, 458, 
460, 462, 468, 472 
Carbon nanotubes (CNTs), 626 
Case studies, spectroscopy, 597-599 
Cathode 
additional metallization, 288 
cathode materials, 287-288 
processing, 288 
Cathode ray tubes (CRTs), 
113-121 
beam index, 119, 120f 
color, 116-118 
contrast, 118 
flat, 119 
monochrome, 115 
projection, 115-116, 116f 
safety, 119 
structure and its operation, 
113-115, 114f, 116f 
CATV networks, 54-56 
C-band, 6 
EDFA, 26-27 
CB effect, see Controlled 
birefringence effect 
CB, see Conduction band 
CCD, see Charge-coupled devices 
CD-based technologies, evolution 
of, 507t 
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Recordable format 
CD-ROM format, see Compact 
Disc read-only-memory 
format 
CD-RW format, see Compact Disk- 
Rewritable format 
CD, see Chromatic dispersion 
Cell phone displays, measurement 
of, 417-418 
Cell structure, 129-130 
Cell voltage, 126 
Center thickness, 414-415 
Cerenkov effect, 352f 
CGS, see Continuous grain silicon 
Channels, 26-28 
Charge behavior, 99f 
Charge-coupled devices (CCD) 
cameras, 101-102 
document scanners, 339-340 
imaging sensors, 101-102 
Checklists, components, 679-681 
Chemical detection, 365-369, 432, 
467-471, 478, 490-491 
lidar, 432, 467-471, 484, 490-491 
Chemical indicators, 364, 367 
Chemical sensing, 365-369, 432, 
467-471, 478, 490-491 
Chemical warfare (CW), 478 
Chemiluminescence, 583, 590 
Chip-on-board, 154 
Chip-on-glass (COG), 154 
Cholesteric flexoelectricity, 
211-213, 212f 
Cholesteric liquid crystal displays 
(LCDs), 232-233 
bistable cholesterics, 198-199 
temperature/strain/gas sensors, 
197-198 
Cholesterics, 140 
Chromatic dispersion (CD), 13-15 
Chromophores, 579 
Cinoptics, 206 
Circuits, solar cells, 603f 
Cis-isomers, 149 
Cis-trans isomerism, 624 
Clearing point, 140 
CLEAR system, see Hi-Contrast 
and Low Energy Address 
and Reduction of False 
Contour Sequence system 
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Climatology, 457 
Closed-loop operation, 376 
Clouds, 448-449, 457-458, 
485-487 
CMOS sensors, see Complementary 
metal—-oxide- 
semiconductor sensors 
CNTs, see Carbon nanotubes 
Coating(s) 
components, 669 
thickness, on-line 
measurements, 419-421 
COG, see Chip-on-glass 
Coherence 
coherent heterodyne 
multiplexing, 383 
coherent heterodyne 
spectroscopy, 436-438, 
440-442, 445-446, 453, 
454 
coherent light-beating, 436-438, 
440 
function synthesis, 402-403, 
402f, 408-410, 408f, 409f 
interferometric sensors, 410-421 
multiplexing, 383, 389-390, 390f 
photorefractive spatial light 
modulators, 247 
remote optical sensing, 436- 
438, 440-443, 445-446, 
453, 454, 472 
Coherent lidar, 493t 
Color 
cathode ray tubes, 116-118 
gamut, 281 
plasma displays, 121-127 
reproduction, 299 
Communications 
optical fibers, 349 
spatial light modulators, 
252-253 
telecommunications, 34-36, 349 
Compact Disc read-only-memory 
(CD-ROM) format, 506, 
508-510 
Compact Disk-Recordable (CD-R) 
format, 506, 508-510, 
509f 
Compact Disk-Rewritable (CD- 
RW) format, 506, 
508-510, 509f 


Complementary metal-oxide- 
semiconductor (CMOS) 
sensors, 656 
circuitry, 150 
imaging sensors, 101-102 
Components 
coatings, 669 
considerations, 680t 
physical mounting, 672-673 
property checklists, 679-681 
spectrometers, 584-588 
systems, 666-677 
Computers, interconnects/optics, 
544-545, 550 
Concave cylinder scanners, 323 
Conduction band (CB), 630 
Construction, of LCDs, 149-154 
Contact lenses, 414-415 
Contact-type array scanner optics, 
335 
Contention resolution, 85 
Continuous grain silicon (CGS), 
278 
Continuous wave (CW), 622 
Contrast, 299 
cathode ray tubes, 118 
Controlled birefringence (CB) 
effect, 232 
Conventional ring topology, 75f 
Convex cylinder scanners, 323 
Core networks, 36-41 
Core node, 84f 
Core routers, 84, 85 
Correlation 
auto/self-correlation, 411 
photon correlation spectroscopy, 
593-594, 596 
Cost approach, market 
introduction, 92-93 
Counter-propagating optical pump 
pulses, 407-408, 407f 
Cramer-Rao limit, 441, 456 
Cross-bar switches, 562-563 
Crossconnects, 39, 40f, 41f, 
252-253 
Cross-phase modulation (XPM), 
16-17 
Crosstalk, 45, 299, 535 
CRTs, see Cathode ray tubes 
Cryogenic fluid leaks, 355 
Cryogenic leak detectors, 356f, 395f 


Crystalline silicon solar cells, 
608-610 

Current programming, 280-281, 
280f 

Current sensors, 377-378 

Current-voltage (I-V) 
characteristics, 104, 
602-603 

Curved bed scanners, 323 

CW, see Chemical warfare; 
Continuous wave 

Cylinder hologon deflectors, 332f 
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Daily solar radiation, 613 
DART, see Distributed anti-Stokes 
Raman thermometry 
Data arrays, 563 
Data coding/processing, 254, 
522-526, 659-660 
Data recording, 534-535 
Data routing, 544-545 
Data storage, 505-539, 563 
DAVID project, 86-89, 87, 89f 
Deactivation process, 578, 579t 
Decay lifetime sensors, 364 
Deflection/deflectors 
camera tubes, 100 
cathode ray tubes, 114-115, 118 
position classification, 322-323 
scanning, 328-335, 337 
yoke, 115f 
Deformable helix FLC (DHF), 234 
Deformable membrane devices 
(DMDs), 557 
Deformed helix mode ferroelectric 
liquid crystal, 209 
Degree of order, 140 
Delta arrangement plasma displays, 
132-133 
Dense wavelength multiplexing, 
8,56 
Depolarization of scattering, 575 
Depth cues, in two-dimensional 
image, 294f 
Depth penetration, 482-483 
Depth perception, 293-298, 
314-317 
Design, 647-683 
criteria, 653-666 
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homilies, 682t, 683 
models, 650-653 
remote optical sensing, 
442-444 
sequences, 650 
solar cells, 612-616 
tools, 653t 
wavelength-division 
multiplexing, 20-21 
Desktop laser printers, 341-342 
Detection/detectors, 673-674; see 
also Chemical detection; 
Monitoring/detection; 
Optical detectors 
acoustic signal detection, 377 
codes, 7-8 
cryogenic leak detectors, 395f 
incoherent, 436-438, 441 
noise, 657-658 
photomultiplier tubes, 591, 674 
remote sensing, 436-438, 440— 
44], 442, 453, 456, 472 
spectrometers, 584f 
Device driving, 272 
DGD, see Differential group delay 
DHE, see Deformable helix FLC 
DIAL, see Differential absorption 
lidar 
Diaphragm-actuated devices, 635, 
635f 
Diaphragm microactuator, 634-635 
Dielectric anisotropy, 142-143 
Dielectric voltage, 126 
Differential absorption lidar 
(DIAL), 432, 467-469, 
485t, 490-491 
Differential group delay (DGD), 20 
Differential phase shift keying 
(DPSK), 40 
Diffraction, 551-553 
gratings, 558, 586, 587f 
Digital micro-mirror device (DMD) 
SLM, 238-239 
Digital serrodyne waveform, 375f, 
376 
Digital signal processing (DSP) 
algorithms, 4 
Digital-versatile-disk (DVD) 
format, 506, 510-512 
Digital-versatile-disk recordable 
(DVD-R) format, 511 


Digital-versatile-disk rewritable 
(DVD-RAM/DVDRW) 
format, 511 
DIMOS, 158 
Diodes, 517-518 
properties, 271-272 
Direct-detection 
lidar, 493t 
remote optical sensing, 
436-438, 441, 442, 453, 
456, 472 
Direction finding, 555 
Direct optical spectroscopy, 
365-369 
Disc hologon deflectors, 331, 331f 
Disclinations/defects, 145 
Discotic liquid crystals, 149 
Dispersion, 573-574, 585-588 
chromatic, 13-15 
Display applications, 253-254 
Display driving, OLED devices, 
274-281 
Display fabrication, 283-288 
Display performance, 281-283 
Display resolution, 314 
Distance, 22-23, 357 
Distortions, signal propagation, 
13-20 
Distributed anti-Stokes 
Raman thermometry 
(DART), 397-399, 
397f, 398f 
Distributed cable microbend 
sensors, 353-355 
Distributed fiber amplifier, 9 
Distributed optical fiber sensors, 
378, 392-410 
Disturbance location, 402-404 
Diversification, of fibers, 25-26 
DMDs, see Deformable membrane 
devices 
DNA nanomotors, 624, 625f 
Documentation, systems, 
664-666 
Doping; see also Erbium-doped 
fiber amplifiers 
intrinsic spectral filtering 
sensors, 361 
Doppler effect, 434, 436f, 460 
Doppler velocimetry, 432, 451, 456, 
474, 475 


Doppler wind lidars (DWLs), 488, 
489f, 491-492f 

Dosimeters, 355, 355f, 394f 

Double edge technique, 456, 457f 

DPSK, see Differential phase shift 
keying 

Drive, plasma display panels, 132 

Driver bonding, 154 

Driving gears, 623 

DSP algorithms, see Digital signal 
processing algorithms 

DSSCs, see Dye-sensitized solar cells 

Dual-channel technique, 456, 457f 

Duty cycles, 327 

DVD format, see Digital-versatile- 
disk format 

DVD-RAM/DVDRW format, see 
Digital-versatile-disk 
rewritable format 

DVD-R format, see Digital- 
versatile-disk recordable 
format 

DWLs, see Doppler wind lidars 

Dye-sensitized solar cells 
(DSSCs), 611 

Dynamic capacity allocation, 
56-60 

Dynamic contrast, 281 

Dynamic performance, and 3D, 
282-283 

Dynamic range-resolution trade- 
offs, 255-256 

Dynamic range-speed trade-offs, 
256, 257-259 

Dynamic Scattering Mode 
LCD, 137 

Dynamic wavelength allocation, 60f 

Dynamic wavelength routing, 56f, 
57f, 59 
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Earth observing systems, 485-492 
EDFAs, see Erbium-doped fiber 
amplifiers 
Effective photocount, 441f, 442 
Effective refractive index, 624 
Efficiency 
solar cells, 605, 605f, 609f 
spectral efficiency, 27-28, 28f, 
29t 


690 Index 


EFM, see Eight-to-fourteen 
modulation 
Eight-to-fourteen modulation 
(EFM), 524 
E-ink, 200 
EL-805 (electronic calculator), 137, 
150 
Elastic scattering, 402-403 
Electrically-addressed liquid crystal 
devices, 229, 234-236, 
235f 
Electrically addressed MQW-SLM, 
245t 
Electrically-addressed spatial light 
modulators (E-SLM), 559 
Electrically controlled 
birefringence mode, 
155-161, 159f, 160f 
Electrical response, OLED devices, 
270-272 
Electrical time division 
multiplexing (ETDM), 
36-39 
Electrical-to-optical transduction, 
562 
Electroabsorption modulators, 226, 
547-548 
Electroclinic effect, 209 
Electrodes 
patterning, 152-153 
plasma display, 123, 126-127, 
129-130 
Electro-holography, 226, 250, 559 
Electromagnetic radiation 
full spectrum, 573f 
matter interaction, 573-584 
properties, 570-573 
spectral regions, 572 
Electron guns, 114, 117-118 
for color CRT, 118f 
Electronic levels, 581 
Electronics, overview, 675-676 
Electronic stereoscopic images, 
296-298 
Electron-injection 
amplification target, 100f 
Electro-optics, 151, 175 
deflectors, 332-335 
modulators/applications, 557 
in NLC cells, 231-232 
scanning, 332-335 


spatial light modulators, 226 
wavelength processing, 559 
Electrophoretics, 200 
Electrostatic pressure, mediated by 
photogenerated electrons, 
631-632 
Electrowetting, 200 
EL, see Electroluminescence 
Emission, spectra comparisons, 
585f 
Encapsulation, 288 
Encoding techniques, 522-526, 
542-544 
Energy 
address and false contour, 132 
levels, 77, 122 
solar cells, 613, 614f 
Environmental studies, 467-471 
Environment considerations, 
678-679, 680t 
EPON, see Ethernet PON 
Erbium-doped fiber amplifiers 
(EDFAs), 27f 
C-band, 26-27 
L-band, 26-27 
optical transmission, 7, 26-27 
Error correction, 337, 338f, 523-524 
ETDM, see Electrical time division 
multiplexing 
Ethernet PON (EPON), 53-54 
downstream traffic in, 54f 
upstream traffic in, 54f 
Euler-Lagrange equation, 160, 192f 
Evanescent fields, 350-352, 395, 
577-578, 595-596 
External drum scanners, 323 
External modulator sensors, 357 
Extinction, 448-449 
Extrinsic optical fiber sensors, 349, 
350, 356-357, 363-364 
Extrinsic spectral filtering sensors, 
363 


F 


FIT tube, tracking principle of, 120, 
121f 
Fabrication, 212 
display, 283-288 
plasma display panels, 127-129 
Fabry-Perot filters, 45, 62 


Fabry-Perot modulators (FPM), 
243, 244 

False contour, 132 

Faraday rotator mirrors (FRM), 371 

Fast switching matrix, 86-88 

Fast wavelength-switchable 
OADMs, 46-47, 46f, 47f 

FBG, see Fiber Bragg grating 

FDM, see Frequency-division 
multiplexing 

Feasibility studies, 77, 86-90 

Femtosecond pulses, spatial light 
modulators, 253 

Ferroelectric liquid crystal displays 
(LCDs), 152, 187-191, 189f 

FES, see Focus error signal 

FFP filter, see Fiber Fabry-Perot 
filter 

FFS mode, see Fringe field 
switching mode 

Fiber Bragg grating (FBG), 45 

Fiber-coax network, 58f 

Fiber diversification, 25-26 

Fiberdyne systems, 390-392, 391f 

Fiber Fabry-Perot (FFP) filter, 45, 
62 

Fiber gratings, 45, 361-363, 386, 410 

Fiber optic gyroscopes (FOG), 372f, 
373-377 

Fiber optics, see Optical fibers 

Fiber pyrometer, 356f 

Fiber standards, 25-26 

Fiber-to-the-building (FT TB), 47 

Fiber-to-the-cabinet (FTTCab), 47 

Fiber-to-the-home (FTTH), 47 

Fiber-wireless systems, 59, 60 

FIFO buffers, see First-in—first-out 
buffers 

Filling and sealing, in LCDs, 154 

Film thickness, 420f 

Filtering, spectrometers, 585-588 

FIREBIRD, see Firepond Bus 
Imaging Radar 
Demonstration 

FIREFLY, see Firepond Laser Radar 
Research Facility 

Firepond Bus Imaging Radar 
Demonstration 
(FIREBIRD), 481 

Firepond Laser Radar Research 
Facility (FIREFLY), 481 
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First-in—first-out (FIFO) buffers, 
659, 676 
First-mile network technologies, 
47-48, 48t 
Fixed-channel OADMs, 45 
Fixed/variable optical packet 
comparison, 72 
Flatbed scanners, 323 
Flat cathode ray tubes, 119 
Flat-field laser scanner lenses, 
336-337 
FLC, see Ferroelectric liquid crystal 
Flexible substrate, 284 
Flowcharts, 651f 
Flow fields, 464 
Fluids, cryogenic leaks, 355, 395f 
Fluorescence 
lidar, 467, 470-471 
monitoring/detection, 363-364, 
368-369, 399-400 
spectroscopy, 579t, 582-583, 
590-592 
Flying spot scanners, 339, 340 
FMCW, see Frequency-modulated 
carrier waves 
Focal conic texture, 233 
Focus error signal (FES), 520 
Focus/focusing 
beams, 434 
camera technology, 100 
optical disk storage systems, 
520-521 
FOG, see Fiber optic gyroscopes 
Folded spectrum analysis, 556 
Forces, by light, 620-624 
Forecasting weather, 457 
Forth Dimension Displays, 206 
Forward error-correcting 
codes, 23 
Fourier transforms, 542-543, 542f, 
544, 554-557 
Four-wave mixing (FWM), 17-18 
FPMs, see Fabry-Perot modulators 
Frame rates, 560-562 
Frame response, 163 
Frame sizes, 561 
Fréedericksz transitions, 156f, 
175, 231 
Free-space optics, 551, 562-563 
Frequency-division multiplexing 
(FDM), 384 


Frequency domain spectroscopy, 
436 

Frequency-modulated carrier waves 
(FMCW), 388-389, 388f, 
403-404, 404f, 474f, 475 

Frequency multiplying, 61-62 

Frequency shift keying (FSK), 40 

Frequency- /time-domain optical 
storage, 528-529 

Fresnel scattering, 576 

Fringe field switching (FFS) mode, 
172, 194-195, 195f 

Fringe imaging, 456 

FRM, see Faraday rotator mirrors 

Front parallax barrier, 304f 

FSK, see Frequency shift keying 

FTTB, see Fiber-to-the-building 

FTTCab, see Fiber-to-the-cabinet 

FTTH, see Fiber-to-the-home 

Fujitsu, 195-196 

FWM, see Four-wave mixing 

f-@ characteristic, 336 


G 


G.653 fibers, 25 

G.655 fibers, 25-26 

GABLE, see Global Atmospheric 
Backscatter Lidar 
Experiments 

Gain, distributed optical fiber 
sensors, 408-410 

Galvanometer mirror deflectors, 


328-329 
Gas lasers, 453, 458, 460, 462, 468, 
472 


Gas sensors, 197-198 
Gaussian beam behavior, 663 
Gaussian intensity, 621, 622f 
Geometric factors, depth 
perception, 297-298 
Geometric scattering, 576 
Geosciences, 482-484, 491 
Glass(es), 149-150 
cleaning, 152 
free 3D displays, 206 
as substrate, 284 
Glint, 475 
Global Atmospheric Backscatter 
Lidar Experiments 
(GABLE), 446-448 


Global Backscatter Experiment 
(GLOBE) missions, 447 
Global wind field measurements, 
487-490, 495 
GLOBE missions, see Global 
Backscatter Experiment 
missions 
Glue seal, 153 
GLV device, see Grating light valve 
device 
Gradation, plasma displays, 
130-132 
Gradient forces, 621, 624 
Granularities, 70-71 
Grating light valve (GLV) device, 
238, 239-240 
Gratings; see also Bragg gratings 
array-waveguide, 46-47 
diffraction, 558, 585-588, 587f 
distributed optical fiber 
sensors, 410 
fiber, 45, 361-363, 386, 410 
grating light valves, 239-240 
intensity-based optical fiber 
sensors, 361-363 
multiplexed optical fiber 
sensors, 386 
volume diffraction, 558 
Gratzel dye-sensitized solar 
cells, 611 
Gravure printing, 287 
Gray-scale reproduction, 281 
Grid-connected systems, 615-616 
Group velocity dispersions 
(GVDs), 5 
Guided radiation, 598 
Guided-wave Mach-Zehnder 
interferometers, 
546-551 
Gust warnings, 453 
GVDs, see Group velocity 
dispersions 
Gyroscopes, 373-377, 375f 


H 


HARP, see High-gain avalanche 
rushing amorphous 
photoconductor camera 
tube 

HDTV technology, 98 
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HE,, mode, 370 
Head-up display (HUD) 
technology, 413, 415-417, 
416f 
Health and status monitoring, 
677-678 
Heilmeier, G.H., 174 
Hertz dipoles, 571f, 573, 574-575 
Heterodyne 
interferometer, 372 
microwave signal generation, 
61-62, 61f 
spectroscopy, 436-438, 440- 
442, 445-446, 453, 454 
time-division multiplexing, 383 
Hi-Contrast and Low Energy 
Address and Reduction of 
False Contour Sequence 
(CLEAR) system, 132, 
133f 
Hierarchy, success, 648, 649t 
High capacity transmission 
lines, 29t 
High-gain avalanche 
rushingamorphous 
photoconductor (HARP) 
camera tubes, 98, 102; see 
also Camera tube 
appearance of hand-held HDTV, 
109f 
appearance of ultrahigh- 
sensitivity, 108f 
applications, 108-110 
characteristics, 106-108 
deep-sea organism captured 
with, 109f 
evolution of, 106t 
operating principle, 104-105 
specifications of ultrahigh- 
sensitive, 108t 
target structure, 105 
High-resolution sensing, 402-403, 
410-421 
High-sensitivity sensing, 373-378 
High-throughput routers, 81-82, 
83f 
Historical overviews 
camera technology, 97-98 
optical information processing, 
541-546 
optical transmission, 5-7 


spectral efficiency, 27-28 
HOEs, see Holographic optical 
elements 
Hole conduction layer, 286 
Holographic optical elements 
(HOE), 205, 311 
Holographic storage, 533f 
Holography 
liquid crystal displays, 311 
optical information storage, 
532-537 
scanning deflectors, 329-332 
spatial light modulators, 226, 
250, 563 
storage, 563 
wavelength processing, 559 
Homilies 
attitude, 682t 
design, 682t, 683 
Hopper purging study, 420, 421f 
Hopping electron cathode, 121, 122f 
Horopter, 295 
Hostile environments, 678-679 
HUD technology, see Head-up 
display technology 
Hughes Aircraft Research 
Laboratories, 236 
Human depth perception, 293-298 
Hybrid fiber coax networks, 54-56, 
55f 
Hydrogels, 626 
Hydrophones, 377, 383, 384f 


Identification of technologies, 
77-80 
IGZO, see Indium-gallium-zinc 
oxide 
Image amplification, 563 
Image camera tubes, 98-99, 99f 
Image converters, 255 
Image geometry classification, 
323-324 
Image processing, 562-563 
Imager flatness assessment, 
418-419 
Imagery projection, 562 
Imaging 
cameras, 97-110, 101-102 
charge-coupled devices, 101-102 


complementary metal-oxide-— 
semiconductors, 101-102 
fringe imaging, 456 
lens, 335 
lidar applications, 475-477 
military applications, 475-477, 
481 
perceived depth, 317 
sensors, 101-102 
spatial light modulators, 
561-562 
spectrometer, 588 
thermal imaging, 493t 
Implementation, systems, 647-683 
Incoherent detection, 436-438, 441 
Index profiling, 418-419 
Indium-gallium-zinc oxide 
(IGZO), 278 
Indium tin oxide, 150 
Inelastic scattering, 570 
In-fiber Bragg grating sensor, 362f 
In-fiber grating sensors, 361, 386, 410 
In-fiber Mach-Zehnder 
interferometers, 547-551 
In-fiber optical information 
processing, 545-546, 
547-548, 548-551 
In-fiber optical processing 
functions, 548f 
Information 
flow, 662 
processing, 541-564 
storage/recovery, 505-539, 
658-660 
“Injected” smectic phase, 146 
Injection amplification, 100 
Injection target types, 100-101 
Inkjet printing, 286-287, 287f 
In-plane switching (IPS) LCDs 
basics of operation, 191-194, 193f 
FFS mode, 194-195 
Input scanning systems, 338-340 
Inspection and test, in LCDs, 154 
Instrumentation 
fluorescence spectroscopy, 
590-592 
optical intensity, 588 
photon correlation spectroscopy, 
593-594 
Raman spectroscopy, 592-593 
spectroscopic analysis, 584-599 
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Integrated optics, circuit 
modulators, 375 
Intensity 
based optical fiber sensors, 
352-369 
modulation, 547-548 
scale methods, 276, 276f 
Interconnects, 252-253, 544-546, 
551, 562-563, 611f 
Interfaces, 660-664 
maps, 661t 
operators, 655, 661-662 
Interference 
carrier ratio, 18f 
multiple reflections, 598 
Interferometry 
coherence sensors, 410-421 
direction detection, 453 
distributed optical fiber sensors, 
404-407 
low-coherence, 410-421 
Mach-Zehnder, 370f, 372f, 389, 
546-552, 598 
Michelson interferometers, 370f, 
390 
optical fiber sensors, 369-378, 
383, 389, 404-407, 
410-421 
remote optical sensing, 453 
time-division multiplexing, 383 
Young interferometer, 598 
Interlaced scan, 325 
Internal drum scanners, 323 
International Telecommunications 
Union (ITU), 25-26 
Intersymbol interference (ISI), 14 
Intrinsic optical fiber sensors, 349, 
350, 352-355, 361 
Intrinsic spectral filtering sensors, 
361 
Irradiance (E), 633 
ISI, see Intersymbol interference 
ITU, see International 
Telecommunications 
Union) 


J 


Jablonski energy level diagrams, 
578-579 
Janus colloids, 200 


Japan Broadcasting Corporation 
(NHK), 101-102 

JAWS, see Joint Airport Weather 
Studies 

Joint Airport Weather Studies 
(JAWS), 460 

Jones matrices, 155 

Judgement of surface curvature, 
298 


K 


Kerr effect, 210 
Kubrick, Stanley, 137 


L 

Lab-on-a-Chip (LOC) devices, 620, 
635 

Lag, 107 


Lambert-Beer-Bouguer law, 582 
Laminated LCE film, 628 
Land-groove recording, 515 
Landing aids, 477-478 
Lanthanum-doped lead zirconate 
titanate (PLZT), 629 
Large-format document scanners, 
339f 
Laser(s) 
active remote sensing, 433 
Doppler velocimetry, 432, 451, 
456, 474, 475 
flat-field laser scanner lenses, 
336-337 
lidar technology, 439 
optical disk storage systems, 
517-519 
power modulation recording, 
514 
printers, 341-342 
radar, 432-438, 476f 
remote optical sensing, 432-433, 
451, 484-492 
sounding, 484-492 
Laser Induced Thermal Imaging 
(LITD), 287 
Laser True Airspeed System 
(LATAS), 458-460 
LATAS, see Laser True Airspeed 
System 
L-band operation, 26-27 


LC domains, see Liquid crystal 
domains 
LCDs, see Liquid crystal displays 
LCI, see Low-coherence 
interferometry 
LCLVs, see Liquid crystal light 
valves 
LCOS, see Liquid crystal on silicon 
Leakage, 355, 395 
and defects, 275-276 
Lebedev, Pyotr, 620 
Lenses, 668 
anamorphic, 338f 
astigmatic, 520-521 
optical disk storage systems, 517, 
528-529 
scanners/scanning, 335-337 
Lenticular elements, 308-310, 308f, 
312 
Lidar; see also Remote optical 
sensing 
atmospheric sensing, 444-458 
aviation, 458-467 
backscattering, 392-393 
building blocks, 438-439 
calibration, 439-442 
geoscience, 482-484, 491 
military applications, 471-481 
remote optical sensing, 432-433 
spaceborne, 484-492 
systems, 438-444 
technology, 438-444 
Lidar In-space Technology 
Experiment (LITE), 448, 
485-487, 488f 
Lifetime, 283 
Ligand field interaction, 581 
Light 
beating, 436-438 
collecting extrinsic optical fiber 
sensors, 356-357 
conversion, 562 
detection and ranging, see Lidar 
driven diaphragms, 634-635 
driven microflows, 635-637 
driven single-molecule DNA, 
624 
emitting devices, see Organic 
light emitting devices 
generation, 352-353 
induced elastic CNT, 632-633 
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Light (cont.) 
output coupling, 273 
sources, 584-585, 666-667 
Light intensity modulation (LIM), 
514 
LIM, see Light intensity modulation 
Line broadening effects, 580-581 
Line-of-sight wind component, 451 
Linewidth, 580-581 
Liquid crystal alignment, 230-231, 
231t 
Liquid crystal displays (LCDs), 
137-214 
basics 
display construction, 
149-154 
electrically controlled 
birefringence mode, 
155-161 
passive matrix addressing 
and multiplexing limit, 
161-165 
polarization optics, 154-155 
thin film transistors and 
active matrix addressing, 
165-171 
history, 138-139, 175f 
LCD Master, 158 
market dominance, 173t 
micro-optic displays, 298-314 
modes for potential future 
applications, 206-207 
analog ferroelectric liquid 
crystal modes, 208-209 
blue phase TFT displays, 
209-211 
cholesteric flexoelectricity 
and the uniform lying 
helix mode, 211-213 
liquid crystal lasers, 
213-214 
pi-cell, 207-208 
for nonmainstream and niche 
markets 
bistable nematic LCDs and 
ZBD, 199-203 
cholesteric LCDs, 197-199 
liquid crystal on silicon 
(LCOS), 205-206 
polymer dispersed liquid 
crystals (PDLC), 203-205 


physical properties 
formulating liquid crystal 
mixtures, 145-146 
functional liquid crystal 
compounds, 146-149 
general anisotropic, 142-145 
orientational order, 139-141 
spatial light modulators, 226, 
227-237, 559-560 
standard modes 
dynamic scattering mode, 
174-175 
ferroelectric LCDs, 187-191 
in-plane switching LCDs, 
191-195 
overview, 171-174 
supertwist nematic LCDs, 
185-187 
twisted nematic LCDs, 
175-185 
vertically aligned nematic 
modes, 193f, 195-197, 
196f 
three-dimensional displays, 
298-314 
Liquid crystal light valves (LCLVs), 
229 
Liquid crystal on silicon (LCOS), 
205-206 
Liquid crystals, 139f 
compounds and physical 
properties, 141t 
domains, 627 
lasers, 213-214, 213f 
mixtures, in LCDs, 145-146 
Liquid level sensors, 357 
LITE, see Lidar In-space 
Technology Experiment 
LITI, see Laser Induced Thermal 
Imaging 
LOC devices, see Lab-on-a-Chip 
devices 
Long short range wind 
measurements, 453-456 
Looping back, 42, 43f 
Low-coherence interferometry 
(LCI), 410-421 
measurement geometry, 414, 
415f 
Low-temperature polysilicon 
(LTPS), 170, 278, 285 


LTPS, see Low-temperature 
polysilicon 

Luminance, 281 

Luminescence, 269-289, 583, 590 

LUMO (lowest unoccupied 
molecular orbital), 287 

Lunar laser, 483-484 


M 


Mach-Zehnder interferometers, 
547f, 598 
multiplexed optical fiber 
sensors, 389 
optical fiber sensors, 370f, 372f, 
389 
temporal modulation/ 
processing, 546-551 
MacIntosh Powerbook 100, 172 
Magink, 198 
Magneto-optical (MO) disks, 
512-515, 513f 
Magneto-optical spatial light 
modulators, 226, 227, 247, 
248t 
Magneto-optic effect, 247f 
Management, systems, 664-666 
MANSs, see Metropolitan area 
networks 
Maps/mapping, 482-483, 661t 
Market introduction criteria, 
91-93 
Market share, solar cells, 607 
Matrices 
Benes switching, 39-40 
fast switching, 86-88 
optical information processing, 
556 
optical switching, 86-88 
Maxwell, James, 620 
Mechanical modulators/ 
applications, 557 
Memory effect, 126 
MEMS, see Micro-electro- 
mechanical-systems 
Mesogenic compounds, 148-149, 
148t 
Mesogens, 140 
Mesosphere, 470 
Metal tracking, 288 
Meteor showers, 471, 472f 
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Metro, 35 
part, cost approach, 92 
Metropolitan area networks 
(MANs), 35, 41-47 
advanced technologies, 77-80 
circuit switching technology, 74 
cost approach, 92-93 
multiplexed architectures, 72-73 
multiwavelength, 42f, 44f 
optical add/drop multiplexing, 
42-47, 74, 76 
optical switching solutions, 
73-81 
protection, 42-44 
ring networks, 42 
semiconductor optical 
amplifiers, 78-79 
technologies, 77 
Michelson interferometers, 370f, 
390 
Microbend sensors, 353-355, 395 
Microburst wind shear, 460 
Microcantilever, 637 
beam, 629 
Microcell, 635, 635f 
Micro-Doppler, 475 
Micro-electro-mechanical-systems 
(MEMS), 200, 226, 
237-241, 557, 560 
Microfluidic devices, 635 
Microfluidic mixer, 635 
Microfluidic pump, 623 
Microholography, 534-537, 536f, 
538 
Micromanipulator, 623, 624f, 638, 
639f 
Micro-mechanical slit-positioning, 
240-241 
Micro-mirror spatial light 
modulators, 226, 238-239 
Micro-optics, 298-314, 302f 
Micro-polarizers, 310-311 
display, 300f, 311f 
Microstereo-lithography 
techniques, 623 
Microwaves 
frequency multiplying, 61-62 
generating signals by optical 
frequency multiplying, 
62f 
heterodyning, 61-62, 61f 


network architectures, 60-61 
optics, 548-550 
radar, 493t 
signals by heterodyning, 61f 
Mie scattering, 448, 489, 491f, 575f, 
576 
Military applications 
imaging technology, 475-477, 
481 
lidar, 471-481 
remote optical sensing, 471-481 
Minute blood vessels (mouse 
cancer), HARP camera, 
110f 
Mirrors 
deflectors, 328-329, 330f 
Faraday rotator, 371 
spatial light modulators, 226, 
238-239, 240 
Modeling 
optical air data systems, 
650-652 
remote optical sensing, 489 
systems, 650-653 
MO disks, see Magneto-optical 
disks 
Modularity, 662 
Modulation/modulators; see also 
Spatial light modulators 
amplitude, 474 
code words, 523 
electroabsorption modulators, 
226, 547-548 
extrinsic optical fiber sensors, 
356-357 
Fabry-Perot modulators, 
243, 244 
femtosecond pulses, 253 
future schemes, 28 
optical transmission codes, 7-8 
phase, 243-244 
self-phase modulation, 15-16 
transfer functions, 326 
Molecular photoswitch, 625 
Molecular scattering, 445, 489, 491f 
Monitoring/detection 
chemiluminescence, 590 
fluorescence, 363-364, 368-369, 
399-400 
health and status monitoring, 
677-678 


Monochrome cathode ray tubes, 
115, 120f 
Monocular depth cues, 294-295 
Mont Cenis Academy, 616 
Morthane, 633 
Mounting components, 672-673 
Mouse cancer, HARP camera, 110f 
MQW, see Multiple quantum wells 
Multidimensional devices/ 
processing, 553-554, 
559-563 
Multilayer optical information 
storage, 529-532 
Multilayer recording, 535 
Multimode fiber, 353f 
Multimode microbend sensors, 
353-355 
Multiphoton spectroscopy, 584 
Multiple access PONs, 48-50 
Multiple quantum wells (MQW), 
226, 241, 560 
device structure, 242-243 
modulator, 243f 
MQW SLMs, 244-245 
phase modulation using, 
243-244 
physical background, 242 
Multiple reflections, 598 
Multiplex control, 277 
Multiplexing limit, and passive 
matrix addressing, 
161-165 
Multiplexing/multiplexers; see 
also Networks; Optical 
add/drop multiplexer; 
Wavelength division 
multiplexing 
architectures, 67-94 
backbone networks, 73 
benefits, 72-73 
MANSs, 72-73 
coherence, 383, 389-390, 390f 
core networks, 36-41 
dense wavelength, 56 
frequency-division, 384 
holography, 532-533 
optical fiber sensors, 378-392 
spatial, 379-380, 379f, 380f 
subcarrier access, 48-49, 
54-55 
time-division, 48-49, 380-384 
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Multiplex section shared protection 
ring, 42 

Multiring topology, 74-75, 75f 

Multi-view systems, 311-314, 312f 

slanted arrangement of 

lenticular lens, 313f 

Multi-wavelength fiber -wireless 
networks, 57, 58f, 59f 


N 


N*-SMC*%, 208-209 

Nanocomposites, 633 

hydrogel, 626, 627f 

Nanocontraction, 630 

NA, see Numerical aperture 

NB mode, see Normally black mode 

Near fields, 526, 527-528 

Nematic elasticity, 144f 

Nematic LCD, 230 

mixtures, physical properties 
of, 147t 

Neogenesis blood vessels, 110 


Networks 
access, 47-62 
all-optical solution, 84-85, 
89-90, 91 
architectures, 33-64 
core, 36-41 
first-mile technologies, 47-48, 
A48t 
future, 63-64 
historical overviews, 33-34 
re-allocation, 60, 60f 
telecommunication hierarchy, 
34-36 
topologies, 36 
NF, see Noise figure 
NHK, see Japan Broadcasting 
Corporation 
Niche applications, 171 
Noise; see also Signal-to-noise ratio 
accumulation, 12-13 
amplified spontaneous emission, 
10-13 
bit error rate relationship, 11 
camera technology, 107-108 
detection systems, 657-658 
photo-receiver thermal 
sources, 11 
sources, 10-13, 656-657 


transmission lines, 10-13 
Noise figure (NF), 6 
Non-flatbed scanners, 323 
Nonlinear effects, 15 
Nonlinear signal distortions, 15 
Non-return-to-zero (NRZ) scheme, 
7-8, 525 
Non-zero-dispersion-shifted fibers, 
25-26 
Normally black (NB) mode, 175 
Normally white (NW) mode, 175 
Novel funnel shape, 121f 


Novel technologies 
information storage/ 
recovery, 526-529, 
526f, 537-539 
spatial light modulators, 
250-252 
Nozzle coating, 287 
NRZ scheme, see Non-return to 
zero scheme 
Numerical aperture (NA) 
optical information storage/ 
recovery, 517, 528-529 
Numerical optical processing, 
543-544 
NW mode, see Normally white 


mode 

O 

OADM, see Optical add/drop 
multiplexer 

OADS, see Optical air data 
systems 


Objective lens, 517 

Objective scanning, 322 

Obstacle avoidance, 478-479 

OCB mode, see Optically 
compensated bend mode 

Ocean shelf airborne mapping, 
482-483 

OCT, see Optical coherence 
tomography 

Oculomotor depth cues, 294-295 

OFDR, see Optical frequency 
domain reflectometry 

OLEDs, see Organic light emitting 
devices 

One-dimensional devices 

analogue processors, 554-557 


electro-optic modulators/ 
applications, 557 
mechanical modulators/ 
applications, 557 
optical information processing, 
551-557 
spatial light modulators, 
551-557 
On-line thickness measurements, 
419-421, 421f 
On-off keying (OOK) modulation, 
ll 
Open-circuit voltage, 605, 606f 
Open-loop operation, 376 
Operators, interfaces, 655, 661-662 
Opposed discharge, 126f 
Optical add/drop multiplexer 
(OADM), 45f 
crosstalk, 45 
fixed/dynamic, 76f 
looping back, 42, 43f 
metropolitan area networks, 
42-47, 74, 76 
self-healing, 42, 43f 
structure of, 78f 
tunable OADM using fiber 
Fabry-Perot filter, 45f 
two-stage for handling 
wavelength groups, 46f 
using fiber Bragg grating (FBG), 
45f 
Optical air data systems (OADS), 
460, 650-652 
specification for, 648, 649t 
Optical bounce effect, 207 
Optical coherence tomography 
(OCT), applications of, 
410-421 
Optical communication, 252-253 
Optical compensation, 179-182 
Optical crossconnects (OXC), 
252-253 
Optical data processing, 254 
Optical detectors, 586-588 
Optical disk storage, 507-526, 516f 
lenses, 517, 528-529 
three generations of, 525t 
Optical dispersion, 574f, 585-588 
Optical domain spectroscopy, 436 
Optical fibers 
communication, 349 
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current sensors, 377-378 
leads, 594-596 
Mach-Zehnder interferometers, 
547-551 
overview, 670-672 
questions, 671t 
sensors, 347-422 
current, 377-378 
distributed, 378, 392-410 
intensity-based, 352-369 
interferometric, 369-378, 
383, 389, 404-407, 
410-421 
low-coherence 
interferometry, 410-421 
multiplexed, 378-392 
temporal modulation/ 
processing, 546-551, 
548-551 
Optical frequency domain 
reflectometry (OFDR), 
401-402 
Optical gradient forces, 621 
Optical heads, 515-516 
Optical information 
processing, 541-564 
applications, 546-563 
optical devices, 546-563 
spatial light modulators, 
551-557, 559-563 
wavelength processing, 
557-559 
storage/recovery, 505-539, 507t 
disk systems, 515-526 
disk technologies, 507-515 
holography, 532-537, 534 
novel approaches, 526-529, 
526f, 537-539 
spatial light modulators, 563 
three-dimensional, 529-539 
Optical intensity, 352-355, 352f, 588 
Optical interconnects/optics, 
252-253, 544-546, 550, 
562-563, 611f 
Optically-addressed spatial light 
modulators (O-SLM), 
244, 559 
Optically compensated bend (OCB) 
mode, 207 
Optical microactuators, 619-640 
Optical nanoactuators, 619-640 


Optical network 
access networks, 47 
architectures, 33-64; see also 
Networks 
ATM PON, 50-53 
core networks, 36-41 
dense wavelength multiplexing, 
in access networks, 56 
dynamic capacity allocation, 
56-60 
Ethernet PON, 53-54 
hybrid fiber coax networks, 
54-56 
metropolitan area networks 
(MAN), 41-47 
microwave signals over fiber, 
60-62 
multiple access PONs, 48-50 
network protection, 42-44 
optical add/drop multiplexing, 
44-47 
optical signal multiplexing 
techniques, 36-39 
telecommunication hierarchy, 
34-36 
traffic routing, 39-41 
Optical packets, 70-71 
routing, 41f 
types, 71f 
Optical porosity, 448 
Optical pump pulses, 407-408, 407f 
Optical reach, 9-10 
Optical regeneration, 88-90 
Optical response, OLED devices, 
271f, 272-273 
Optical rings, 74, 75, 77, 77f 
Optical scanning, 321-342 
classification 
by deflector position, 
322-323 
by image geometry, 323-324 
by scan pattern, 324-326 
contact-type array scanner 
optics, 335 
deflectors 
error correction, 337 
galvanometer mirror, 
328-329 
holographic deflectors, 
329-332 
optoelectronics, 332-335 


polygon mirror, 329 
desktop laser printer, 341-342 
flat-field laser scanning lenses, 

336-337 
input systems 
CCD document scanner, 
339-340 

flying spot scanners, 340 
performance specification 

radiometry, input/output 

systems, 327-328 

resolution, 326-327 

scan duty cycle, 327 
reduction-type array imaging 

lens, 335 
systems, 340 
Optical sensors, 550 
Optical signal multiplexing, 
36-39 
Optical signal-to-noise ratio 
(OSNR), 11, 20-21 
Optical storage system, channel 
data flow in, 523f 
Optical switching 
backbone networks, 81-91 
benefits of multiplexed 
architectures, 72-73 
future, 69-71 
granularities, 70-71 
MANs, 73-74 
market introduction criteria, 
91-93 
matrices, 86-88 
and multiplexed architectures, 
67-94 
next generation, 69-71 
performance, 90-91 
positioning techniques, 69-71 
purpose, 69-70 
specificity of metro and 
proposed solutions, 73-81 
wavelengths, 70 
Optical synchronization, 88 
Optical time division multiplexing 
(OTDM), 37-38 
Optical time-domain reflectometers 
(OTDR), 379, 381, 382f, 
393-395, 400 
Optical to electrical energy 
conversion, 562, 601-616 
Optical train, 663 
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Optical transmission, 3-29 electrical response, 270-272 expected for packet approach, 
amplifier noise, 10-11 optical response, 272-273 80 
bit error rate, 10-13 passive matrix displays, of grating light valve, 241t 
design of optical WDM system, 274-277, 275f magneto-optic SLM, 248t 
20-21 Organic photovoltaic (OPV), 612 optical switching, 90-91 
detection codes, 7-8 Orientational elasticity, 143 of PA, a-Si LCLV, 237t 
future, 22-29 Orientation, in LCDs, 139-141 scanning, 326-328 
general structure, 7-10 OSNR, see Optical signal-to-noise SLM specifications, 228t 
high capacity lines, 29t ratio trade-offs, 255-256 
historical overviews, 5-7 OTDM, see Optical time division WDM systems, 20-21 
limitation, 10-20 multiplexing Perovskite solar cells, 611, 612f 
modulation codes, 7-8 OTDR, see Optical time-domain Persistent spectral hole burning 
noise accumulation, 12-13 reflectometers (PSHB), 528, 538 
noise sources, 10-13 Output scanning systems, 340-342 PET, see Polyethylene terephthalate 
Raman amplification future, OXC, see Optical crossconnects Phase change media, 509-510, 515 
24-25 Ozone differential absorption lidar, | Phase-induced carrier method, 372 
structure, 7-10 469 Phase modulation, 243-244 
ultra-long haul technology, Phosphor screens, 115, 116-117, 118f 
23-24 P Photoacoustic spectroscopy, 
Optical trapping, 622 589-590 
Optical tweezing, 622-623 Packet format selection, 83 Photo-activated liquid crystal 
Optics Packet mode receivers, 80 devices, 236-237 
definition, 667 Packet routing, 39-41 Photoconductive camera tube, 
scanners/scanning, 335-337 Page-oriented holographic 97-110, 99f 
Opto-thermic magnetic effects, memories, 532-534, Photocount, 441f, 442 
512-515 538-539 Photomechanical response of 
OPV, see Organic photovoltaic Parallax barriers, 304-308 materials, 625-633 
Orbitals, 578 Passive matrix addressing, 172 Photomultiplier tube (PMT) 
Organic and hybrid solar cells, and multiplexing limit, detectors, 591, 674 
611-612 161-165 Photon correlation 
Organic electroluminescent Passive matrix displays, 152, 153f, spectroscopy (PCS), 
displays, 269-289 274-277, 275f 576, 593-594, 596 
Organic layers, 286-287 Passive matrix OLED (PMOLED), Photon-driven nanoactuators, 
Organic light emitting devices 281 624-625 
(OLEDs), 174, 269-289 Passive mixers, 636 Photon-generated electrons, 620 
display fabrication, 283 Passive optical networks (PONS), Photonic bandgaps (PBG) 
cathode and encapsulation, 47-54, 49f, 51f, 52f materials, 226, 250-251 
287-288 PBG, see Photonic bandgaps Photoreceiver, 11 
organic layers, 286-287 PCS, see Photon correlation Photorefractive effect, 248f, 249f, 
substrate stack, 284-285 spectroscopy 361 
display performance PDLC, see Polymer dispersed liquid  Photorefractive spatial light 
dynamic performance and crystal modulators, 226, 247-249 
3D, 282-283 PDPs, see Plasma display panel Photostimulated effect, 630 
static image performance, Peak power, 383 Photostimulated LC polymer, 628 
281-282 PECVD, see Plasma-enhanced Photostimulated vibrations, 630 
system performance, 283 chemical vapor Photostrictive ferroelectric 
display technology deposition materials, 629-631 
active matrix displays, PEN, see Polyethylene naphthalate Photothermal actuator, 638 
277-281 Perceived depth, 293-298, 297f, Photothermal effect, 636 
aging and burn-in, 273-274 314-317 Photothermal interaction, with 


display driving, 274-281 Performance, 90-91 materials, 633 
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light-driven diaphragms, 
634-635 
light-driven microflows, 
635-637 
photothermal vibration of 
optical fibers, 639 
temperature-induced shape- 
changing metal alloys, 
637-639 
Photothermal spectroscopy, 
589-590 
Photothermal vibrations, 633, 
637 
of optical fibers, 639 
Photovoltaics (PV), 601-616; see 
also Solar cells 
effects, 629 
Physical intensity-based optical 
fiber sensors, 352-364 
Pi-cell, 207-208, 208f 
Pictograms, 621 
Pictorial/empirical cues, 294 
Picture quality, 102-103 
Piezoelectric effects, 629 
PixCon, 314 
Pixels, 478 
Planck’s constant, 621 
Plasma display panels (PDPs), 


121-134, 172 
cell structure, 129-130 
color AC 


features of, 126-127 
operating principle of, 
124-126 
principle of 
three-electrode, 127 
delta arrangement, 133f 
development, 121-124 
diagonal color, 125f 
drive systems, 132 
electrode configuration of three 
electrode, 127f 
fabrication, 127-129, 128f 
future prospects of, 132-133 
gradation, 130-132 
improvement of cell structure, 
129-130 
improvement of drive 
systems, 132 
luminance model of each cell 
in, 125f 


practical panel structure and 
fabrication process, 
127-129 
structure, 127-130 
with surface discharges, 
development of, 123t 
Plasma-enhanced chemical vapor 
deposition (PECVD), 170 
Plasmon resonance, 351 
Plastics, as substrate, 284 
Pleochroic dyes, 148 
PMD, see Polarization mode 
dispersion 
PMT, see Photomultiplier tube 
PNIPAM gel, 626 
Point-to-point 
communication, 252 
Polar diagrams/plots, 452f, 575 
Polarization 
fading, 370 
interferometric optical fiber 
sensors, 370, 378f 
optical time-domain 
reflectometers, 395 
polarizers, 310-311 
Polarization mode dispersion 
(PMD), 4, 20 
Polarization optical time- 
domain reflectometry 
(POTDR), 395 
Polarization optics, 154-155 
Gooch-Tarry-type dependence 
of, 186 
Polarizers, 154 
Polarizing film, 150 
Polar MO Kerr effect, 513 
Polar solvents, 581 
Pollution studies, 467-471 
Poly(vinylidene difluoride) (PVDF), 
630 
optically driven leg actuator, 
631f 
Polycrystalline devices, 278 
Polycrystalline silicon (p-Si), 170 
Polyethylene naphthalate (PEN), 
284 
Polyethylene terephthalate (PET), 
284 
Polygon mirror deflectors, 329, 330f 
Polyimide micro objects, 622 
Polymer carbon nanotube, 633 


Polymer dispersed liquid crystal 
(PDLC), 203-205, 204f 
Polymers, LCDs, 234, 310-311 
PONSs, see Passive optical networks 
Porosity, 448 
Post-detection spectroscopy, 436 
Post-objective scanning, 322f, 323, 
324 
POST, see Power on system test 
POTDR, see Polarization 
optical time-domain 
reflectometry 
Power 
consumption, 276f, 283, 563 
supplies, 676-677 
systems, 662-663, 667 
wavelength, 667 
Power on system test (POST), 677 
Pre-objective scanning, 322, 322f 
Pressure 
error calibration, 458-462 
sensitive cable, 354f, 395f 
Printers/printing, 321, 341-342 
Prisms, 585-586 
Programmable optics applications, 
253-255 
Projection cathode ray tubes, 
115-116 
Projection displays, 115-116 
Projectors, 206 
Propagation 
delay sensors, 355-356, 356f 
signal distortions and, 13-20 
Protection/protecting networks, 
42-44, 51-52 
Prototype target, 103f 
Provision of color, in LCDs, 154 
Proximity detector, 358f 
Proximity/distance sensors, 357 
PSHB, see Persistent spectral hole 
burning 
PSTN, see Public switched 
telephone network 
Public switched telephone network 
(PSTN), 35 
Pulse lengths, 442 
Pulse lidar systems, 453 
Pulse width modulation (PWM), 276 
Pumping 
pump pulses, 407-408, 407f 
Raman fiber amplifiers, 24-25 
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Purity, cathode ray tubes, 118 

Push-pull tracking, 521-522 

PVDF, see Poly(vinylidene 
difluoride) 

PV, see Photovoltaics 

PWM, see Pulse width modulation 


Q 


QSXGA monitor, 169 
Quadratic detectors, 10 
Quadrature points, 371 
Quantum wells 
multiple, 226, 241-245, 560 
spatial light modulators, 560 


R 


Radar, 432-438, 460, 476f, 556 
Radiation; see also Electromagnetic 
radiation; Guided 
radiation 
dosimeters, 355, 355f, 394f 
spectra, 604 
Radiometry, scanning, 327-328 
Raman amplification, 24-25 
Raman fiber amplifier, 24f, 25f 
Raman gain, 19 
Raman scattering 
chemical optical fiber sensors, 
368-369 
in-fiber, 595 
lidar, 467, 469-470, 470f 
stimulated, 6, 19-20 
Raman spectroscopy, 583-584, 
592-593 
Raman thermometry, 397-399 
Range 
perceived depth, 316 
rangefinders/rangefinding, 
473-475 
remote sensing, 442-444, 485t, 
491 
resolution, 256, 442-444 
speed trade-offs, 256, 257-259 
Range-Doppler processing, 556 
Rare earth-transition metal alloys, 
513 
Rawinsonde wind speed, 454, 455f 
Rayleigh scattering, 574-576 
backscattering, 395-397 


distributed optical fiber sensors, 
402-403 
remote optical sensing, 
489-490, 491f 
RCA, 137, 165 
Reactive mesogens, 149 
Read-only-memory (ROM) media, 
506, 507, 508-510, 515 
Recordable storage media, 506, 
508-510, 511 
Recovery, information, 505-539, 
658-660 
Reduction-type array imaging lens, 
335 
Referencing, 359-361 
Reflection, 577 
Reflective sensors, 381 
Reflective spatial light modulators, 
227, 234-236 
Refraction, 577-578 
Refractive index, 420f 
biaxiality, 188 
coherence interferometric 
sensors, 414-417 
measurements of contact lenses 
in solution, 414-415 
Regeneration, 85 
Relative depth judgement, 298 
Remote optical sensing; see also 
Lidar 
aviation, 458-467 
chemical studies, 467-471 
environmental studies, 
467-471 
geoscience, 482-484, 491 
lasers, 432-433, 451, 484-492 
military applications, 471-481 
pollution studies, 467-471 
scattering, 445-449, 456, 467, 
469-470, 484-492 
Remote spectroscopy, 366f 
Repeaterless systems, 9, 21 
Requirements, systems, 648-650 
Resolution, 281-282 
camera technology, 107 
limit, 527-528 
perceived depth, 315 
scanning, 326-327 
sensing, 402-403, 410-421 
speed trade-offs, 256 
Resonance 


absorption, 432 
differential absorption lidar, 
432, 467-469, 485t, 
490-491 
frequency, 637 
surface plasmon resonance, 598 
Response time, 272 
Retardation, 177 
Return to zero (RZ) codes, 7-8 
Reverse leakage, 272 
Rewritable storage media, 506, 
508-510, 511, 512-515 
Riding-the-ray, 653 
Ring networks, 42 
Ring topologies, 74, 75, 77 
Risk reduction, 655-656 
Risley prisms, 326 
RLL channels, see Run-length 
limited channels 
Rocketry, 477-478 
ROM media, see Read-only- 
memory media 
Run-length limited (RLL) channels, 
524 
RZ codes, see Return to zero codes 


Ss 


SABLE, see South Atlantic 
Atmospheric Backscatter 
Lidar Experiments 

Safety, cathode ray tubes, 119 

Sagittal depth, 414-415 

Sagnac effect, 373 

Sagnac loop interferometers, 378f, 
404-407, 405f, 406f 

Sampled servo scheme, 522 

Samsung, 196 

KN55S89, 283 

Sandwich construction, 560 

SAR, see Synthetic aperture radar 

Satellite laser ranging (SLR), 
483-484 

Saticon, 101f 

S-band, 7 

SBE, see Supertwist birefringence 
effect 

SBS, see Stimulated Brillouin 
scattering 

Scanners/scanning, 321-342 

classification, 322-326 
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configuration, 322-326 
deflectors, 328-335, 337 
duty cycles, 327 
electron beams, 100 
lenses, 335-337 
optics, 335-337 
optoelectronic deflectors, 
332-335 
pattern classification, 324-326 
performance specifications, 
326-328 
system configuration, 322-326 
Scattering, 574-577; see also Raman 
scattering; Rayleigh 
scattering 
aerosols, 446-448, 456, 485- 
487, 489, 489-490f, 491f 
backscattering, 392-393, 
400-401, 445, 447-448, 
485t, 489, 490 
Brillouin, 400-401, 408-410 
elastic, 402-403 
Fresnel, 576 
geometric, 576 
inelastic, 570 
Mie, 448, 489, 491f, 575f, 576 
molecular, 445, 489, 491f 
polar diagrams, 575 
remote optical sensing, 445- 
449, 456, 467, 469-470, 
484-492 
Schroeder-Van Laar equation, 146 
SCMA (subcarrier multiplex 
access), 48-49, 54-55 
passive optical network, 49f 
SDI, see Strategic Defence Initiative 
Secchi disk depths, 483; see also 
Ethernet PON 
SEED, see Self-electro-optic effect 
device 
Selection criteria, 91 
Self-correlation functions, 411 
Self-electro-optic effect device 
(SEED), 544 
Self-healing, 42, 43f 
by looping back at OADMs, 43f 
Self-phase modulation (SPM), 
15-16 
Self-tests, 677 
Semiconductor optical amplifiers 
(SOA), 69-70, 79f, 80f 


based matrix, 86-88, 87f 
gate array modules, 86, 88f 
for gating, 78-79 
MANs, 78-79 
optical matrix-based on, 87f 
technology feasibility, 86-88 
Sensing/sensors; see also Optical 
fibers, sensors; Remote 
optical sensing 
grating sensors, 361, 386, 410 
imaging, 101-102 
interferometry, 369-378, 383, 
389, 404-407, 410-421 
optical sensors, 550 
referencing, 359-361 
spectral filtering, 361, 363 
temporal modulation/ 
processing, 550 
Sensitivity 
camera technology, 102-103, 106 
sensing, 373-378 
speed trade-offs, 256 
Serrodyne waveforms, 375f, 376 
Servo systems, 519-522 
Shadow masking, 116, 117f, 286 
Shape-changing polymers, 626-629 
Shape memory, 633 
Shape memory polymers (SMP), 
619-620 
Sharp, 138, 165, 196 
Shockley-Queisser detailed balance 
limit, 603-606 
Short-circuit current density cells, 
604, 605, 606f 
Short period gratings, see Bragg 
gratings 
Short-range wind measurements, 
449-453 
Signals 
distortions, 13-20 
estimation, 656-658 
information retrieval, 659-660 
multiplexing, 36-39 
noise sources, 656-657 
propagation, 13-20 
strength, 439-442 
Signal-to-noise ratio (SNR) 
intensity-based optical fiber 
sensors, 357-359 
remote optical sensing, 439- 
442, 440-441, 444 


Silicon (Si) 
microcantilever, 630 
solar cells, 608-610 
SIL, see Solid immersion lens 
Single-facet hologon deflectors, 332f 
Single-LCD front parallax barrier 
design, 306f 
Single liquid crystal displays 
holographic elements, 311 
lenticular element designs, 
308-310 
micro-polarizer designs, 
310-311 
parallax barrier designs, 
304-308 
Single-mode standard fiber (SSMF), 
25 
Single polarization (SP), 37 
Single polarizer TN LCDs, 182-184, 
183f, 184t 
Singulation, 154 
Si-Schottkydiode LC SLM, 227f 
Slit-positioning, 240-241 
SLMs, see Spatial light modulators 
Slot Coating, 287 
SLR, see Satellite laser ranging 
SMA Ni-Ti, 637 
Smartphone (touchscreen), 417, 418f 
Smart-pixel spatial light 
modulators, 249 
Smart routers, 81-82, 82f 
Smectic liquid crystal devices/ 
displays, 233-244 
SMP, see Shape memory polymers 
SNCP, see Sub-network connection 
protection 
SNR, see Signal-to-noise ratio 
SOA, see Semiconductor optical 
amplifiers 
SOCE, see Synthesis of optical 
coherence function 
Sodium measurements, 470 
Software, systems, 660 
Solar cells, 601-616 
absorption coefficients, 608, 610f 
applications, 602t 
circuits, 603f 
crystalline silicon, 608-610 
design issues, 612-616 
dye-sensitized, 611 
efficiency, 605, 605f, 609f 
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Solar cells (cont.) 
energy availability, 613, 614f 
Gratzel type, 611 
grid-connected systems, 
615-616 
interconnected modules, 611f 
I-V characteristics, 602-603 
market share, 607 
open-circuit voltage, 605, 606f 
organic and hybrid, 611-612 
perovskite, 611, 612f 
principles, 602-607 
production, 602f 
Shockley-Queisser balance 
limit, 603-606 
short-circuit current density, 
604, 605, 606f 
sizing curve, 615 
stand-alone systems, 614-615 
technologies, 607-612 
thin film type, 610-611 
UNSW PERL type, 610f 
Solar radiation, 604, 613 
Solid earth studies, 490-491 
Solid immersion lens (SIL), 528-529 
Solid-state spatial light modulators, 
226, 245 
Solution processing, 286-287 
South Atlantic Atmospheric 
Backscatter Lidar 
Experiments (SABLE), 
446-448 
Space-based systems, 490-491 
Spaceborne Doppler wind lidar, 489 
Space integrating acousto-optical 
correlators, 554-557 
Spacers, 153 
Spacing channels, 27-28 
Spatial coherence, 518 
Spatial light modulators (SLMs) 
applications, 226, 253-254 
characterization parameters, 
227, 227t 
definition, 226-227 
limitations, 255-256 
liquid crystal devices, 226, 
227-237, 561 
main characterization 
parameters of, 227t 
mirrors, 226, 238-239, 240 
novel devices, 250-252 


optical information processing, 
551-557, 559-563 
optically-addressed, 244, 559 
performances, 227-249 
physical mechanisms, 227, 228t 
physical mechanisms, 228t 
specifications and performance 
parameters, 228t 
technology, 225-259 
trade-offs, 255-256 
types, 227-249 
Spatial localization, 298 
Spatial multiplexing, 379-380, 379f, 
380f 
Specialized optoelectronic 
spectroscopy, 596-597 
Specifications, 648 
Speckle, 434, 435f 
Spectral efficiency, 27-28 
of optical systems, 28f 
Spectral filters, 361-363 
Spectral hole burning, 528-529 
Spectral regions, 572 
Spectral response, 107 
Spectrometers 
components, 584-588 
detection, 584f 
light sources, 584-585 
optical dispersion, 585-588 
optical intensity 
instrumentation, 588 
Spectrophotometers, 588-589 
Spectroscopy, 569-599 
case studies, 597-599 
coherent heterodyne, 436-438, 
440-442, 445-446, 453, 
454 
direct optical, 365-369 
fluorescence, 579t, 582-583, 
590-592 
frequency domain, 436 
heterodyne, 436-438, 440-442, 
445-446, 453, 454 
instrumentation, 584-599 
methods, 584-599 
multiphoton, 584 
optical domain, 436 
photoacoustic, 589-590 
photon correlation, 593-594, 
596 
photothermal, 589-590 


post-detection, 436 
radiation/matter interaction, 
573-584 
Raman, 583-584, 592-593 
remote, 366f 
specialized optoelectronics, 
596-597 
theory, 570-584 
time-domain, 436 
Spectrum analysers, 554-557 
Speed trade-offs, 255-256, 257-259 
Spherical aberration, 517 
Spin coating, 286 
Splitters, 546-548 
SPM, see Self-phase modulation 
Spontaneous Brillouin 
backscattering, 400-401 
Spontaneous noise factor, 10 
Spot size, 517-518, 528-529 
SP, see Single polarization 
SRS, see Stimulated Raman 
scattering 
SSME, see Single-mode standard 
fiber 
Stability, systems, 667 
Stand-alone systems, 614-615 
Star Wars, 472, 479, 481 
State-of-the-art LCD, 174f 
Static image performance, 281-282 
Stereo acuity, 295-296, 296f 
Stereopsis, 294, 295 
Stereoscopic resolution, 316, 316f 
Stereoscopic systems, 299-300, 311, 
312f 
Stimulated Brillouin scattering 
(SBS), 6, 18-19 
Stimulated Raman scattering (SRS), 
6, 19-20 
Stokes Raman thermometry, 
397-399 
Storage 
area networks, 35 
density, 519f 
optical disk systems, 515-526 
optical disk technologies, 
507-515 
optical information, 505-539 
spatial light modulators, 563 
Strain profiles, 410 
Strategic Defence Initiative (SDD, 
472, 479, 481 
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Stray light, 668-669 
Structure 
plasma display panels, 127-130 
spatial light modulators, 
560-561 
systems, 664 
Sub-carrier multiplexing, 48-49, 
54-55, 55f, 386-388 
Sub-network connection protection 
(SNCP), 42 
Substrate stack, 284-285 
Success 
factors, 650f 
hierarchy, 648, 649t 
Super Audio Compact discs, 511 
Supertwist birefringence effect 
(SBE), 186, 187f 
Supertwist nematic LCDs, 185-187, 
186f 
Surface discharge, 124f 
Surface material perception, 298 
Surface mount technology, 154 
Surface plasmon resonance, 598 
Surface profiling, 418-419, 420f 
Surface stabilized bistable FLC 
mode (SSFLC), 187 
Surface stabilized ferroelectric LCD 
(SSFLCD), 172, 174f 
Surface stabilized FLC (SSFLC), 
233 
Switches/switching; see also Optical 
switching 
Benes matrix, 39-40 
cross-bar, 562-563 
fast matrix, 86-88 
matrices, 39-40, 86-88 
optical packets, 70-71 
wavelengths, 46, 70 
Synchronization, 84 
Synchronous Optical NETworks, 5 
Synthesis of optical coherence 
function (SOCE), 402 
Synthetic aperture radar (SAR), 
556 
Systems 
adjustment, 654t 
alignments, 654t 
assembly methods, 653-654 
calibration, 654, 677-678 
components, 666-677 
design, 647-683 


documentation, 664-666 
environment considerations, 
678-679, 680t 
implementation, 647-683 
information retrieval, 658-660 
interfaces, 660-664 
management, 664-666 
modeling, 650-653 
noise detection, 657-658 
OADS components, 652f 
optical train, 663 
performance, 283 
physical structure, 664 
possible effect matrix, 681t 
power, 662-663, 667 
requirements, 648-650 
risk reduction, 655-656 
signal estimation, 656-658 
software, 660 
stability, 667 
structure, 7-10 
testing, 677-678 


T 


TAB-mounted drivers, 154 
Tachometer, 360f 
TAKT time, 152 
Target imaging, 475-477 
Target marking, 473-475 
Target types, 100-101, 103, 105 
TBW product, see Time-bandwidth 
product 
TDFA, see Thulium-doped fiber 
amplifier 
TDMA passive optical network, 49f 
TDM, see Time-division 
multiplexing 
TDWR, see Terminal Doppler 
Weather Radars 
TEA lasers, 454 
Technologies 
feasibility, 77, 86-90 
identification, 77-80 
selection criteria, 91 
solar cells, 607-612 
TechWiz, 158 
Telecentricity, 336 
Telecommunications, 34-36, 349 
evolution of services, 34f 
hierarchy of, 35f 


Temperature 
induced shape-changing metal 
alloys, 636-639 
profiles, 410 
strain and gas sensors, 197-198 
Temporal modulation/processing, 
546-551 
Terminal Doppler Weather Radars 
(TDWR), 460 
Terrain airborne mapping, 
482-483 
Terrain following, 478-479 
TES, see Track error signal 
TFT, see Thin-film transistor 
Thermal conductivity, 634 
Thermal diffusivity, 634 
Thermal evaporation, 286 
Thermal imaging, 493t 
Thermometer, 364f 
Thermometry, 397-399 
Thermo viscous effects, 633 
Thickness profiling/measurements, 
418-419 
Thin-films 
micro-mirror arrays, 240 
solar cells, 610-611 
Thin-film transistor (TFT), 150, 
165-167, 277-280 
and active matrix addressing, 
165-171 
construction and operation of, 
166f 
fabrication of, 170 
Three-beam method, 522 
Three-dimensional displays 
applications, 314-317 
depth perception, 293-298, 
314-317 
micro-optics, 299-314 
perceived depth, 293-298, 
314-317 
performance, 314-317 
Three-dimensional optical 
information storage, 530f 
bit-oriented holographic storage, 
534-537 
holographic data storage, 532 
multilayer optical information 
storage, 529-532 
page-oriented holographic 
memories, 532-534 
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Three-electrode plasma display 
panels, 127, 128 
with waffle rib, 130f 
Three-electrode surface discharge, 
123, 128 
Thulium-doped fiber amplifier 
(TDFA), 7 
Time-bandwidth (TBW) product, 
552 
Time-delay sensors, 355-356 
Time-division multiplexing (TDM), 
38f, 48-49, 49f, 380-384, 
381f 
Time-domain fluorescence 
monitoring, 399-400 
Time-domain optical information 
storage, 528-529 
Time-domain spectroscopy, 436 
Time-frequency transformations, 
556 
Timing principles, 435 
TIR, see Total internal reflection 
T-micromixers, 636 
TN LCD, 182-184, 183f, 184t, 203 
Topologies, networks, 36 
Total display resolution, 314-315 
Total internal reflection (TIR), 577- 
578, 582, 589f, 595-596 
Total transmitted bit rate, 27 
Touchscreen, 417, 418f 
TPP, see Triple-product processor 
Track error signal (TES), 521-522 
generated by push-pull method, 
522f 
Tracking, 521-522 
Track numbers, 524 
Trade-offs, 20-21 
Traffic profile evolution, 90-91 
Traffic routing, 39-41 
Transient effects, 283 
Trans-isomers, 149 
Transmission; see also Optical 
transmission 
network history, 5-7 
optical fiber sensors, 365-368 
Transmissive sensors, 381 
Transverse modes, 518 
Triphenylmethane leuco 
derivatives, 626, 627 
Triple-product processor (TPP), 
555-556 


True airspeed, 458-462 
Tunable lasers, optical matrix- 
based on, 87f 
Tunable source, feasibility, 88 
Turbidity, 365-368 
Turbulence, 458-462 
Tweezer, light-driven, 637, 638f 
Twin-liquid crystal displays, 
301-302, 302f 
Twisted nematic (TN) LCD, 137, 
175-176 
single polarizer reflective mode 
TN, 182-185 
on state and optical 
compensation, 179-182 
off state and TN design, 176-179 
Two-dimensional spatial light 
modulators, 559-563 
Two-photon excitation/recording, 
530-531, 531f, 538-539 
Two-view liquid crystal displays, 
301-302, 301f 
Tyndall scattering, 140 


U 


Ultrahigh sensitivity 
photoconductive camera 
tubes, 101-102 

Ultra-long haul technology, 23-24 

Ultraviolet (UV) radiation, 
619-620 

Uniform standing helix, 212 

UNSW PERL solar cells, 610f 

UV radiation, see Ultraviolet 
radiation 


Vv 


Vacuum tubes, 113-121 
beam index CRT, 119 
color CRT, 116-118 
contrast, 118 
CRT structure and operation, 

114-115 

flat CRT, 119 
monochrome CRT, 115 
projection CRT, 115-116 
recent developments, 119-121 
safety, 119 

VanderLugt correlator, 562 


VAN modes, see Vertically aligned 
nematic modes 
Vapor phase deposition, 286 
Variable/fixed optical packet 
comparison, 72 
VCSEL-Si-based smart pixel, 250f 
Velocity 
Doppler velocimetry, 432, 451, 
456, 474, 475 
remote optical sensing, 451 
resolution, 444 
velocimetry, 432, 451-454, 456, 
474, 475 
wake vortices, 464 
wind, 451 
Vertically aligned nematic (VAN) 
modes, 181, 193f, 195-197, 
196f 
Vibrometry, 474, 475 
Vidicon camera tube, 98 
Viewing angle, 282 
Viewing position indicator (VPI), 
306 
display, 307f 
Viewing windows, 302-304, 303f, 
312 
Viscosity, 144-145 
Visibility curves, 456 
Vogel-Fulcher-Tammann 
equation, 145 
Volcanic activity, 489 
Voltage aging, 273 
Voltage programming, 279-280 
Volume diffraction gratings, 558 
Volume holography, 532-533, 
538-539, 563 
Vortices/vorticity, 462-467 
Voxels, 315 
VPI, see Viewing position indicator 


Ww 


Wake vortices, 462-467 
“Warped capacitor” microactuator, 
631, 632f 
Water vapor measurements, 469, 470£ 
Watt peak, 604 
Waveguides 
Mach-Zehnder interferometers, 
546-551 
splitters, 546-548 
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Wavelength division multiplexing (WDM), 3-29, 37, Wind 
37f, 49, 50f generators, 453 
40G systems, 26-29 infrared Doppler lidar, 462 
amplified communication links, 8-9 lidar, 462, 485t, 487, 489, 491f, 492f 
basic architecture, 8-9 remote optical sensing, 449-457, 458-462, 
capacity/distance future, 22-23 477-478, 487-490 
design, 20-21 shear, 458-462 
dimension, 72, 73 speed, 449-457 
enhancement G.983.3, 53f velocity, 451 
forward error-correcting codes, 23 Windows, properties, 669-670 
future technology, 22-29 WINDSAT, 487 
global performance, 20-21 Windshields, 415-417, 416f 
optical fiber sensors, 384-386, 385f Wireless local area networks (WLANs), 60-62 
passive optical networks, 47 Wiring, power, 662-663 
structure of amplified optical system, 8f WLANs, see Wireless local area networks 
technology, 22-29 WORM technology, see Write-once read-many 
temporal modulation/processing, technology 
550-551 Write-once read-many (WORM) technology, 507, 508 


wavelength processing, 557-559 
Wavelengths, 667 


channels, 59-60, 60f x 
conversion, 255 XPM, see Cross-phase modulation 
dynamic capacity allocation, 57 
image converters, 255 Y 
optical switching, 46, 70 
paths, 39-41, 39f Y-micromixers, 636 
processing, 550-551, 557-559 Yokohama, 623, 624 
selector structure, 80, 80f Young interferometer, 598 
WDMA passive optical network, 50f 
WDM, see Wavelength division multiplexing Zz 
Weather forecasting, 457 
Weiner-Khintchine relationship, 659 ZBD Displays Ltd., 200, 203 
White-to-black luminance ratio, 281 ZBD, see Zenithal bistable display 


Williams, Richard, 174 Zenithal bistable display (ZBD), 145, 199-203, 202f 


